ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-21 14:26:30 +00:00

Files

Santosh Bhavani 8fafc8af77 ml/backend/ggml: NVML fallback for unified memory GPUs (#12619 )

* Simplify NVML fallback for unified memory GPUs

Remove device-specific checks and environment variable dependency for
NVML_ERROR_NOT_SUPPORTED fallback. When NVML doesn't support memory
queries, unconditionally use /proc/meminfo instead of checking device
names or OLLAMA_UNIFIED_MEMORY environment variable.

This provides better memory reporting by using MemAvailable which
accounts for reclaimable memory, avoiding the underreporting issue
described in NVIDIA support article a_id/5728.

Tested on NVIDIA GB10 unified memory iGPU with consistent and accurate
memory reporting across multiple model load/unload cycles.

* Add NVML fallback patch for unified memory GPUs

2025-10-15 11:40:06 -07:00

backend

ml/backend/ggml: NVML fallback for unified memory GPUs (#12619 )

2025-10-15 11:40:06 -07:00

Use runners for GPU discovery (#12090 )

2025-10-01 15:12:32 -07:00

backend.go

Use runners for GPU discovery (#12090 )

2025-10-01 15:12:32 -07:00

device.go

Use runners for GPU discovery (#12090 )

2025-10-01 15:12:32 -07:00