ollama-for-amd/server/sched.go at ff4f0cbd1d54ba5acc89c97b49af017eb0d2512d

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-21 22:33:56 +00:00

Files

Daniel Hiltgen ff4f0cbd1d Prevent multiple concurrent loads on the same gpus

While models are loading, the VRAM metrics are dynamic, so try
to load on a GPU that doesn't have a model actively loading, or wait
to avoid races that lead to OOMs

2024-06-14 14:51:40 -07:00

24 KiB

Raw Blame History

View Raw

24 KiB Raw Blame History

24 KiB

Raw Blame History