ollama-for-amd/server/sched.go at 71399aa682726e472ca271f02417d87f6f8be429

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-22 06:43:57 +00:00


Daniel Hiltgen 345420998e Prevent partial loading on mixed GPU brands

In multi-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands.  This makes sure we find the set of GPU(s) that
best fits the partial load.

2024-07-30 11:00:55 -07:00
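
The behavior described in the commit message can be illustrated with a minimal sketch. It is not the code from sched.go: the `gpu` type, its fields, and the functions `groupByLibrary` and `pickPartialLoadGroup` are hypothetical names invented for this example, and "best fit" is assumed here to mean "the single-brand group with the most total free memory". The real scheduler may rank candidates differently.

```go
package main

import (
	"fmt"
	"sort"
)

// gpu is a hypothetical, simplified stand-in for a scheduler's GPU record.
type gpu struct {
	ID         string
	Library    string // brand/runtime, e.g. "cuda" or "rocm"
	FreeMemory uint64 // bytes
}

// groupByLibrary splits the GPUs into single-brand groups so that a
// model is never spread across a mix of brands.
func groupByLibrary(gpus []gpu) map[string][]gpu {
	groups := make(map[string][]gpu)
	for _, g := range gpus {
		groups[g.Library] = append(groups[g.Library], g)
	}
	return groups
}

// pickPartialLoadGroup returns the single-brand group with the most
// total free memory (the assumed "best fit" for a partial load).
// It returns nil when no GPUs are given.
func pickPartialLoadGroup(gpus []gpu) []gpu {
	groups := groupByLibrary(gpus)

	// Visit brands in sorted order so ties are broken deterministically.
	libs := make([]string, 0, len(groups))
	for lib := range groups {
		libs = append(libs, lib)
	}
	sort.Strings(libs)

	var best []gpu
	var bestFree uint64
	for _, lib := range libs {
		var free uint64
		for _, g := range groups[lib] {
			free += g.FreeMemory
		}
		if best == nil || free > bestFree {
			best, bestFree = groups[lib], free
		}
	}
	return best
}

func main() {
	gpus := []gpu{
		{ID: "0", Library: "cuda", FreeMemory: 8 << 30},
		{ID: "1", Library: "rocm", FreeMemory: 6 << 30},
		{ID: "2", Library: "rocm", FreeMemory: 6 << 30},
	}
	// Only GPUs of one brand are selected for the partial load.
	for _, g := range pickPartialLoadGroup(gpus) {
		fmt.Println(g.ID, g.Library)
	}
}
```

Grouping before selection is what rules out a mixed-brand result: whichever ranking is used, the returned slice always comes from exactly one group.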

28 KiB
