ollama-for-amd/runner/ollamarunner/cache.go
Jesse Gross 756c78cfc7 ggml: Support closing backends
In order to iteratively find the best memory allocation, we need to
be able to free backend memory so we can try again.
2025-08-08 14:57:13 -07:00

7.6 KiB