mirror of
https://github.com/likelovewant/ollama-for-amd.git
synced 2025-12-21 22:33:56 +00:00
In order to iteratively find the best memory allocation, we need to be able to free backend memory so we can try again.
In order to iteratively find the best memory allocation, we need to be able to free backend memory so we can try again.