ggml: Support closing backends
In order to iteratively find the best memory allocation, we need to be able to free backend memory so we can try again.
@@ -15,6 +15,9 @@ import (
 )
 
 type Backend interface {
+	// Close frees all memory associated with this backend
+	Close()
+
 	Load(ctx context.Context, progress func(float32)) error
 
 	// BackendMemory returns the memory allocations that were made for this model
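
For context, here is a minimal sketch in Go of the retry loop this change is meant to enable: attempt a load, and if it fails, call Close to free the backend's memory before retrying with a smaller configuration. The Backend interface below is trimmed to the two methods used here; loadWithFallback and newBackend are hypothetical names for illustration, not part of ollama's API.

package backendretry

import "context"

// Backend is a trimmed copy of the interface from the diff above,
// keeping only the methods this sketch needs.
type Backend interface {
	// Close frees all memory associated with this backend
	Close()

	Load(ctx context.Context, progress func(float32)) error
}

// loadWithFallback tries progressively smaller GPU layer counts,
// closing each backend whose load fails so its memory is released
// before the next attempt. newBackend is a hypothetical constructor
// mapping a layer count to a Backend.
func loadWithFallback(ctx context.Context, layerCounts []int, newBackend func(gpuLayers int) (Backend, error)) (Backend, error) {
	var lastErr error
	for _, n := range layerCounts {
		b, err := newBackend(n)
		if err != nil {
			lastErr = err
			continue
		}
		if err := b.Load(ctx, func(float32) {}); err != nil {
			// Free this attempt's allocations, then retry smaller.
			b.Close()
			lastErr = err
			continue
		}
		// This configuration fits; the caller owns the backend and
		// should Close it when done.
		return b, nil
	}
	return nil, lastErr
}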