ollama-for-amd/server/quantization.go at 424810450f3043e97aca539f1250d149a26cd99e

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-21 14:26:30 +00:00

Files

Daniel Hiltgen 424810450f Move quantization to new backend (#10363 )

* Move quantization logic to GGML via new backend

This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.

2025-05-06 11:20:48 -07:00

9.3 KiB

Raw Blame History

View Raw

9.3 KiB Raw Blame History

9.3 KiB

Raw Blame History