Daniel Hiltgen
424810450f
Move quantization to new backend ( #10363 )
...
* Move quantization logic to GGML via new backend
This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.
* Remove "add model quantizations"
This is no longer needed now that quantization is implemented in Go+GGML code directly.
2025-05-06 11:20:48 -07:00
..
2025-04-25 16:58:49 -07:00
2024-12-10 12:58:06 -08:00
2024-07-26 14:14:48 -07:00
2025-02-28 16:10:43 -08:00
2025-05-06 11:20:48 -07:00
2025-03-28 11:50:22 -07:00
2024-03-14 20:18:06 -07:00
2024-03-14 20:18:06 -07:00
2025-04-01 15:21:46 -07:00
2025-05-01 16:50:20 -07:00
2024-11-05 14:21:45 -08:00
2024-11-05 14:21:45 -08:00
2024-11-05 14:21:45 -08:00
2024-12-31 18:02:30 -08:00
2025-05-06 11:20:48 -07:00
2024-12-11 15:29:59 -08:00
2025-03-28 11:50:22 -07:00
2024-12-09 11:02:55 -08:00
2025-03-14 15:38:54 -07:00
2025-05-06 11:20:48 -07:00
2025-05-06 11:20:48 -07:00
2025-05-06 11:20:48 -07:00
2024-12-31 18:02:30 -08:00
2025-05-06 11:20:48 -07:00
2024-12-31 18:02:30 -08:00
2025-04-30 13:57:45 -07:00
2025-04-30 13:57:45 -07:00
2025-05-06 11:20:48 -07:00
2025-05-05 09:01:33 -07:00
2024-08-09 12:16:19 -07:00
2024-08-09 12:16:19 -07:00
2025-02-04 19:30:49 -08:00