ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-22 06:43:57 +00:00

Files

Jesse Gross 19e6796eac llm: Support KV cache quantization with gpt-oss

With the new version of GGML in #12245, KV cache quantization
no longer causes a fallback to CPU.

2025-10-03 16:31:58 -07:00

ggml_test.go

2025-04-27 11:38:06 -07:00

ggml.go

2025-10-03 16:31:58 -07:00

gguf_test.go

2025-08-26 13:57:46 -07:00

gguf.go

2025-08-26 13:57:46 -07:00

type.go

2025-08-26 16:41:02 -07:00