ollama-for-amd/ml/nn/attention.go
Grace 584e2d646f Add deepseek v3.1 (#13063)
* Add mla for flash attention
* Revert to using chunks
2025-11-17 18:03:21 -08:00

2.8 KiB