This website requires JavaScript.
Explore
Help
Sign In
mirrors
/
ollama-for-amd
Watch
1
Star
0
Fork
0
You've already forked ollama-for-amd
mirror of
https://github.com/likelovewant/ollama-for-amd.git
synced
2025-12-21 22:33:56 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
a4770107a6ea6b4f5adc235d37d08417dc3b9184
ollama-for-amd
/
server
/
quantization.go
Michael Yang
d0b32def60
skip quantizing per_layer_token_embd (
#11207
)
...
this tensor isn't compatible with cuda when quantized to q4_K so skip it
2025-06-26 21:49:35 -07:00
8.2 KiB
Raw
Blame
History
View Raw
Reference in New Issue
View Git Blame
Copy Permalink