ollama-for-amd/model/models/qwen3/model.go
Michael Yang 6c833d5f8d fix(qwen3): deepseek distill
DeepSeek's Qwen3 distill uses a different RoPE scheme, so support both.
2025-10-13 13:30:30 -07:00

8.3 KiB