mirror of
https://github.com/likelovewant/ollama-for-amd.git
synced 2025-12-21 22:33:56 +00:00
cross attention Q and K projections needs to have their heads swapped, similar to non-cross attention Q and K tensors
6.5 KiB
6.5 KiB