mirror of
https://github.com/likelovewant/ollama-for-amd.git
synced 2025-12-21 22:33:56 +00:00
It can be important for a tensor to know what backend it came from - for example, to know if flash attention is enabled.
It can be important for a tensor to know what backend it came from - for example, to know if flash attention is enabled.