ollama-for-amd/ml/backend/ggml/ggml.go at d773b7d67161cba40a342e74d66b7363dfdd38d2

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-22 14:53:56 +00:00

Files

Jesse Gross d773b7d671 backend: API to support full precision matmul

Most tensor backends try to optimize performance by using a lower
precision for matmuls. However, some operations (such as kq) on
some models are sensitive to this and require full precision.

2025-02-13 17:09:26 -08:00

14 KiB

Raw Blame History

View Raw

14 KiB Raw Blame History

14 KiB

Raw Blame History