Files
ollama-for-amd/llama/patches/0036-ggml-cuda-skip-large-batches.patch
Michael Yang 0796d79d19 cuda: skip large batches
cuda panics on batches larger than 1024 so skip those and fallback to
cpu
2025-11-18 16:11:37 -08:00

1.1 KiB