ollama-for-amd/integration/context_test.go at 5cae567ee861b4bb3e1ff99749777d8d6873e9de

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-21 14:26:30 +00:00

Daniel Hiltgen 73e2c8f68f Fix context exhaustion integration test for small gpus

On the smaller GPUs, the initial model load of llama2 took over 30s, which
exceeds the default timeout for the DoGenerate helper.

2024-07-09 16:24:14 -07:00