ollama-for-amd/llama/runner/runner.go at 3fc1dc0e6f32a22063db22a4dc72a75f8411a663

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-23 23:18:26 +00:00

Files

Jesse Gross 3fc1dc0e6f runner.go: Hard fail on errors rather than potentially infinite looping

We try to recover from errors by dropping the tokens that caused the
problem and re-trying. However, dropping the tokens is not correct
and continuing often leads to infinite loops. To avoid, this we
end the sequence if such a condition is detected, which is also
surprising.

At this point, it is better to just report the error. This will make
it easier to find problems and the alternatives are perhaps even more
surprising to users.

This is not a very satisfactory solution either - we should isolate
the error and return it to the user without killing the whole process.
However, this is an incremental step and consistent with most other
failures (which either manifest as abort() or panic).

2024-11-20 12:49:24 -08:00

24 KiB

Raw Blame History

View Raw

24 KiB Raw Blame History

24 KiB

Raw Blame History