ollama-for-amd/llm/payload.go at ad90b9ab3d94e330970b00ae29be50b6ed62a8fb

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-22 06:43:57 +00:00

Daniel Hiltgen 58d95cc9bd Switch back to subprocessing for llama.cpp

This should resolve a number of memory leak and stability defects by allowing
us to isolate llama.cpp in a separate process, shut it down when idle, and
gracefully restart it if it has problems. This also serves as a first step
toward running multiple copies to support multiple models concurrently.

2024-04-01 16:48:18 -07:00

5.0 KiB
