ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-24 07:28:27 +00:00

Author	SHA1	Message	Date
likelovewant	45e42a9a02	fix modules mismatch	2024-07-31 14:58:13 +08:00
likelovewant	ad5ad895fb	fix	2024-07-31 13:37:19 +08:00
likelovewant	8ebfa2b4ec	fix links	2024-07-22 08:18:55 +08:00
likelovewant	00beadf67e	update	2024-07-10 23:40:16 +08:00
Daniel Hiltgen	fac9060da5	Init submodule with new path	2024-01-04 13:00:13 -08:00
Daniel Hiltgen	a554616f8e	remove old llama.cpp submodule path	2024-01-04 12:12:21 -08:00
Daniel Hiltgen	77d96da94b	Code shuffle to clean up the llm dir	2024-01-04 12:12:05 -08:00
Bruce MacDonald	811b1f03c8	deprecate ggml - remove ggml runner - automatically pull gguf models when ggml detected - tell users to update to gguf in the case automatic pull fails Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>	2023-12-19 09:05:46 -08:00
Michael Yang	058d0cd04b	silence warm up log	2023-09-21 14:53:33 -07:00
Bruce MacDonald	09dd2aeff9	GGUF support (#441 )	2023-09-07 13:55:37 -04:00
Jeffrey Morgan	a82eb275ff	update docs for subprocess	2023-08-30 17:54:02 -04:00
Bruce MacDonald	42998d797d	subprocess llama.cpp server (#401 ) * remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm	2023-08-30 16:35:03 -04:00