ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-24 07:28:27 +00:00

Author	SHA1	Message	Date
likelovewant	86a1575ee3	fix api v0.2.8	2024-07-23 14:57:33 +08:00
likelovewant	fbfc13b6ca	Merge branch 'ollama:main' into main	2024-07-23 14:49:32 +08:00
Daniel Hiltgen	c78089263a	Merge pull request #5864 from dhiltgen/bump_go Bump Go patch version	2024-07-22 16:34:18 -07:00
Daniel Hiltgen	3e5ea035d5	Merge pull request #5757 from lreed-mdsol/lreed/bump-go-version-fix-vulnerabilities bump go version to 1.22.5 to fix security vulnerabilities in docker	2024-07-22 16:32:43 -07:00
Daniel Hiltgen	5d604eec5b	Bump Go patch version	2024-07-22 16:16:28 -07:00
Josh	db0968f30c	fix dupe err message (#5857 )	2024-07-22 15:48:15 -07:00
royjhan	c0648233f2	api embed docs (#5282 )	2024-07-22 13:37:08 -07:00
Jeffrey Morgan	d835368eb8	convert: capture `head_dim` for mistral (#5818 )	2024-07-22 16:16:22 -04:00
Daniel Hiltgen	5784c05397	Merge pull request #5854 from dhiltgen/win_exit_status Refine error reporting for subprocess crash	2024-07-22 10:40:22 -07:00
Daniel Hiltgen	f14aa5435d	Merge pull request #5855 from dhiltgen/remove_max_vram Remove no longer supported max vram var	2024-07-22 10:35:29 -07:00
Jeffrey Morgan	f8fedbda20	Update llama.cpp submodule commit to `d94c6e0c` (#5805 )	2024-07-22 12:42:00 -04:00
Jeffrey Morgan	b3e5491e41	server: collect nested tool call objects when parsing (#5824 )	2024-07-22 12:38:03 -04:00
Daniel Hiltgen	cc269ba094	Remove no longer supported max vram var The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM scenarios. With Concurrency this was no longer wired up, and the simplistic value doesn't map to multi-GPU setups. Users can still set `num_gpu` to limit memory usage to avoid OOM if we get our predictions wrong.	2024-07-22 09:08:11 -07:00
Daniel Hiltgen	a3c20e3f18	Refine error reporting for subprocess crash On windows, the exit status winds up being the search term many users search for and end up piling in on issues that are unrelated. This refines the reporting so that if we have a more detailed message we'll suppress the exit status portion of the message.	2024-07-22 08:52:16 -07:00
likelovewant	c44ff579a3	fix mismatch	2024-07-22 19:47:58 +08:00
likelovewant	04325ba40a	fix typo	2024-07-22 19:35:43 +08:00
likelovewant	3f03ae5808	update gen_windows.ps1 ,keep track with upstream	2024-07-22 19:00:40 +08:00
likelovewant	24641ae3a5	update gen_windows.ps1 ,keep track with upstream	2024-07-22 18:48:21 +08:00
likelovewant	381e89da2e	remove unecessary files	2024-07-22 17:25:22 +08:00
likelovewant	5cae567ee8	megrge upstream update and reslove the conflicts	2024-07-22 17:00:43 +08:00
likelovewant	8ebfa2b4ec	fix links	2024-07-22 08:18:55 +08:00
likelovewant	a8890fd2c6	fix conflicts	2024-07-22 08:10:12 +08:00
Jeffrey Morgan	80ee9b5e47	Remove out of space test temporarily (#5825 )	2024-07-21 00:22:11 -04:00
Jeffrey Morgan	5534f2cc6a	llm: consider `head_dim` in llama arch (#5817 )	2024-07-20 21:48:12 -04:00
Daniel Hiltgen	d321297d8a	Merge pull request #5815 from dhiltgen/win_rocm_gfx_features Adjust windows ROCm discovery	2024-07-20 16:02:55 -07:00
Daniel Hiltgen	06e5d74e34	Merge pull request #5506 from dhiltgen/sched_tests Refine scheduler unit tests for reliability	2024-07-20 15:48:39 -07:00
Daniel Hiltgen	5d707e6fd5	Merge pull request #5583 from dhiltgen/integration_improvements Fix context exhaustion integration test for small gpus	2024-07-20 15:48:21 -07:00
Daniel Hiltgen	283948c83b	Adjust windows ROCm discovery The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.	2024-07-20 15:17:50 -07:00
Jeffrey Morgan	1475eab95f	add patch for tekken (#5807 )	2024-07-20 13:41:21 -04:00
Jeffrey Morgan	20090f3172	preserve last assistant message (#5802 )	2024-07-19 20:19:26 -07:00
Jeffrey Morgan	69a2d4ccff	Fix generate test flakyness (#5804 )	2024-07-19 19:11:25 -07:00
Josh	e8b954c646	server: validate template (#5734 ) add template validation to modelfile	2024-07-19 15:24:29 -07:00
royjhan	c57317cbf0	OpenAI: Function Based Testing (#5752 ) * distinguish error forwarding * more coverage * rm comment	2024-07-19 11:37:12 -07:00
royjhan	51b2fd299c	adjust openai chat msg processing (#5729 )	2024-07-19 11:19:20 -07:00
likelovewant	591b595290	Merge branch 'ollama:main' into main v0.2.7	2024-07-19 11:22:39 +08:00
Michael Yang	d0634b1596	Merge pull request #5780 from ollama/mxyng/tools fix parsing tool calls: break on unexpected eofs	2024-07-18 12:14:10 -07:00
Michael Yang	43606d6d6a	fix parsing tool calls	2024-07-18 12:08:11 -07:00
Jeffrey Morgan	70b1010fa5	server: check for empty tools array too (#5779 )	2024-07-18 11:44:57 -07:00
Jeffrey Morgan	84e5721f3a	always provide content even if empty (#5778 )	2024-07-18 11:28:19 -07:00
Jeffrey Morgan	319fb1ce03	server: only parse tool calls if tools are provided (#5771 ) * server: only parse tool calls if tools are provided * still set `resp.Message.Content`	2024-07-18 08:50:23 -07:00
likelovewant	877aa39290	remove update api	2024-07-18 21:52:13 +08:00
likelovewant	5ea9cc588f	Merge branch 'ollama:main' into main	2024-07-18 19:11:27 +08:00
Michael Yang	b255445557	marshal json automatically for some template values (#5758 )	2024-07-17 15:35:11 -07:00
lreed	f02f83660c	bump go version to 1.22.5 to fix security vulnerabilities	2024-07-17 21:44:19 +00:00
Michael Yang	b23424bb3c	Merge pull request #5753 from ollama/mxyng/parse-tool-call parse tool call as individual objects	2024-07-17 11:47:53 -07:00
Michael Yang	5fd6988126	parse tool call as individual objects	2024-07-17 11:19:04 -07:00
Michael Yang	5b82960df8	stub response (#5750 )	2024-07-17 10:39:22 -07:00
Michael Yang	cc9a252d8c	Merge pull request #5732 from ollama/mxyng/cleanup remove ToolCall from GenerateResponse	2024-07-17 10:26:54 -07:00
Pákozdi György	d281a6e603	add sidellama link (#5702 )	2024-07-17 10:24:44 -07:00
likelovewant	5cfa607627	Merge branch 'ollama:main' into main	2024-07-17 22:29:55 +08:00

1 2 3 4 5 ...

3266 Commits