ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-22 14:53:56 +00:00

Author	SHA1	Message	Date
likelovewant	c7e2f8889d	merge update ,remove update api	2024-07-18 12:17:00 +08:00
likelovewant	5505a018b2	Resolved merge conflicts	2024-07-12 20:44:04 +08:00
Jeffrey Morgan	efbf41ed81	llm: dont link cuda with compat libs (#5621 )	2024-07-10 20:01:52 -07:00
Jeffrey Morgan	4e262eb2a8	remove `GGML_CUDA_FORCE_MMQ=on` from build (#5588 )	2024-07-10 13:17:13 -07:00
Daniel Hiltgen	1f50356e8e	Bump ROCm on windows to 6.1.2 This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.	2024-07-10 11:01:22 -07:00
likelovewant	d788d8748b	Merge branch 'ollama:main' into main	2024-07-10 12:32:28 +08:00
Daniel Hiltgen	0bacb30007	Workaround broken ROCm p2p copy Enable the build flag for llama.cpp to use CPU copy for multi-GPU scenarios.	2024-07-08 09:40:52 -07:00
likelovewant	72bcdc1d4e	Merge branch 'ollama:main' into main	2024-07-08 16:02:24 +08:00
Jeffrey Morgan	4607c70641	llm: add `-DBUILD_SHARED_LIBS=off` to common cpu cmake flags (#5520 )	2024-07-06 18:58:16 -04:00
jmorganca	f1a379aa56	llm: statically link pthread and stdc++ dependencies in windows build	2024-07-06 12:54:02 -04:00
jmorganca	9ae146993e	llm: add `GGML_STATIC` flag to windows static lib	2024-07-06 03:27:05 -04:00
Jeffrey Morgan	e0348d3fe8	llm: add `COMMON_DARWIN_DEFS` to arm static build (#5513 )	2024-07-05 22:42:42 -04:00
Jeffrey Morgan	2cc854f8cb	llm: fix missing dylibs by restoring old build behavior on Linux and macOS (#5511 ) * Revert "fix cmake build (#5505)" This reverts commit `4fd5f3526a`. * llm: fix missing dylibs by restoring old build behavior * crlf -> lf	2024-07-05 21:48:31 -04:00
likelovewant	eaa65e39c2	Merge branch 'ollama:main' into main	2024-07-06 09:41:21 +08:00
Jeffrey Morgan	4fd5f3526a	fix cmake build (#5505 )	2024-07-05 19:07:01 -04:00
Jeffrey Morgan	8f8e736b13	update llama.cpp submodule to `d7fd29f` (#5475 )	2024-07-05 13:25:58 -04:00
likelovewant	aab0ccec29	add gfx1010 with xnack-	2024-07-02 11:11:59 +08:00
likelovewant	b5286d46dc	Update gen_windows.ps1	2024-06-27 12:55:18 +08:00
likelovewant	7ef869f2dc	Update gen_windows.ps1	2024-06-25 19:21:02 +08:00
likelovewant	0e42bf50ca	Merge upstream/main and resolve conflicts	2024-06-25 00:54:58 +08:00
Daniel Hiltgen	96624aa412	Merge pull request #5072 from dhiltgen/windows_path Move libraries out of users path	2024-06-19 09:13:39 -07:00
Daniel Hiltgen	b0930626c5	Add back lower level parallel flags nvcc supports parallelism (threads) and cmake + make can use -j, while msbuild requires /p:CL_MPcount=8	2024-06-17 13:44:46 -07:00
Daniel Hiltgen	e890be4814	Revert "More parallelism on windows generate" This reverts commit `0577af98f4`.	2024-06-17 13:32:46 -07:00
Daniel Hiltgen	b2799f111b	Move libraries out of users path We update the PATH on windows to get the CLI mapped, but this has an unintended side effect of causing other apps that may use our bundled DLLs to get terminated when we upgrade.	2024-06-17 13:12:18 -07:00
Jeffrey Morgan	152fc202f5	llm: update llama.cpp commit to `7c26775` (#4896 ) * llm: update llama.cpp submodule to `7c26775` * disable `LLAMA_BLAS` for now * `-DLLAMA_OPENMP=off`	2024-06-17 15:56:16 -04:00
Daniel Hiltgen	0577af98f4	More parallelism on windows generate Make the build faster	2024-06-15 07:44:55 -07:00
likelovewant	1b5848cbf2	remove gfx906 has conflicts with gfx906：xnack-	2024-06-09 11:46:22 +08:00
likelovewant	76026b4a35	Merge branch 'ollama:main' into main	2024-06-09 10:10:23 +08:00
Daniel Hiltgen	ab8c929e20	Add ability to skip oneapi generate This follows the same pattern for cuda and rocm to allow disabling the build even when we detect the dependent libraries	2024-06-07 08:32:49 -07:00
likelovewant	27e7397b11	Update gen_windows.ps1	2024-06-07 17:35:15 +08:00
likelovewant	16ce79eb3b	Merge branch 'ollama:main' into main	2024-05-31 18:43:24 +08:00
Jeffrey Morgan	7ca9605f54	speed up tests by only building static lib (#4740 )	2024-05-30 21:43:15 -07:00
likelovewant	cafde1f8ce	Merge branch 'ollama:main' into main	2024-05-29 19:33:39 +08:00
Daniel Hiltgen	646371f56d	Merge pull request #3278 from zhewang1-intc/rebase_ollama_main Enabling ollama to run on Intel GPUs with SYCL backend	2024-05-28 16:30:50 -07:00
likelovewant	6b50b2f3bf	Update gen_windows.ps1	2024-05-24 15:42:29 +08:00
Wang,Zhe	fd5971be0b	support ollama run on Intel GPUs	2024-05-24 11:18:27 +08:00
likelovewant	d497e31f4b	Merge branch 'ollama:main' into main	2024-05-16 22:24:44 +08:00
Daniel Hiltgen	c48c1d7c46	Port cuda/rocm skip build vars to linux Windows already implements these, carry over to linux.	2024-05-15 15:56:43 -07:00
likelovewant	a1c646af51	Update gen_windows.ps1	2024-05-06 19:43:34 +08:00
likelovewant	a87ff5601a	Update gen_linux.sh	2024-05-04 00:14:15 +08:00
likelovewant	5f221adbcd	Update gen_windows.ps1	2024-05-04 00:09:48 +08:00
likelovewant	e6cdf98166	Update gen_windows.ps1	2024-05-04 00:08:13 +08:00
Hernan Martinez	8a65717f55	Do not build AVX runners on ARM64	2024-04-26 23:55:32 -06:00
Hernan Martinez	b438d485f1	Use architecture specific folders in the generate script	2024-04-26 23:34:12 -06:00
Daniel Hiltgen	e4859c4563	Fine grain control over windows generate steps This will speed up CI which already tries to only build static for unit tests	2024-04-26 15:49:46 -07:00
Daniel Hiltgen	ed5fb088c4	Fix target in gen_windows.ps1	2024-04-26 15:10:42 -07:00
Daniel Hiltgen	421c878a2d	Put back non-avx CPU build for windows	2024-04-26 12:44:07 -07:00
Daniel Hiltgen	8671fdeda6	Refactor windows generate for more modular usage	2024-04-26 08:35:50 -07:00
Daniel Hiltgen	8feb97dc0d	Move cuda/rocm dependency gathering into generate script This will make it simpler for CI to accumulate artifacts from prior steps	2024-04-25 22:38:44 -07:00
Roy Yang	5f73c08729	Remove trailing spaces (#3889 )	2024-04-25 14:32:26 -04:00

1 2 3

122 Commits