ollama-for-amd/server at 94ab428e3f77fdd9d9c833b369bb40980c65049a - ollama-for-amd - Git.NotJustAnna.net

mirrors/ollama-for-amd

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-23 23:18:26 +00:00

Files

History

Jesse Gross 94ab428e3f ggml: Seperate tensor load from backend creation

Currently, when the backend is created, the tensors are loaded at the
same time, which is a slow operation. This separates them to be two
steps:
 - Create backend, including enumerating tensors and memory allocation
 - Loading tensor data

This allows more flexibility in managing model loading.

2025-05-19 09:54:22 -07:00

..

lint: enable usetesting, disable tenv (#10594 )

2025-05-08 11:42:14 -07:00

all: fix typos in documentation, code, and comments (#7021 )

2024-12-10 12:58:06 -08:00

auth.go

fix nil deref in auth.go

2024-07-26 14:14:48 -07:00

create_test.go

server: validate local path on safetensor create (#9379 )

2025-02-28 16:10:43 -08:00

create.go

ggml: Seperate tensor load from backend creation

2025-05-19 09:54:22 -07:00

download.go

server: organize error types (#9465 )

2025-03-28 11:50:22 -07:00

fixblobs_test.go

server: replace blob prefix separator from ':' to '-' (#3146 )

2024-03-14 20:18:06 -07:00

fixblobs.go

server: replace blob prefix separator from ':' to '-' (#3146 )

2024-03-14 20:18:06 -07:00

images_test.go

lint: enable usetesting, disable tenv (#10594 )

2025-05-08 11:42:14 -07:00

images.go

ggml: Seperate tensor load from backend creation

2025-05-19 09:54:22 -07:00

layer.go

One corrupt manifest should not wedge model operations (#7515 )

2024-11-05 14:21:45 -08:00

manifest_test.go

One corrupt manifest should not wedge model operations (#7515 )

2024-11-05 14:21:45 -08:00

manifest.go

One corrupt manifest should not wedge model operations (#7515 )

2024-11-05 14:21:45 -08:00

model_test.go

Update the /api/create endpoint to use JSON (#7935 )

2024-12-31 18:02:30 -08:00

model.go

ggml: Seperate tensor load from backend creation

2025-05-19 09:54:22 -07:00

modelpath_test.go

lint: enable usetesting, disable tenv (#10594 )

2025-05-08 11:42:14 -07:00

modelpath.go

server: organize error types (#9465 )

2025-03-28 11:50:22 -07:00

prompt_test.go

chore: update mllama to use ollama engine (#10637 )

2025-05-13 17:36:02 -07:00

prompt.go

chore: update mllama to use ollama engine (#10637 )

2025-05-13 17:36:02 -07:00

quantization_test.go

ggml: Seperate tensor load from backend creation

2025-05-19 09:54:22 -07:00

quantization.go

Follow up to #10363 (#10647 )

2025-05-12 15:23:31 -07:00

routes_create_test.go

Move quantization to new backend (#10363 )

2025-05-06 11:20:48 -07:00

routes_delete_test.go

Update the /api/create endpoint to use JSON (#7935 )

2024-12-31 18:02:30 -08:00

routes_generate_test.go

lint: enable usetesting, disable tenv (#10594 )

2025-05-08 11:42:14 -07:00

routes_list_test.go

Update the /api/create endpoint to use JSON (#7935 )

2024-12-31 18:02:30 -08:00

routes_test.go

fix: stream accumulator exits early (#10593 )

2025-05-08 13:17:30 -07:00

routes.go

chore: update mllama to use ollama engine (#10637 )

2025-05-13 17:36:02 -07:00

sched_test.go

lint: enable usetesting, disable tenv (#10594 )

2025-05-08 11:42:14 -07:00

sched.go

chore: update mllama to use ollama engine (#10637 )

2025-05-13 17:36:02 -07:00

sparse_common.go

Don't hard fail on sparse setup error

2024-08-09 12:16:19 -07:00

sparse_windows.go

Don't hard fail on sparse setup error

2024-08-09 12:16:19 -07:00

upload.go

server: always print upload/download part info (#8832 )

2025-02-04 19:30:49 -08:00