ollama-for-amd/llama/patches/0026-ggml-No-alloc-mode.patch at d4af9f04f9dd1f07e9f2f0ca2270871b368778c3

mirror of https://github.com/likelovewant/ollama-for-amd.git synced 2025-12-21 22:33:56 +00:00

Files

Jesse Gross 79f6376f5b ggml: No-alloc mode

Callers can set a backend buffer type to be no-alloc, meaning that
it does not allocate memory for tensors or operations. This can
be used for calculating memory requirements. Tensors and graphs
must be recreated with no-alloc set to false before loading data.

Defaults to false for newly created backend buffer types.

2025-08-08 14:57:13 -07:00

3.7 KiB

Raw Blame History

View Raw

3.7 KiB Raw Blame History

3.7 KiB

Raw Blame History