Michael Yang
|
333203d871
|
chore: update models to use slice/chunk/chunksections (#12934)
* use slice/chunks
* bert
* llama4
* gemma3n
* gptoss
* mistral3
* qwen3vl
* qwen25vl
* deepseek2
* remove unused ops
|
2025-11-13 15:20:12 -08:00 |
|
Michael Yang
|
c253433d68
|
embed: cleanup (#12299)
* cleanup
* use pooling.TypeNone
* pooling test
|
2025-09-16 09:48:42 -07:00 |
|
Michael Yang
|
3f6642f6fc
|
model: implement bert in ollama engine (#9080)
* fix truncate
* s/SentencePieceModel/SentencePiece/
* bert
* wordpiece
* refactor pooling
* more tokenizers
* normalize embeddings
|
2025-09-15 15:35:59 -07:00 |
|