| Author | Commit | Message | Date |
|---|---|---|---|
| Michael Yang | 903b1fc97f | use ollama engine for bert models (#13501); register bpe tokenizer which enables granite-embedding | 2025-12-16 11:29:19 -08:00 |
| nicole pardal | 8de30b568a | nomic-embed-text model implementation (#13071) | 2025-11-18 18:28:10 -08:00 |
| Michael Yang | 333203d871 | chore: update models to use slice/chunk/chunksections (#12934); use slice/chunks; bert; llama4; gemma3n; gptoss; mistral3; qwen3vl; qwen25vl; deepseek2; remove unused ops | 2025-11-13 15:20:12 -08:00 |
| Michael Yang | 1188f408dd | s/From\*Slice/From\*s/ (#12255) | 2025-10-28 12:08:49 -07:00 |
| Michael Yang | c253433d68 | embed: cleanup (#12299); cleanup; use pooling.TypeNone; pooling test | 2025-09-16 09:48:42 -07:00 |
| Michael Yang | 3f6642f6fc | model: implement bert in ollama engine (#9080); fix truncate; s/SentencePieceModel/SentencePiece/; bert; wordpiece; refactor pooling; more tokenizers; normalize embeddings | 2025-09-15 15:35:59 -07:00 |