Skip to content

Commit 2226460

Browse files
committed
update changes.md
1 parent 1f70046 commit 2226460

File tree

1 file changed

+11
-1
lines changed

1 file changed

+11
-1
lines changed

CHANGES.md

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,10 +3,20 @@
33
## v1.1.0
44

55
- Rename EmbeddingGenerator to EmbeddingEncoder
6-
- Use non-quantized default model by default for better embedding performance but require more memory
76
- Fixed serverOptions weren't passed through properly in test cases
87
- Upgrade to @huggingface/transformers v3.2.4
98
- Upgrade onnxruntime-node v1.20.1
9+
- Avoid including unused models in docker images (smaller image size)
10+
- Increase probe timeout seconds
11+
- Use worker pool
12+
- Process sentence list with separate model runs
13+
- set default `workerTaskTimeout` to `60` seconds
14+
- use quantized version (q8) default model
15+
- set default `limits.memory` to `850M`
16+
- set default replicas number to `2`
17+
- Add max_length config to model config (configurable via helm config)
18+
- set max_length of default model to 1024 due to excessive memory usage when working on text longer than 2048 (the default model supports up to 8192)
19+
- only use padding for multiple inputs received when encoding the input
1020

1121
## v1.0.0
1222

0 commit comments

Comments
 (0)