Open
Description
This issue is grouping a few things that might be optimized to improve performance:
- Faster dot product possible, see here: maths: switch dot to faster for loop huggingface/transformers.js#667
- Add binary quantization for incredible performance gains: https://github.com/xenova/transformers.js/releases/tag/2.17.0. This should enable saving up to 3000 books in one index. In my last tests it was crashing for 1000 but working perfectly for 100 books. Context1, context2, example
tbc
Metadata
Metadata
Assignees
Labels
No labels