How to load pre-generated embedding files? #685
Unanswered
rustam-ashurov-mcx asked this question in Q&A
Replies: 1 comment
-
hi @rustam-ashurov-mcx, what you're describing is essentially a cache: a store of text/vector pairs that you can search by text. You could implement the cache as a dependency and inject it into the embedding generators, using a lookup table. Or you could develop a custom embedding generator that never talks to OpenAI and always loads embeddings from storage, taking care of all the edge cases, e.g. distributed storage, adding new embeddings, etc.
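A minimal sketch of the lookup-table idea, in Python for illustration only. The function names (`load_cache`, `get_embedding`) and the on-disk layout (one JSON file per text/vector pair) are assumptions for the example, not SK/KM APIs:

```python
import json
from pathlib import Path


def load_cache(cache_dir: str) -> dict[str, list[float]]:
    """Build a text -> vector lookup table from pre-generated JSON files.

    Assumes each file holds a {"text": ..., "embedding": [...]} record.
    """
    cache = {}
    for path in Path(cache_dir).glob("*.json"):
        record = json.loads(path.read_text())
        cache[record["text"]] = record["embedding"]
    return cache


def get_embedding(text, cache, generate_remote):
    """Return a cached vector; only call the AI provider on a cache miss."""
    if text in cache:
        return cache[text]
    vector = generate_remote(text)  # e.g. a real OpenAI call
    cache[text] = vector            # remember it for next time
    return vector
```

The same shape works whether the cache sits in front of a real generator (second argument) or the "remote" call simply raises, which gives you the second option from above: a generator that only ever reads from storage.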
-
Hey mates, I'm having a hard time with SK + KM (mostly because I don't have much experience with either yet) 😅
My aim is to generate embeddings in advance as files, and on service startup load them into the in-memory store without calling the AI provider again on every restart.
So I generated the embeddings via the OpenAI client (I was unable to find out whether I can generate them via SK/KM and store them as files), and now they are stored as *.json files with content like this (example):
{
"data": [
{
"embedding": [
0.006308248266577721,
....lot of numbers here...
],
"index": 0,
"object": "embedding"
}
],
"model": "text-embedding-ada-002",
"object": "list",
"usage": {
"prompt_tokens": 305,
"total_tokens": 305
}
}
What I cannot understand is how this data can be loaded into KM?
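For reference, pulling the raw vector out of a file in the format above is a plain JSON walk: the vector sits at `data[0].embedding` and the `model` field records which model produced it (vectors from different models are not comparable). A small Python sketch, independent of SK/KM:

```python
import json


def read_vector(path: str) -> tuple[str, list[float]]:
    """Extract (model, vector) from one OpenAI embeddings response file."""
    with open(path) as f:
        response = json.load(f)
    # "data" is a list because the API can embed several inputs per request;
    # with a single input the vector is at index 0.
    return response["model"], response["data"][0]["embedding"]
```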