-
Notifications
You must be signed in to change notification settings - Fork 1.2k
chore: Updating documentation, adding exception handling for Vector Stores in RAG Tool, more tests on migration, and migrate off of inference_api for context_retriever for RAG #3367
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
8080713
to
b169c7d
Compare
), | ||
Document( | ||
document_id="data-url-doc", | ||
content="data:text/plain;base64,VGhpcyBpcyBhIGRhdGEgVVJMIGRvY3VtZW50IGFib3V0IGRlZXAgbGVhcm5pbmcu", # "This is a data URL document about deep learning." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
as noted in the comment, this is a base64 encoded string that just says "This is a data URL document about deep learning."
static=VectorStoreChunkingStrategyStaticConfig( | ||
max_chunk_size_tokens=chunk_size_in_tokens, | ||
chunk_overlap_tokens=chunk_size_in_tokens // 4, | ||
try: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some nested blocks here to handle failures more gracefully
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hum given that each except block continues
why do we need to have that nested structure? We always continue the loop anyway right? Or is the goal more granular logging of the actual failures?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah the goal was more granular logging just in case so that end users can debug potential data parsing issues more obviously.
47c2809
to
8caaf38
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Migration instructions are clear and provide ample warning for updating to the new OpenAI APIs.
Unit/Integration tests look good and are passing. /lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, @franciscojavierarceo! lgtm
…r Stores in RAG Tool and updating inference to use openai and updating memory implementation to use existing libraries Signed-off-by: Francisco Javier Arceo <[email protected]>
215c898
to
ff0bd41
Compare
@@ -60,6 +57,47 @@ def make_random_string(length: int = 8): | |||
return "".join(secrets.choice(string.ascii_letters + string.digits) for _ in range(length)) | |||
|
|||
|
|||
async def raw_data_from_doc(doc: RAGDocument) -> tuple[bytes, str]: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
needed this after testing with the type script UI
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One non-blocking question. Thanks!
static=VectorStoreChunkingStrategyStaticConfig( | ||
max_chunk_size_tokens=chunk_size_in_tokens, | ||
chunk_overlap_tokens=chunk_size_in_tokens // 4, | ||
try: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hum given that each except block continues
why do we need to have that nested structure? We always continue the loop anyway right? Or is the goal more granular logging of the actual failures?
What does this PR do?
Test Plan
Integration and unit tests added