Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AtlasML: Add initial embedding model #39

Merged
merged 5 commits into from
Feb 19, 2025

Conversation

ardakaraman0
Copy link
Contributor

What?

Closes #22
This PR creates the two initial embedding models that we will be using.
Tries out OpenAI embeddings model.
Tries out Sentence BERT as a fallback model.

Why?

These models are the starting point of the context extraction that we will be using for the similarity measures and clustering methods.

How?

Affected Issues & Feature Proposal

Closes #22

Checklist

General

AtlasML - Machine Learning

  • I updated the machine learning documentation.

Testing

Testing Instructions

Requirements:

  • OpenAI key

Steps:

  1. Run AtlasML/atlasml/ml/VectorEmbeddings/FallbackModel.py
  2. Put in your OpenAI key in MainEmbeddingModel.py:4 where it indicates.
  3. Run AtlasML/atlasml/ml/VectorEmbeddings/MainEmbeddingModel.py

Review Progress

Code Review
  • Code Review 1
  • Code Review 2
Manuel Test
  • Test 1
  • Test 2

@ardakaraman0 ardakaraman0 self-assigned this Feb 10, 2025
@MaximilianAnzinger MaximilianAnzinger added this to the 0.0.0 milestone Feb 11, 2025
ufukygmr
ufukygmr previously approved these changes Feb 14, 2025
@MaximilianAnzinger MaximilianAnzinger changed the title AtlasML : Add Initial embedding model AtlasML: Add Initial embedding model Feb 19, 2025
@MaximilianAnzinger MaximilianAnzinger changed the title AtlasML: Add Initial embedding model AtlasML: Add initial embedding model Feb 19, 2025
@MaximilianAnzinger MaximilianAnzinger merged commit 7b954c7 into main Feb 19, 2025
4 checks passed
@MaximilianAnzinger MaximilianAnzinger deleted the feature/ML/initial-embedding-model branch February 19, 2025 09:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

AtlasML : Add Initial embedding model
3 participants