GitHub - DhanashriPetkar/Dy.Tech: We use the 1M version of the MovieLens dataset. The dataset includes around 1 million ratings from 6000 users on 4000 movies, along with some user features and movie genres. In addition, the timestamp of each user-movie rating is provided, which makes it possible to build a sequence of movie ratings for each user, as expected by the BST model.

Team - Dy.Tech

| Name | Class | Roll No. |
|---|---|---|
| Amar Parab | TY A | 39 |
| Simeen Pathan | TY A | 35 |
| Dhanashri Petkar | TY A | 02 |
| Sania Alam | TY A | 17 |

##PROJECT TITLE
###movielens_recommendations_transformers
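
Because every MovieLens rating carries a timestamp, each user's ratings can be sorted chronologically and cut into fixed-length windows for a sequence model such as BST. The snippet below is a minimal, dependency-free sketch of that idea and not the project's actual code: the function name `build_sequences`, the tuple layout, the `window` and `step` values, and the sample ratings are all assumptions made for illustration.

```python
from collections import defaultdict


def build_sequences(ratings, window=4, step=2):
    """Group ratings by user, sort by timestamp, and cut sliding windows.

    ratings: iterable of (user_id, movie_id, rating, timestamp) tuples.
    Returns a dict mapping user_id to a list of windows, where each window
    is a list of (movie_id, rating) pairs of length `window`.
    Users with fewer than `window` ratings are left out.
    """
    by_user = defaultdict(list)
    for user_id, movie_id, rating, timestamp in ratings:
        by_user[user_id].append((timestamp, movie_id, rating))

    sequences = {}
    for user_id, events in by_user.items():
        events.sort()  # chronological order: tuples compare by timestamp first
        ordered = [(movie_id, rating) for _, movie_id, rating in events]
        windows = [
            ordered[start:start + window]
            for start in range(0, len(ordered) - window + 1, step)
        ]
        if windows:
            sequences[user_id] = windows
    return sequences


# Made-up ratings for illustration: (user_id, movie_id, rating, timestamp)
ratings = [
    (1, 10, 4.0, 300), (1, 11, 5.0, 100), (1, 12, 3.0, 200),
    (1, 13, 2.0, 400), (1, 14, 4.0, 500), (2, 10, 1.0, 50),
]
sequences = build_sequences(ratings, window=3, step=1)
```

In this sketch, a smaller `step` produces more overlapping training windows per user, while a larger one reduces redundancy; the right trade-off depends on how much data each user contributes.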

##USE OF REAL DATASET

###Project Structure

Uploading and Reading the Dataset: Upload the dialogue transcript and read it into a pandas DataFrame.
Preprocessing the Data: Clean and preprocess the data for training, including encoding emotions and normalizing VAD (Valence, Arousal, Dominance) scores.
Tokenization: Tokenize the input text using the BERT tokenizer.
Custom Dataset Class: Create a custom dataset class to handle input encodings and labels.
Model Training: Train a BERT model for sequence classification using the prepared dataset.
Model Evaluation: Evaluate the trained model on the dataset.
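
The preprocessing step listed above (encoding emotions and normalizing VAD scores) can be sketched as follows. This is a small, dependency-free illustration under stated assumptions, not the notebook's implementation: the record fields (`text`, `emotion`, `valence`, `arousal`, `dominance`), the sample values, the choice of min-max scaling, and the helper names `encode_labels` and `min_max_normalize` are all hypothetical.

```python
def encode_labels(labels):
    """Map each distinct label to an integer id, assigned in sorted order."""
    classes = sorted(set(labels))
    label_to_id = {label: i for i, label in enumerate(classes)}
    return [label_to_id[label] for label in labels], label_to_id


def min_max_normalize(values):
    """Scale values linearly into [0, 1]; a constant column maps to 0.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical utterance records; the values are made up for illustration.
records = [
    {"text": "I can't believe it.", "emotion": "ang",
     "valence": 2.0, "arousal": 4.0, "dominance": 3.5},
    {"text": "That's wonderful!", "emotion": "hap",
     "valence": 4.5, "arousal": 3.5, "dominance": 3.0},
    {"text": "Okay.", "emotion": "neu",
     "valence": 3.0, "arousal": 2.0, "dominance": 2.5},
]

# Encode the emotion strings as integer class labels.
emotion_ids, label_to_id = encode_labels([r["emotion"] for r in records])
for record, emotion_id in zip(records, emotion_ids):
    record["label"] = emotion_id

# Normalize each VAD dimension independently across all records.
for dim in ("valence", "arousal", "dominance"):
    scaled = min_max_normalize([r[dim] for r in records])
    for record, value in zip(records, scaled):
        record[dim] = value
```

After this step each record holds an integer `label` suitable for sequence classification and VAD scores on a common scale, and the `text` field is ready to be passed to a tokenizer.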

Repository files:

DataInsights.py
Default_Transformer_based_recommendation_system.ipynb
G42_DY_TECH_REAL_DATASET.ipynb
MAE.png
Model Training with no MAE
P1.png
P2.png
README.md
Report_on_A_Transformer_based_recommendation_system.pdf
Ses01M_script01_1.txt
Ses01M_script01_2.txt
Ses01M_script01_3.txt
project.py