A Study of Biases in LLM-Generated Musical Taste Profiles for Recommendation

This repository provides our Python code to reproduce the experiments from the paper "A Study of Biases in LLM-Generated Musical Taste Profiles for Recommendation", submitted to ACM Transactions on Recommender Systems.

The data and code for reproducing some of the experiments can be found in the repository of the RecSys version of this paper.

Recommendation-as-retrieval experiments

This repository tests recommendation as retrieval, using generated user profiles as queries and track metadata as documents.
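
To make the setup concrete, here is a minimal sketch of retrieval-style ranking, assuming profiles and tracks have already been embedded as vectors. The toy vectors, the track identifiers, and the `cosine` and `rank_tracks` helpers are illustrative assumptions, not the code in `src/`.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_tracks(profile_vec, track_vecs, k=2):
    """Return the ids of the k tracks most similar to the profile embedding."""
    scored = sorted(
        track_vecs.items(),
        key=lambda item: cosine(profile_vec, item[1]),
        reverse=True,
    )
    return [track_id for track_id, _ in scored[:k]]


# Toy embeddings (hypothetical); in practice they come from a text encoder.
profile = [0.9, 0.1, 0.0]          # query: embedded user profile
tracks = {                         # documents: embedded track metadata
    "track_a": [1.0, 0.0, 0.0],
    "track_b": [0.0, 1.0, 0.0],
    "track_c": [0.7, 0.7, 0.0],
}
print(rank_tracks(profile, tracks))
```

The profile plays the role of the query and each track's metadata plays the role of a document, so a recommendation list is simply the top of the similarity ranking.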

Installation

make build
make run
hf auth login

Place the data in data/ following the instructions.

Testing pre-trained encoders

We selected models relying on a variety of architectures and sizes. The criteria for selecting the models were that they are open-source and achieve top retrieval results within their size category according to the MTEB leaderboard.

When running experiments with a model for the first time, embeddings are computed from scratch and saved in the output directory provided as an argument (see below). Subsequent runs load the embeddings from that directory. To force recomputation of the embeddings, remove the corresponding directory manually.
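
The caching behaviour described above can be sketched as follows. This is only an illustration under stated assumptions: the `load_or_compute_embeddings` helper, the per-model sub-directory naming, and the `embeddings.json` file name are hypothetical and may differ from what `src.eval` actually does.

```python
import json
from pathlib import Path


def load_or_compute_embeddings(texts, encode_fn, output_dir, model_name):
    """Load cached embeddings if present, otherwise compute and save them.

    encode_fn is any callable mapping a string to a sequence of floats
    (hypothetical stand-in for a real encoder).
    """
    # Assumed layout: one sub-directory per model; "/" is replaced so the
    # model name maps to a single directory level.
    cache_dir = Path(output_dir) / model_name.replace("/", "__")
    cache_file = cache_dir / "embeddings.json"

    if cache_file.exists():
        # Subsequent runs: reuse what was saved the first time.
        with cache_file.open() as f:
            return json.load(f)

    # First run: compute from scratch, then persist for later runs.
    embeddings = [list(encode_fn(text)) for text in texts]
    cache_dir.mkdir(parents=True, exist_ok=True)
    with cache_file.open("w") as f:
        json.dump(embeddings, f)
    return embeddings
```

With a layout like this, deleting the model's sub-directory is what triggers recomputation on the next run.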

Average model size

google/embeddinggemma-300m

encoder-decoder architecture, decoder uses Gemma 3 as backbone
1155 MB memory usage
768 embedding dimension

python -m src.eval --input_dir data/ --output_dir output --model_name google/embeddinggemma-300m

Smaller model size

Alibaba-NLP/gte-multilingual-base

encoder-only
582 MB memory usage
768 embedding dimension

python -m src.eval --input_dir data/ --output_dir output --model_name Alibaba-NLP/gte-multilingual-base

Compact model size

sentence-transformers/all-MiniLM-L6-v2

encoder-only
87 MB memory usage
384 embedding dimension

python -m src.eval --input_dir data/ --output_dir output --model_name sentence-transformers/all-MiniLM-L6-v2
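
To evaluate all three encoders in one go, a small driver script along these lines could assemble the commands shown above. The `MODELS` list and the `build_command` helper are assumptions made for illustration; only the `--input_dir`, `--output_dir`, and `--model_name` flags come from this README.

```python
import shlex

# The three encoders listed in this README.
MODELS = [
    "google/embeddinggemma-300m",
    "Alibaba-NLP/gte-multilingual-base",
    "sentence-transformers/all-MiniLM-L6-v2",
]


def build_command(model_name, input_dir="data/", output_dir="output"):
    """Build the argument list for one evaluation run."""
    return [
        "python", "-m", "src.eval",
        "--input_dir", input_dir,
        "--output_dir", output_dir,
        "--model_name", model_name,
    ]


for model in MODELS:
    # This only prints each command. To execute it instead, pass the list
    # to subprocess.run(cmd, check=True) from the repository root.
    print(shlex.join(build_command(model)))
```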
