Skip to content

theophanedb/ML-Music_Year_Prediction

Repository files navigation

ML-YearPredictionMSD

The project

The goal of this Machine Learning project is to build a model to predict the release year of a song, based on timbre audio features.

Data was collected by the Echo Nest API, which loads music files and returns a JSON file containing much information.

The dataset consists of 515345 instances, 90 predictors and 1 target : Year.

My final model is : KNeighborsClassifier(n_neighbors=3, weights='distance')

It got an accuracy of 5.7 % and an accuracy to the decade of 47.7 %.

Its average absolute distance from the original release year is 9 years.

The repository contains

  • A Jupyter Notebook where you can find all data processing, dataviz and machine learning algorithms
  • An API
  • A PDF file that summarizes my work
  • Documentation about the subject, the context and the dataset

Launching the API

Requirements

  • numpy
  • pandas
  • pickle

About

Machine Learning for Year Prediction on Audio Features

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published