Skip to content

A music recommender system as part of Kaggle Championship.

Notifications You must be signed in to change notification settings

BlancNicolas/RecSys_challenge

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RecSys_challenge

A music recommender system as part of Kaggle Championship.

Data

train.csv : the training set describing which tracks are included in the playlists tracks.csv : supplementary information about tracks target_playlists.csv : the set of target playlists that will receive recommendations train_sequential.csv: list of ordered interactions for the 5k sequential playlists. You can find the file here. sample_submission.csv: correct format for submissions

train.csv

playlist_id : identifier of the playlist track_id : identifier of the track

Cold users analysis

According to the data here is the repartition of the number of tracks per playlist.

| Number of tracks | Number of playlists | | :-------------: |: -------------: | | <=5 | 1788 | | <10 | 11657 | | <20 | 26093 | | <30 | 35959 | | <40 | 42285 | | <100 | 50446 |

total number of playlist : 50446

After a comparison between the cold users found in the dataset and users in test dataset, there are no users with a track number <= 5.

Process

  • Sparse matrices creation
  • Recommender creation ( Type of recommender as to be chosen )
  • Evaluation of parameters of the extracted test set of train.csv
  • Choice of parameters according to the previous evaluation

Environment

  • Conda

Packages

  • sklearn

About

A music recommender system as part of Kaggle Championship.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published