Spotify-XRL-Skipping-Prediction

An investigation on the utility of users’ historical data for the task of sequentially predicting users’ music skipping behaviour using Deep Reinforcement Learning. The analysis is performed on the Spotify’s Music Streaming Sessions Dataset (MSSD).

This repository contains the source code for the approach outlined in the Full Paper Why People Skip Music? On Predicting Music Skips using Deep Reinforcement Learning, accepted at the 2023 ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR2023).

For the YouTube presentation of this paper, please click here.

To know more about our research activities at NeuraSearch Laboratory, please follow us on Twitter (@NeuraSearch) and to get notified of future uploads please subscribe to our YouTube channel!

Installation

The required Python packages can be found in requirements.txt. Using a package manager such as pip, they can be easily installed as follows:

conda create --name XRL-Skipping-Prediction python=3.9
conda activate XRL-Skipping-Prediction
pip3 install -r requirements.txt

Data Preparation

Part 1. Upload the track features and the individual log files (from the MSSD) within the data/ folder as follows:

data/track_features/ should contain:
- tf_000000000000.csv
- tf_000000000001.csv
data/training_set/ should contain:
- log_0_20180715_000000000000.csv
- log_1_20180715_000000000000.csv
- log_2_20180715_000000000000.csv
- log_3_20180715_000000000000.csv
data/test_set should contain (e.g., to generate the Test Set T1 in the accompanying paper):
- log_4_20180715_000000000000.csv

Part 2:

python3 data_preprocessing.py

The following files should have been created:

data/track_features/track_features_data.parquet
data/training_set/training_set.parquet
data/test_set/test_set.parquet

Deep Reinforcement Learning Model

Manual Execution

python3 model.py --model-name MyFirstDQN \
    --dqn-type dqn \
    --network mlp \
    --test-set data/test_set/test_set.parquet

The DQN is run five times on the given test set. The results and the models can be located at /results and /results/models respectively.

The model.py script will train and evaluate our proposed approach.

Below is the list of all command line parameters:

Command	Description
`--model-name`	Name of the model (e.g., MyFirstDQN)
`-v`	[Optional]. Enable verbose printing
`-n`	[Optional]. Specify the number of episodes for training and evaluation. If not set, all episodes are considered
`--dqn-type`	[Optional, default: `dqn`]. Type of DQN: Vanilla DQN (`dqn`), Double DQN (`ddqn`), or Dueling DQN (`dueling_dqn`)
`--n-step`	[Optional, default: `1`]. n-step learning (>=1).
`--network`	[Optional, default: `mlp`]. Type of network: MLP (`mlp`), Observations Stacking (`mlp_obs_stacking`), LSTM (`lstm`), or GRU (`gru`)
`--no-temporal-leakage`	[Optional]. Do not remove the data leaking features. By default (i.e., no flag), such features are removed from the state representation
`--test-set`	The path to testing set (e.g., `data/testing_set/test_set.parquet`)

Post-Hoc (SHAP) analysis

By default, the post-hoc (SHAP) analysis is not performed given its extensive computational requirements. To perform a SHAP analysis, please make the following change in model.py:

self.evaluate(shap_analysis=False)

➡️

self.evaluate(shap_analysis=True)

Then, run model.py as outlined in Section Manual Execution. The results from the SHAP analysis can be found at /results/shap.

Ablation Analysis of the State Representation

Details on how to perform the ablation analysis can be found within the def __retrieve_df_and_n_episodes_from(self, data_path) function in lib/base_model.py. This procedure requires some manual intervention.

By default (and with a "corrected" state representation), the following line is executed:

df = np.delete(df, [58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68], 1)

These indexes refer to the Reason End (RE) User Behaviour (UB) features in the state representation, which is one of the two identified leaking features.

In order to remove further features, and hence perform the ablation analysis, the following two steps have to be performed:

Step 1. Locate the indexes of the features to be removed. They can be found in lib/constants.py and looking at the STATE variable. For example, if you wish to remove the pauses (i.e., no_pause_before_play, short_pause_before_play, and long_pause_before_play), the indexes are: 2, 3, 4.

Step 2. Change the previous block of code in lib/base_model.py as follows:

df = np.delete(df, [58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68], 1)

➡️

df = np.delete(df, [2, 3, 4, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68], 1)

Please note that these indexes are for when interacting with the STATE variable, and therefore on a state without the temporal data leakage problem. The TEMPORAL_LEAKAGE_STATE refers to the original and not corrected state representation. Since it contains the Session Length (SL), the indexes have to be adjusted accordingly.

Hyperparameter Optimisation

python3 model_optimizer.py --model-name MyFirstOptDQN \
    --dqn-type dqn \
    --network mlp \
    --n-trials 20 \
    --n-jobs 2 \
    --test-set data/test_set/test_set.parquet

This script will perform hyperparameter optimisation on the given DQN type and network type. Given a set of parameters, the DQN is run five times on the given test set. This process, whereby a different set of parameters is selected, is repeated for n-trials times. The results and the models can be located at /results and /results/opt respectively.

Below is the list of all command line parameters:

Command	Description
`--model-name`	Name of the model (e.g., MyFirstDQN)
`-v`	[Optional]. Enable verbose printing
`-n`	[Optional]. Specify the number of episodes for training and evaluation. If not set, all episodes are considered
`--dqn-type`	[Optional, default: `dqn`]. Type of DQN: Vanilla DQN (`dqn`), Double DQN (`ddqn`), or Dueling DQN (`dueling_dqn`)
`--n-step`	[Optional, default: `1`]. n-step learning (>=1).
`--network`	[Optional, default: `mlp`]. Type of network: MLP (`mlp`), Observations Stacking (`mlp_obs_stacking`), LSTM (`lstm`), or GRU (`gru`)
`--no-temporal-leakage`	[Optional]. Do not remove the data leaking features. By default (i.e., no flag), such features are removed from the state representation
`--n-trials`	[Optional, default: `10`]. Number of trials
`--n-jobs`	[Optional, default: `1`] Number of jobs for parallel execution
`--test-set`	The path to testing set (e.g., `data/testing_set/test_set.parquet`)

Cite

Please, cite this work as follows:

@inproceedings{10.1145/3576840.3578312,
  author = {Meggetto, Francesco and Revie, Crawford and Levine, John and Moshfeghi, Yashar},
  title = {Why People Skip Music? On Predicting Music Skips Using Deep Reinforcement Learning},
  year = {2023},
  isbn = {9798400700354},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  url = {https://doi.org/10.1145/3576840.3578312},
  doi = {10.1145/3576840.3578312},
  booktitle = {Proceedings of the 2023 Conference on Human Information Interaction and Retrieval},
  pages = {95–106},
  numpages = {12},
  keywords = {Spotify, Deep Reinforcement Learning, Music, Prediction, User Behaviour, Skipping},
  location = {Austin, TX, USA},
  series = {CHIIR '23}
}

Francesco Meggetto, Crawford Revie, John Levine, and Yashar Moshfeghi. 2023. Why People Skip Music? On Predicting Music Skips using Deep Reinforcement Learning. In Proceedings of the 2023 Conference on Human Information Interaction and Retrieval (CHIIR '23). Association for Computing Machinery, New York, NY, USA, 95–106. https://doi.org/10.1145/3576840.3578312

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spotify-XRL-Skipping-Prediction

Installation

Data Preparation

Deep Reinforcement Learning Model

Manual Execution

Post-Hoc (SHAP) analysis

Ablation Analysis of the State Representation

Hyperparameter Optimisation

Cite

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
DRL		DRL
data		data
lib		lib
results		results
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data_preprocessing.py		data_preprocessing.py
model.py		model.py
model_optimizer.py		model_optimizer.py
requirements.txt		requirements.txt
setup.py		setup.py

License

MeggettoFrancesco/Spotify-XRL-Skipping-Prediction

Folders and files

Latest commit

History

Repository files navigation

Spotify-XRL-Skipping-Prediction

Installation

Data Preparation

Deep Reinforcement Learning Model

Manual Execution

Post-Hoc (SHAP) analysis

Ablation Analysis of the State Representation

Hyperparameter Optimisation

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages