Build Your Own Music Recommender by Modeling Internet Radio Streams

Oren Somekh
Senior Scientist, Yahoo! Labs
Given on: March 8th, 2013


In the Internet music scene, where recommendation technology is key for navigating huge collections, large market players enjoy a considerable advantage. Access to a wider pool of user feedback leads to an increasingly accurate analysis of user tastes, effectively creating a "rich get richer" effect. This work aims to significantly lower the entry barrier for creating music recommenders, through a paradigm that couples a public data source with a new collaborative filtering (CF) model. We claim that Internet radio stations form a readily available resource of abundant, fresh human signals on music through their playlists, which are essentially cohesive sets of related tracks.

In a way, our models rely on the knowledge of a diverse group of experts in lieu of the commonly used wisdom of crowds. Over several weeks, we aggregated publicly available playlists of thousands of Internet radio stations, resulting in a dataset encompassing millions of plays and hundreds of thousands of tracks and artists. This provides the large-scale ground data necessary to mitigate the cold start problem of new items at both mature and emerging services.

Furthermore, we developed a new probabilistic CF model, tailored to the Internet radio resource. The success of the model was empirically validated on the collected dataset. Moreover, we tested the model in a cross-source transfer learning setting: the same model trained on the Internet radio data was used to predict the behavior of Yahoo! Music users. This demonstrates the ability to tap the Internet radio signals in other music recommendation setups. Based on encouraging empirical results, our hope is that the proposed paradigm will make quality music recommendation accessible to all interested parties in the community.


Oren Somekh received his PhD from the EE department of the Technion, Israel, studying information-theoretic aspects of cooperative wireless networks. He spent several years in the high-tech industry as the VP R&D and co-founder of Surf Communication Solutions Ltd, Israel. From 2005 to 2009 he was a visiting research fellow at the EE departments of NJIT and Princeton University. Since 2009 he has been a Senior Scientist at Yahoo! Labs, exploring various scientific aspects of Internet technologies such as search, recommendation systems, and online social networks.