首页> 外文会议>International conference on emerging trends in information technology >Music Listening History Dataset Curation and Distributed Music Recommendation Engines Using Collaborative Filtering
【24h】

Music Listening History Dataset Curation and Distributed Music Recommendation Engines Using Collaborative Filtering

机译:使用协作过滤的音乐聆听历史数据集策策和分布式音乐推荐引擎

获取原文

摘要

In this paper, a new system for creation of music recommendation engines based on massively large datasets containing music listening patterns for users is introduced. The engine is designed to be scalable and hence, uses distributed computing clusters to perform tasks and calculations. The collaborative filtering algorithm is implemented on a distributed Apache Spark cluster to maximize performance and efficiency. Also, the open source Creative Commons (CC0) licensed ListenBrainz dataset is introduced. It makes public for commercial and non-commercial usage the music listening histories of over 4500 users and counting. The dataset includes over 180 million unique listening records (called listens) from various sources with additional metadata including MusicBrainz IDs which link the listens to additional sources of information. The collection of such music listening data is very useful for industry and research as open data in large quantities is not easily available. A system is developed to allow continuous access to this data and to release snapshots of the data regularly. The aforementioned music recommendation engine then uses this data to train its models and create recommendations for users.
机译:在本文中,介绍了一种基于包含用于用户音乐聆听模式的大型数据集的音乐推荐引擎的新系统。该发动机被设计为可扩展,因此,使用分布式计算集群来执行任务和计算。协作滤波算法在分布式Apache Spark集群上实现,以最大限度地提高性能和效率。此外,介绍了开源创新共享(CC0)许可的ListenBrainz数据集。它公开用于商业和非商业用途的音乐收听历史,超过4500用户和计数。数据集包括来自各种来源的超过1.5亿个独特的聆听记录(称为侦听),其中包括MusicBrainz ID,其中包含侦听侦听到其他信息源。这些音乐聆听数据的集合对于工业和研究非常有用,因为大量的开放数据不易使用。开发系统以允许持续访问此数据并定期释放数据的快照。然后,上述音乐推荐引擎然后使用此数据来训练其模型并为用户创建建议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号