首页> 外文会议>International Conference on Contemporary Computing >User tweets based genre prediction and movie recommendation using LSI and SVD
【24h】

User tweets based genre prediction and movie recommendation using LSI and SVD

机译:使用LSI和SVD的基于用户推文的流派预测和电影推荐

获取原文

摘要

The emerging popularity and raise in users' posts on social media gave birth to numerous research challenges. Out of all challenges users' centric context information based recommendation is one prime research area to recommend jobs, events and movies. Here in this research work we focus on movie context aware recommendation and for this purpose, we analyze users' posted movie tweets to understand their intentions for the same. Therefore, the objective of this research work is to predict genre of movies based on user's posted movie tweets and recommending movies to users' according to predicted genre. For this purpose, we pre-processed twitter extracted movie tweets using tokenization, porter stemming, stop word removal and use Word-Net dictionary for synonym matching. Further, we apply Latent Semantic Indexing technique which in turn involves Singular Value Decomposition on this pre-processed data and predicts genre on the basis of IMDb movie genre categorization. The predicted genre conveys the movie interest of the user and to recommend movie on the basis of predicted genre which is measured through euclidean distance. We have extracted IMdb given movie data and further predicted genre using our proposed technique. To validate this we divided our dataset using pareto principle and matched with IMDb given genre data set and achieved approximate 70% accuracy using our approach.
机译:社交媒体上用户帖子的兴起和兴起带来了众多研究挑战。在所有挑战中,基于用户中心上下文信息的推荐是推荐工作,事件和电影的主要研究领域。在此研究工作中,我们重点关注电影上下文感知推荐,并为此目的,我们分析用户发布的电影推文以了解其意图。因此,这项研究工作的目的是根据用户发布的电影推文来预测电影的类型,并根据预测的类型向用户推荐电影。为此,我们使用标记化,波特词干提取,停止单词删除并使用Word-Net词典进行同义词匹配来对Twitter提取的电影tweet进行预处理。此外,我们应用了潜在语义索引技术,该技术随后又对该预处理数据进行了奇异值分解,并根据IMDb电影流派分类对流派进行了预测。预测体裁传达了用户的电影兴趣并且基于通过欧几里德距离测量的预测体裁来推荐电影。我们使用给定的电影数据提取了IMdb,并使用我们提出的技术进一步预测了类型。为了验证这一点,我们使用pareto原理对数据集进行了划分,并与给定体裁数据集的IMDb相匹配,并使用我们的方法实现了大约70%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号