首页> 外文期刊>Enterprise information systems >Personalised news filtering and recommendation system using Chi-square statistics-based K-nearest neighbour ((SB)-S-2-KNN) model
【24h】

Personalised news filtering and recommendation system using Chi-square statistics-based K-nearest neighbour ((SB)-S-2-KNN) model

机译:使用基于卡方统计的K近邻((SB)-S-2-KNN)模型的个性化新闻过滤和推荐系统

获取原文
获取原文并翻译 | 示例
           

摘要

Recommendation problem has been extensively studied by researchers in the field of data mining, database and information retrieval. This study presents the design and realisation of an automated, personalised news recommendations system based on Chi-square statistics-based K-nearest neighbour ((SB)-S-2-KNN) model. The proposed (SB)-S-2-KNN model has the potential to overcome computational complexity and information overloading problems, reduces runtime and speeds up execution process through the use of critical value of (2) distribution. The proposed recommendation engine can alleviate scalability challenges through combined online pattern discovery and pattern matching for real-time recommendations. This work also showcases the development of a novel method of feature selection referred to as Data Discretisation-Based feature selection method. This is used for selecting the best features for the proposed (SB)-S-2-KNN algorithm at the preprocessing stage of the classification procedures. The implementation of the proposed (SB)-S-2-KNN model is achieved through the use of a developed in-house Java program on an experimental website called OUC newsreaders' website. Finally, we compared the performance of our system with two baseline methods which are traditional Euclidean distance K-nearest neighbour and Naive Bayesian techniques. The result shows a significant improvement of our method over the baseline methods studied.
机译:研究人员在数据挖掘,数据库和信息检索领域对推荐问题进行了广泛研究。这项研究提出了基于卡方统计的K近邻((SB)-S-2-KNN)模型的自动化,个性化新闻推荐系统的设计和实现。所提出的(SB)-S-2-KNN模型具有克服计算复杂性和信息过载问题的潜力,可以通过使用(2)分布的临界值来减少运行时间并加快执行过程。所提出的推荐引擎可以通过结合在线模式发现和实时推荐的模式匹配来缓解可伸缩性挑战。这项工作还展示了一种新的特征选择方法(称为基于数据离散化的特征选择方法)的发展。这用于在分类程序的预处理阶段为建议的(SB)-S-2-KNN算法选择最佳特征。拟议的(SB)-S-2-KNN模型的实现是通过在称为OUC新闻阅读器的实验网站上使用内部开发的Java程序实现的。最后,我们将系统的性能与两种基线方法进行了比较,这两种基线方法是传统的欧几里得距离K近邻法和朴素贝叶斯技术。结果表明,与所研究的基准方法相比,我们的方法有了重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号