首页> 外文会议>Annual German Conference on Artificial Intelligence >Learning the Importance of Latent Topics to Discover Highly Influential News Items
【24h】

Learning the Importance of Latent Topics to Discover Highly Influential News Items

机译:学习潜在主题的重要性,以发现高度影响力的新闻项目

获取原文

摘要

Online news is a major source of information for many people. The overwhelming amount of new articles published every day makes it necessary to filter out unimportant ones and detect ground breaking new articles. In this paper, we propose the use of Latent Dirichlet Allocation (LDA) to find the hidden factors of important news stories. These factors are then used to train a Support Vector Machine (SVM) to classify new news items as they appear. We compare our results with SVMs based on a bag-of-words approach and other language features. The advantage of a LDA processing is not only a better accuracy in predicting important news, but also a better interpretability of the results. The latent topics show directly the important factors of a news story.
机译:在线新闻是许多人的主要信息来源。每天发布的压倒性新的文章都会使得有必要过滤输出不重要的产品并检测地面打破新文章。在本文中,我们建议使用潜在的Dirichlet分配(LDA)来找到重要新闻故事的隐藏因素。然后使用这些因素来培训支持向量机(SVM)以在出现时对新闻项进行分类。我们将使用SVMS基于单词袋方法和其他语言特征进行比较我们的结果。 LDA处理的优点不仅可以预测重要新闻的更好准确性,而且还具有更好的结果。潜在主题直接显示新闻故事的重要因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号