首页> 外文会议>Graph-based methods for natural language processing workshop 2014 >A Novel Two-stage Framework for Extracting Opinionated Sentences from News Articles
【24h】

A Novel Two-stage Framework for Extracting Opinionated Sentences from News Articles

机译:从新闻文章中提取有意句子的新型两阶段框架

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a novel two-stage framework to extract opinionated sentences from a given news article. In the first stage, Naieve Bayes classifier by utilizing the local features assigns a score to each sentence - the score signifies the probability of the sentence to be opinionated. In the second stage, we use this prior within the HITS (Hyperlink-Induced Topic Search) schema to exploit the global structure of the article and relation between the sentences. In the HITS schema, the opinionated sentences are treated as Hubs and the facts around these opinions are treated as the Authorities. The algorithm is implemented and evaluated against a set of manually marked data. We show that using HITS significantly improves the precision over the baseline Naieve Bayes classifier. We also argue that the proposed method actually discovers the underlying structure of the article, thus extracting various opinions, grouped with supporting facts as well as other supporting opinions from the article.
机译:本文提出了一种新颖的两阶段框架,可从给定的新闻文章中提取有观点的句子。在第一阶段,Naieve Bayes分类器通过利用局部特征为每个句子分配一个分数-分数表示该句子被接受的可能性。在第二阶段,我们在HITS(超链接诱导主题搜索)模式中使用此优先级,以利用文章的全局结构以及句子之间的关系。在HITS架构中,将有观点的句子视为中心,并将围绕这些观点的事实视为权威。该算法是针对一组手动标记的数据实施和评估的。我们显示,与基线Naieve Bayes分类器相比,使用HITS可以显着提高精度。我们还认为,所提出的方法实际上发现了文章的底层结构,从而提取了各种观点,并结合了文章中的支持事实和其他支持观点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号