首页> 外文会议>Annual meeting of the Association for Computational Linguistics;ACL 2012 >Cross-Domain Co-Extraction of Sentiment and Topic Lexicons
【24h】

Cross-Domain Co-Extraction of Sentiment and Topic Lexicons

机译:情感和主题词汇的跨域共提取

获取原文

摘要

Extracting sentiment and topic lexicons is important for opinion mining. Previous works have showed that supervised learning methods are superior for this task. However, the performance of supervised methods highly relies on manually labeled training data. In this paper, we propose a domain adaptation framework for sentiment- and topic- lexicon co-extraction in a domain of interest where we do not require any labeled data, but have lots of labeled data in another related domain. The framework is twofold. In the first step, we generate a few high-confidence sentiment and topic seeds in the target domain. In the second step, we propose a novel Relational Adaptive bootstraPping (RAP) algorithm to expand the seeds in the target domain by exploiting the labeled source domain data and the relationships between topic and sentiment words. Experimental results show that our domain adaptation framework can extract precise lexicons in the target domain without any annotation.
机译:提取情绪和主题词典对于挖掘观点非常重要。以前的工作表明,有监督的学习方法对于完成这项任务是优越的。但是,监督方法的性能高度依赖于手动标记的训练数据。在本文中,我们提出了一个领域适应框架,用于感兴趣领域中的情感和主题词典共提取,在该领域中我们不需要任何标记数据,但是在另一个相关域中有大量标记数据。该框架是双重的。第一步,我们在目标域中生成一些高信心情绪和主题种子。在第二步中,我们提出了一种新颖的关系自适应自举(RAP)算法,以通过利用标记的源域数据以及主题词和情感词之间的关系来扩展目标域中的种子。实验结果表明,我们的域自适应框架可以在没有任何注释的情况下提取目标域中的精确词典。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号