首页> 外文会议>International conference on intelligent text processing and computational linguistics;CICLing 2012 >A Domain Independent Framework to Extract and Aggregate Analogous Features in Online Reviews
【24h】

A Domain Independent Framework to Extract and Aggregate Analogous Features in Online Reviews

机译:一个域独立框架,用于提取和汇总在线评论中的类似功能

获取原文

摘要

Extracting and detecting features from online reviews is both important and challenging, especially when domain knowledge is not explicitly available. Moreover, opinions about the same feature of a product or service are frequently expressed in various lexical forms. In this paper, we present a novel framework to automatically detect, extract and aggregate semantically related features of reviewed products and services. Our model uses sentence level syntactic and lexical information to detect candidate feature words, and corpus level co-occurrence statistics to perform grouping of semantically similar features to obtain high precision feature detection. The high precision feature assembly capability of our model has a distinct advantage over state of the art approaches, like double propagation, by producing short and succinct sets of features compared to potential thousands of features that are generated by existing approaches. We evaluate our model in two completely unrelated domains, restaurant and camera online reviews, to verify its domain independence. The results of our model outperformed existing state of the art probabilistic models.
机译:从在线评论中提取和检测功能既重要又具有挑战性,尤其是在没有明确提供领域知识的情况下。而且,关于产品或服务的相同特征的观点经常以各种词汇形式表达。在本文中,我们提出了一个新颖的框架来自动检测,提取和聚合所审查产品和服务的语义相关特征。我们的模型使用句子级别的句法和词汇信息来检测候选特征词,并使用语料库级别的共现统计信息来对语义相似的特征进行分组以获得高精度的特征检测。与现有方法生成的数千个潜在特征相比,我们模型的高精度特征装配能力相对于最新方法(如双重传播)具有明显优势,它可以产生简短的特征集。我们在两个完全不相关的领域(餐厅和相机在线评论)中评估我们的模型,以验证其领域的独立性。我们模型的结果优于现有的最新概率模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号