首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >Data intensive review mining for sentiment classification across heterogeneous domains
【24h】

Data intensive review mining for sentiment classification across heterogeneous domains

机译:数据密集型评论挖掘,用于跨异构域的情感分类

获取原文

摘要

The automatic detection of orientation and emotions in texts is becoming increasingly important in the Web 2.0 scenario. There is a considerable need for innovative techniques and tools capable of identifying and detecting the attitude of unstructured text. The paper tackles two crucial aspects of the sentiment classification problem: first, the computational complexity of the deployed framework; second, the ability of the framework itself to operate effectively in heterogeneous commercial domains. The proposed approach adopts empirical learning to implement the sentiment-classification technology, and uses a distance-based predictive model to combine computational efficiency and modularity. A suitably designed semantic-based metric is the cognitive core that measures the distance between two user reviews, according to the sentiment they communicate. The framework ultimately nullifies the training process; at the same time, it takes advantage of a classification procedure whose computational cost increases linearly when the training corpus increases. To attain an objective measurement of the actual accuracy of the sentiment classification method, a campaign of tests involved a pair of complex, real-world scoring domains; the goal was to compare the predicted sentiment scores with actual scores provided by human assessors. Experimental results confirmed that the overall approach attained satisfactory performances in terms of both cross-domain classification accuracy and computational efficiency.
机译:在Web 2.0场景中,文本方向和情感的自动检测变得越来越重要。迫切需要能够识别和检测非结构化文本态度的创新技术和工具。本文解决了情感分类问题的两个关键方面:第一,所部署框架的计算复杂性;第二,解决方案。第二,框架本身在异构商业领域中有效运作的能力。所提出的方法采用经验学习来实现情感分类技术,并使用基于距离的预测模型来结合计算效率和模块化。适当设计的基于语义的度量是认知核心,它可以根据两个用户评论之间的交流情绪来衡量它们之间的距离。该框架最终使培训过程无效。同时,利用分类程序的优势,当训练语料库增加时,其计算成本会线性增加。为了客观评估情感分类方法的实际准确性,测试活动涉及一对复杂的,真实的得分域;目的是将预测的情感分数与人类评估者提供的实际分数进行比较。实验结果证实,该总体方法在跨域分类精度和计算效率方面均取得了令人满意的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号