首页> 外文期刊>Knowledge-Based Systems >Cross-ratio uninorms as an effective aggregation mechanism in sentiment analysis
【24h】

Cross-ratio uninorms as an effective aggregation mechanism in sentiment analysis

机译:跨比率单一性是情感分析中的有效汇总机制

获取原文
获取原文并翻译 | 示例

摘要

There are situations in which lexicon-based methods for Sentiment Analysis (SA) are not able to generate a classification output for specific instances of a given dataset. Most often, the reason for this situation is the absence of specific terms in the sentiment lexicon required in the classification effort. In such cases, there were only two possible paths to follow: (1) add terms to the lexicon (off-line process) by human intervention to guarantee no noise is introduced into the lexicon, which prevents the classification system to provide an immediate answer; or (2) use the services of a word-frequency dictionary (on-line process), which is computationally costly to build. This paper investigates an alternative approach to compensate for the lack of ability of a lexicon-based method to produce a classification output. The method is based on the combination of the classification outputs of non lexicon-based tools. Specifically, firstly the outcome values of applying two or more non-lexicon classification methods are obtained. Secondly, these non-lexicon outcomes are fused using a uninorm based approach, which has been proved to have desirable compensation properties as required in the SA context, to generate the classification output the lexicon based approach is unable to achieve. Experimental results based on the execution of two well-known supervised machine learning algorithms, namely Naive Bayes and Maximum Entropy, and the application of a cross-ratio uninorm operator are presented. Performance indices associated to options (1) and (2) above are compared against the results obtained using the proposed approach for two different datasets. Additionally, the performance of the proposed cross-ratio uninorm operator based approach is also compared when the aggregation operator used is the arithmetic mean instead. It is shown that the combination of non lexicon-based classification methods with specific uninorm operators improves the classification performance of lexicon-based methods, and it enables the offering of an alternative solution to the SA classification problem when needed. The proposed aggregation method could be used as well as a replacement of ensemble averaging techniques commonly applied when combining the results of several machine learning classifiers' outputs. (C) 2017 Elsevier B.V. All rights reserved.
机译:在某些情况下,基于词典的情感分析(SA)方法无法为给定数据集的特定实例生成分类输出。大多数情况下,这种情况的原因是分类工作中所需的情感词典中缺少特定术语。在这种情况下,只有两种可能的途径:(1)通过人工干预在词典中添加术语(脱机过程),以确保词典中不会引入噪声,这阻止了分类系统提供即时答案。 ;或(2)使用字频字典的服务(在线过程),这在计算上是昂贵的。本文研究了一种替代方法,以弥补基于词典的方法无法产生分类输出的能力。该方法基于非基于词典的工具的分类输出的组合。具体而言,首先获得应用两种或更多种非词典分类方法的结果值。其次,使用基于非标准的方法融合这些非词典结果,该方法已被证明具有SA上下文所需的理想补偿特性,以生成基于词典的方法无法实现的分类输出。提出了基于两种著名的监督机器学习算法(朴素贝叶斯算法和最大熵)的执行结果,以及跨比率单项算子的应用。将与上述选项(1)和(2)相关的性能指标与使用建议的方法针对两个不同数据集获得的结果进行比较。另外,当所使用的聚合算子代替算术平均值时,还比较了所提出的基于跨比率非偶数算子的方法的性能。结果表明,基于非词典的分类方法与特定的uninorm运算符的组合可以提高基于词典的方法的分类性能,并且可以在需要时为SA分类问题提供替代解决方案。当合并多个机器学习分类器的输出结果时,可以使用所提出的聚合方法,也可以替代通常使用的集成平均技术。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号