首页> 外文期刊>Decision support systems >External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?
【24h】

External validity of sentiment mining reports: Can current methods identify demographic biases, event biases, and manipulation of reviews?

机译:情感挖掘报告的外部有效性:当前的方法可以识别人口统计偏差,事件偏差以及对评论的操纵吗?

获取原文
获取原文并翻译 | 示例
       

摘要

Many publications in sentiment mining provide new techniques for improved accuracy in extracting features and corresponding sentiments in texts. For the external validity of these sentiment reports, i.e., the applicability of the results to target audiences, it is important to well analyze data of the context of user-generated content and their sample of authors. The literature lacks an analysis of external validity of sentiment mining reports and the sentiment mining field lacks an operationalization of external validity dimensions toward practically useful techniques. From a kernel theory, we identify multiple threats to sentiment mining external validity and study three of them empirically 1) a mismatch in demographics of the reviewers sample, 2) bias due to reviewers' incidental experiences, and 3) manipulation of reviews. The value of external validity threat identifying techniques is next examined in cases from Goodread.com. We conclude that demographic biases can be well detected by current techniques, although we have doubts regarding stylometric techniques for this purpose. We demonstrate the usefulness of event and manipulation bias detection techniques in our cases, but this result needs further replications in more complex and more competitive contexts. Finally, for increasing the decisional usefulness of sentiment mining reports, they should be accompanied by external validity reports and software and service providers in this field should incorporate these in their offerings.
机译:情感挖掘中的许多出版物提供了新的技术,以提高文本中特征和相应情感的提取精度。对于这些情感报告的外部有效性,即结果对目标受众的适用性,重要的是要很好地分析用户生成的内容及其作者样本的背景数据。文献缺乏对情感挖掘报告的外部有效性的分析,并且情感挖掘领域缺乏针对实际有用技术的外部有效性维度的可操作性。从核心理论出发,我们确定了挖掘情感外部有效性的多种威胁,并通过经验研究了其中三个:1)审阅者样本的人口统计不匹配,2)审阅者偶然经历引起的偏见以及3)审阅操作。接下来,将在Goodread.com上检查外部有效性威胁识别技术的价值。我们得出结论,尽管我们对用于此目的的测速技术存有疑问,但当前的技术可以很好地检测到人口统计学偏差。我们展示了事件和操作偏差检测技术在我们的案例中的有用性,但是此结果需要在更复杂和更具竞争性的情况下进一步复制。最后,为了提高情感挖掘报告的决策效用,应在报告中随附外部有效性报告,并且该领域的软件和服务提供商应将这些报告纳入其产品中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号