【24h】

Entity-Level Sentiment Analysis of Issue Comments

机译:问题评论的实体级别情绪分析

获取原文

摘要

Emotions and sentiment of software developers can largely influence the software productivity and quality. However, existing work on emotion mining and sentiment analysis is still in the early stage in software engineering in terms of accuracy, the size of datasets used and the specificity of the analysis. In this work, we are concerned with conducting entity-level sentiment analysis. We frst build a manually labeled dataset containing 3,000 issue comments selected from 231,732 issue comments collected from 10 open source projects in GitHub. Then we design and develop SentiSW, an entity-level sentiment analysis tool consisting of sentiment classification and entity recognition, which can classify issue comments into <;sentiment, entity> tuples. We evaluate the sentiment classification using ten-fold cross validation, and it achieves 68.71% mean precision, 63.98% mean recall and 77.19% accuracy , which is signifcantly higher than existing tools. We evaluate the entity recognition by manually annotation and it achieves a 75.15% accuracy.
机译:软件开发人员的情感和情绪可能在很大程度上影响软件生产力和质量。但是,在精确率方面,现有的情感挖掘和情感分析的工作仍处于软件工程的早期阶段,使用的数据集的大小以及分析的特异性。在这项工作中,我们担心进行实体级感分析。我们将建立一个手动标记的数据集,其中包含3,000个问题的评论,从Github中的10个开源项目中收集的第231,732个发表评论。然后我们设计和开发Sentisw,一个由情绪分类和实体识别组成的实体级情绪分析工具,可以将问题评论分类为<;情绪,实体>元组。我们使用十倍交叉验证评估了情绪分类,其平均精度为68.71%,平均召回和77.19%的准确度,其明显高于现有工具。我们通过手动注释来评估实体识别,精度为75.15%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号