【24h】

Entity-Level Sentiment Analysis of Issue Comments

机译:问题注释的实体级情感分析

获取原文

摘要

Emotions and sentiment of software developers can largely influence the software productivity and quality. However, existing work on emotion mining and sentiment analysis is still in the early stage in software engineering in terms of accuracy, the size of datasets used and the specificity of the analysis. In this work, we are concerned with conducting entity-level sentiment analysis. We frst build a manually labeled dataset containing 3,000 issue comments selected from 231,732 issue comments collected from 10 open source projects in GitHub. Then we design and develop SentiSW, an entity-level sentiment analysis tool consisting of sentiment classification and entity recognition, which can classify issue comments into tuples. We evaluate the sentiment classification using ten-fold cross validation, and it achieves 68.71% mean precision, 63.98% mean recall and 77.19% accuracy , which is signifcantly higher than existing tools. We evaluate the entity recognition by manually annotation and it achieves a 75.15% accuracy.
机译:软件开发人员的情绪和情感会在很大程度上影响软件的生产率和质量。但是,就准确性,使用的数据集的大小和分析的特殊性而言,现有的情感挖掘和情感分析工作仍处于软件工程的早期阶段。在这项工作中,我们关注进行实体级别的情感分析。我们首先构建一个手动标记的数据集,其中包含3,000个问题注释,这些注释是从GitHub的10个开源项目收集的231,732个问题注释中选择的。然后,我们设计和开发SentiSW,这是一个由情感分类和实体识别组成的实体级情感分析工具,可以将问题注释分类为元组。我们使用十倍交叉验证对情感分类进行评估,它达到了68.71%的平均准确度,63.98%的平均召回率和77.19%的准确度,这明显高于现有工具。我们通过手动注释来评估实体识别,它可以达到75.15%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号