首页> 外文会议>IEEE International Conference on Software Maintenance and Evolution >Achieving Reliable Sentiment Analysis in the Software Engineering Domain using BERT
【24h】

Achieving Reliable Sentiment Analysis in the Software Engineering Domain using BERT

机译:使用BERT在软件工程领域中进行可靠的情感分析

获取原文

摘要

Researchers have shown that sentiment analysis of software artifacts can potentially improve various software engineering tools, including API and library recommendation systems, code suggestion tools, and tools for improving communication among software developers. However, sentiment analysis techniques applied to software artifacts still have not yet yielded very high accuracy. Recent adaptations of sentiment analysis tools to the software domain have reported some improvements, but the f-measures for the positive and negative sentences still remain in the 0.4-0.64 range, which deters their practical usefulness for software engineering tools.In this paper, we explore the potential effectiveness of customizing BERT, a language representation model, which has recently achieved very good results on various Natural Language Processing tasks on English texts, for the task of sentiment analysis of software artifacts. We describe our application of BERT to analyzing sentiments of sentences in Stack Overflow posts and compare the impact of a BERT sentiment classifier to state-of-the-art sentiment analysis techniques when used on a domain-specific data set created from Stack Overflow posts. We also investigate how the performance of sentiment analysis changes when using a much (3 times) larger data set than previous studies. Our results show that the BERT classifier achieves reliable performance for sentiment analysis of software engineering texts. BERT combined with the larger data set achieves an overall f-measure of 0.87, with the f-measures for the negative and positive sentences reaching 0.91 and 0.78 respectively, a significant improvement over the state-of-the-art.
机译:研究人员已经表明,对软件工件的情感分析可以潜在地改善各种软件工程工具,包括API和库推荐系统,代码建议工具以及用于改善软件开发人员之间的通信的工具。但是,应用于软件工件的情感分析技术仍未产生非常高的准确性。情感分析工具对软件领域的最新调整已报告了一些改进,但肯定句和否定句的f量度仍保持在0.4-0.64范围内,这阻止了它们对软件工程工具的实用性。探索定制语言表示模型BERT的潜在效果,最近在英语文本的各种自然语言处理任务上,对软件工件的情感分析任务取得了很好的效果。我们描述了BERT在分析Stack Overflow帖子中的句子情感上的应用,并比较了将BERT情感分类器与由Stack Overflow帖子创建的特定领域数据集所使用的最新情感分析技术的影响。当使用比以前的研究大得多(3倍)的数据集时,我们还研究情绪分析的性能如何变化。我们的结果表明,BERT分类器可为软件工程文本的情感分析提供可靠的性能。 BERT与更大的数据集相结合,实现了0.87的整体f度量,否定句和肯定句的f度量分别达到0.91和0.78,这比现有技术有了显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号