Researchers have shown that sentiment analysis of software artifacts can improve various software engineering tools, including API and library recommendation systems, code suggestion tools, and tools for improving communication among software developers. However, sentiment analysis techniques applied to software artifacts have not yet yielded high accuracy. Recent adaptations of sentiment analysis tools to the software domain have reported some improvements, but the f-measures for positive and negative sentences still remain in the 0.4-0.64 range, which limits their practical usefulness for software engineering tools. In this paper, we explore the effectiveness of customizing BERT, a language representation model that has recently achieved strong results on various Natural Language Processing tasks on English texts, for the task of sentiment analysis of software artifacts. We describe our application of BERT to analyzing the sentiment of sentences in Stack Overflow posts and compare a BERT-based sentiment classifier to state-of-the-art sentiment analysis techniques on a domain-specific data set created from Stack Overflow posts. We also investigate how the performance of sentiment analysis changes when using a data set three times larger than those of previous studies. Our results show that the BERT classifier achieves reliable performance for sentiment analysis of software engineering texts. BERT combined with the larger data set achieves an overall f-measure of 0.87, with the f-measures for negative and positive sentences reaching 0.91 and 0.78 respectively, a significant improvement over the state-of-the-art.
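The per-class f-measures reported above are the harmonic mean of precision and recall for each sentiment class. A minimal sketch of how such scores are computed from a classifier's confusion counts (the counts below are purely hypothetical, chosen only to land near the 0.91 and 0.78 values for illustration; they are not the paper's actual data):

```python
def f_measure(tp: int, fp: int, fn: int) -> float:
    """F1 score: harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical per-class counts for a sentence-level sentiment classifier:
# (true positives, false positives, false negatives)
counts = {
    "negative": (180, 20, 15),
    "positive": (70, 25, 15),
}

for label, (tp, fp, fn) in counts.items():
    print(f"{label}: f1 = {f_measure(tp, fp, fn):.2f}")
# negative: f1 = 0.91
# positive: f1 = 0.78
```

Per-class F1 is the standard choice here because sentiment data sets from Stack Overflow are heavily imbalanced toward neutral sentences, so plain accuracy would overstate performance on the minority positive and negative classes.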