An empirical assessment of best-answer prediction models in technical Q&A sites

Calefato Fabio; Lanubile Filippo; Novielli Nicole

首页> 外文期刊>Empirical Software Engineering >An empirical assessment of best-answer prediction models in technical Q&A sites

【24h】

An empirical assessment of best-answer prediction models in technical Q&A sites

机译：对技术问答网站中最佳答案预测模型的实证评估

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Technical Q&A sites have become essential for software engineers as they constantly seek help from other experts to solve their work problems. Despite their success, many questions remain unresolved, sometimes because the asker does not acknowledge any helpful answer. In these cases, an information seeker can only browse all the answers within a question thread to assess their quality as potential solutions. We approach this time-consuming problem as a binary-classification task where a best-answer prediction model is built to identify the accepted answer among those within a resolved question thread, and the candidate solutions to those questions that have received answers but are still unresolved. In this paper, we report on a study aimed at assessing 26 best-answer prediction models in two steps. First, we study how models perform when predicting best answers in Stack Overflow, the most popular Q&A site for software engineers. Then, we assess performance in a cross-platform setting where the prediction models are trained on Stack Overflow and tested on other technical Q&A sites. Our findings show that the choice of the classifier and automatied parameter tuning have a large impact on the prediction of the best answer. We also demonstrate that our approach to the best-answer prediction problem is generalizable across technical Q&A sites. Finally, we provide practical recommendations to Q&A platform designers to curate and preserve the crowdsourced knowledge shared through these sites.

机译：技术问答网站对于软件工程师来说至关重要，因为他们不断寻求其他专家的帮助来解决他们的工作问题。尽管取得了成功，但许多问题仍未解决，有时是因为询问者没有认可任何有用的答案。在这些情况下，信息搜索者只能浏览问题线索中的所有答案，以评估其作为潜在解决方案的质量。我们将此耗时的问题视为二进制分类任务，其中建立了最佳答案预测模型以在已解决的问题线程中识别那些已接受的答案，以及已收到答案但仍未解决的那些问题的候选解决方案。在本文中，我们报告了一项旨在分两步评估26个最佳答案预测模型的研究。首先，我们研究在Stack Overflow（最流行的软件工程师问答网站）中预测最佳答案时模型的性能。然后，我们在跨平台设置中评估性能，在该设置中，对预测模型进行Stack Overflow训练，并在其他技术问答站点进行测试。我们的发现表明，分类器的选择和自动参数调整对最佳答案的预测有很大影响。我们还证明，我们针对最佳答案预测问题的方法可在技术问答网站上推广。最后，我们向问答平台设计人员提供实用建议，以策划和保存通过这些站点共享的众包知识。

著录项

来源
《Empirical Software Engineering》 |2019年第2期|854-901|共48页
作者
Calefato Fabio; Lanubile Filippo; Novielli Nicole;
展开▼
作者单位

Univ Bari A Moro, Dipartimento Jonico, Via Duomo 259, I-74123 Taranto, Italy;

Univ Bari A Moro, Dipartimento Infomat, Via E Orabona 4, I-70125 Bari, Italy;

Univ Bari A Moro, Dipartimento Infomat, Via E Orabona 4, I-70125 Bari, Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Cross-platform prediction; Qa; Stack overflow; Crowdsourcing; Knowledge sharing; Imbalanced datasets;

机译：跨平台预测;问答;堆栈溢出;众包;知识共享;数据集不平衡;

相似文献

外文文献
中文文献
专利

1. How do developers discuss and support new programming languages in technical Q&A site? An empirical study of Go, Swift, and Rust in Stack Overflow [J] . Chakraborty Partha, Shahriyar Rifat, Iqbal Anindya, Information and software technology . 2021,第Sepa期

机译：开发人员如何在技术问答网站中讨论和支持新的编程语言？堆栈溢出中的Go，Swift和Rust的实证研究
2. Comparative assessment of data obtained using empirical models for path loss predictions in a university campus environment [J] . Segun I. Popoola, Aderemi A. Atayero, Oluwafunso A. Popoola Data in Brief . 2018,第2期

机译：利用大学校园环境下路径损失预测获得的数据的比较评估
3. Assessment of runup predictions by empirical models on non-truncated beaches on the south-east Australian coast [J] . Atkinson Alexander L., Power Hannah E., Moura Theo, Coastal engineering . 2017,第JANa期

机译：通过经验模型评估澳大利亚东南沿海非截断海滩的径流预测
4. Clutter forecast - a synthesis of mesoscale numerical weather prediction and empirical site specific radar clutter models [C] . LeFurjah George, Marshall Robert, Casey Timothy S., IEEE Radar Conference . 2008

机译：杂波预测 - 迈空数值天气象预测的合成及经验现场特定雷达杂波模型
5. Mechanistic-Empirical Failure Prediction Models for Spring Weight Restricted Flexible Pavements in Manitoba Using Manitoba and MnROAD Instrumented Test Sites. [D] . Kavanagh, Leonnie N. 2013

机译：使用曼尼托巴省和MnROAD仪器测试站点的曼尼托巴省弹簧重量受限柔性路面的机械-经验失效预测模型。
6. Comparative assessment of data obtained using empirical models for path loss predictions in a university campus environment [O] . Segun I. Popoola, Aderemi A. Atayero, Oluwafunso A. Popoola 2018

机译：在大学校园环境中使用经验模型进行路径损耗预测的数据的比较评估
7. An empirical assessment of best-answer prediction models in technical QA sites [O] . Fabio Calefato, Filippo Lanubile, Nicole Novielli 2018

机译：技术问答网站上最佳答案预测模型的实证评估
8. SITE-94. Discrete-feature modelling of the Aespoe Site: 3. Predictions of hydrogeological parameters for performance assessment [R] . Geier, J. E. 1996

机译：sITE-94。 aespoe场地的离散特征建模：3。用于绩效评估的水文地质参数预测

An empirical assessment of best-answer prediction models in technical Q&A sites

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅