首页> 外文会议>International Conference on Web Research >Predicting Subjective Features from Questions on QA Websites using BERT
【24h】

Predicting Subjective Features from Questions on QA Websites using BERT

机译:使用BERT预测QA网站问题的主观特征

获取原文

摘要

Community Question-Answering websites, such as StackOverflow and Quora, expect users to follow specific guidelines in order to maintain content quality. These systems mainly rely on community reports for assessing contents, which has serious problems, such as the slow handling of violations, the loss of normal and experienced users' time, the low quality of some reports, and discouraging feedback to new users. Therefore, with the overall goal of providing solutions for automating moderation actions in Q&A websites, we aim to provide a model to predict 20 quality or subjective aspects of questions in QA websites. To this end, we used data gathered by the CrowdSource team at Google Research in 2019 and fine-tuned pre-trained BERT model on our problem. Based on our evaluation, model achieved value of 0.046 for Mean-Squared-Error (MSE) after 2 epochs of training, which did not improve substantially in the next ones. Results confirm that by simple fine-tuning, we can achieve accurate models in little time and on less amount of data.11Code is available at: https://github.com/Moradnejad/Predicting-Subjective-Features-on-QA-Websites
机译:社区质疑答案网站,如stackoverflow和quora,期望用户遵循特定指南以维护内容质量。这些系统主要依赖于社区报告来评估内容,这具有严重的问题,例如违规的缓慢处理,正常和经验丰富的用户的时间,一些报告的低质量,以及向新用户劝阻反馈。因此,通过为Q&A网站提供自动化审核行动的解决方案的总体目标,我们的目标是提供一种模型来预测QA网站中的20个质量或主观方面。为此,我们在2019年在谷歌研究中使用了Crowdsource团队收集的数据,并在我们的问题上进行了微调预先训练的BERT BERT模型。根据我们的评价,模型在培训2时代后的平均误差(MSE)实现了0.046的值,这在下一个训练中没有改善。结果确认,通过简单的微调,我们可以在几小时和更少的数据上实现准确的模型。 1 1 代码可在:https://github.com/moradnejad/predicting-subjective-features-on-qa-websites

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号