Journal: 计算机科学 (Computer Science)

Evaluating and Predicting the Quality of Questions and Answers in Chinese Community Question Answering

Abstract

Knowledge-sharing websites open new research opportunities for automatic question answering. The questions and answers that users contribute, however, vary widely in quality: alongside useful information they may contain irrelevant or even malicious content. Identifying and filtering such content, and selecting high-quality question-answer pairs, makes it possible to reuse the answers to related questions in a community-based question answering system and so improve its quality of service. We first crawled a large number of questions and answers from a Chinese community question answering site and used social network analysis to characterize the interaction between askers and answerers. Following a given set of quality criteria, we then manually annotated more than 3,000 questions and their answers. From these we extracted two feature sets, text and non-text, and used machine learning algorithms to design and implement a feature-based question-answer quality classifier. Experiments show that its precision and recall are both above 70%. Finally, we analyze the main factors that affect question and answer quality in the community network.
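The abstract names neither the individual features nor the learning algorithm, so the following Python sketch is only an illustration of the general shape of such a pipeline: text and non-text signals are combined into one feature vector, and predictions are scored against manual labels with precision and recall. Every field name, feature, and threshold here (`QAPair`, `answer_votes`, `answerer_best_count`, `toy_rule`) is a hypothetical stand-in, not something taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class QAPair:
    question: str
    answer: str
    answer_votes: int          # hypothetical non-text signal
    answerer_best_count: int   # hypothetical non-text signal
    label: int                 # manual annotation: 1 = high quality, 0 = other


def extract_features(pair):
    """Combine text and non-text features into one flat vector (all hypothetical)."""
    text = [len(pair.answer), len(pair.answer) / max(len(pair.question), 1)]
    non_text = [pair.answer_votes, pair.answerer_best_count]
    return text + non_text


def toy_rule(features):
    """Placeholder for a trained classifier: a fixed threshold rule, assumed for illustration."""
    answer_length, _ratio, votes, _best_count = features
    return 1 if answer_length >= 20 and votes >= 1 else 0


def precision_recall(labels, predictions):
    """Precision and recall for the positive (high-quality) class."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall
```

In practice `toy_rule` would be replaced by a model trained on the annotated pairs; the scoring function stays the same regardless of which classifier produces the predictions.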
