首页> 外文期刊>ACM transactions on intelligent systems >DiffQue: Estimating Relative Difficulty of Questions in Community Question Answering Services
【24h】

DiffQue: Estimating Relative Difficulty of Questions in Community Question Answering Services

机译:DiffQue:估计社区问答服务中问题的相对难度

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic estimation of relative difficulty of a pair of questions is an important and challenging problem in community question answering (CQA) services. There are limited studies that addressed this problem. Past studies mostly leveraged expertise of users answering the questions and barely considered other properties of CQA services such as metadata of users and posts, temporal information, and textual content. In this article, we propose DiffQue, a novel system that maps this problem to a network-aided edge directionality prediction problem. DiffQue starts by constructing a novel network structure that captures different notions of difficulties among a pair of questions. It then measures the relative difficulty of two questions by predicting the direction of a (virtual) edge connecting these two questions in the network. It leverages features extracted from the network structure, metadata of users/posts, and textual description of questions and answers. Experiments on datasets obtained from two CQA sites (further divided into four datasets) with human annotated ground-truth show that DiffQue outperforms four state-of-the-art methods by a significant margin (28.77% higher F 1 score and 28.72% higher AUC than the best baseline). As opposed to the other baselines, (i) DiffQue appropriately responds to the training noise, (ii) DiffQue is capable of adapting multiple domains (CQA datasets), and (iii) DiffQue can efficiently handle the "cold start" problem that may arise due to the lack of information for newly posted questions or newly arrived users.
机译:在社区问答(CQA)服务中,自动估计一对问题的相对难度是一个重要且具有挑战性的问题。解决此问题的研究有限。过去的研究大多利用用户回答问题的专业知识,而很少考虑CQA服务的其他属性,例如用户和帖子的元数据,时间信息和文本内容。在本文中,我们提出了DiffQue,这是一个新颖的系统,可将这个问题映射到网络辅助的边缘方向性预测问题。 DiffQue首先构建一个新颖的网络结构,该结构捕获一对问题之间不同的困难概念。然后,通过预测网络中连接这两个问题的(虚拟)边的方向来测量两个问题的相对难度。它利用从网络结构中提取的功能,用户/帖子的元数据以及问题和答案的文字描述。对从两个带有人工注释地面的CQA站点(进一步分为四个数据集)获得的数据集进行的实验表明,DiffQue优于四种最新方法(F 1得分高28.77%,AUC更高28.72%)比最佳基准)。与其他基准相反,(i)DiffQue可以适当地响应训练噪声,(ii)DiffQue能够适应多个域(CQA数据集),并且(iii)DiffQue可以有效处理可能出现的“冷启动”问题由于缺少有关新发布的问题或新到达的用户的信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号