...
首页> 外文期刊>Journal of Information Science >QSem: A novel question representation framework for question matching over accumulated question-answer data
【24h】

QSem: A novel question representation framework for question matching over accumulated question-answer data

机译:QSem:一种新颖的问题表示框架,用于对累积的问答数据进行问题匹配

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a novel question representation framework to assist automated question answering through reusing accumulated question-answer data. The framework, named QSem, defines three types of question words - question-target words, user-oriented words and irrelevant words, along with semantic patterns, for representing a question. The question word types are semantically labelled by a pre-defined ontology to enrich the semantic representation of questions. The semantic patterns through equivalent pattern linking enhance normal structure matching aiming at improving question matching performance. We trained QSem on 400 randomly selected questions with semantic patterns and obtained optimized parameters. After that, 5000 questions from our system were tested and the precision of question matching was between 0.71 and 0.93 with respect to various generators, indicating the stability of the approach. We further compared our approach with Cosine similarity, WordNet-based semantic similarity and IBM translation model on a standard TREC dataset containing 5536 questions. The results presented that our approach achieved best performance with mean reciprocal rank increased by 7.2% and accuracy increased by 7.5% on average, demonstrating the effectiveness of the approach.
机译:本文提出了一种新颖的问题表示框架,通过重用累积的问题答案数据来辅助自动问题解答。名为QSem的框架定义了三种类型的疑问词-问题目标词,面向用户的单词和不相关的词,以及用于表示问题的语义模式。疑问词类型在语义上由预定义的本体标记,以丰富问题的语义表示。通过等效模式链接的语义模式增强了正常结构匹配,旨在提高问题匹配性能。我们用语义模式对400个随机选择的问题进行了QSem训练,并获得了优化参数。之后,对我们系统中的5000个问题进行了测试,对于各种生成器,问题匹配的精度在0.71至0.93之间,这表明该方法的稳定性。我们在包含5536个问题的标准TREC数据集上进一步比较了余弦相似度,基于WordNet的语义相似度和IBM翻译模型的方法。结果表明,我们的方法取得了最佳性能,平均倒数排名平均提高了7.2%,准确度平均提高了7.5%,证明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号