首页> 外文会议>International Symposium on Computational Intelligence and Design >Open Domain Question Answering with Character-Level Deep Learning Models
【24h】

Open Domain Question Answering with Character-Level Deep Learning Models

机译:字符级深度学习模型的开放域问答

获取原文

摘要

Single-relation factoid question answering (QA) is strongly supported by rich sources of facts from knowledge bases (KB). However, there are many irrelevant information in questions and overwhelming number of facts in knowledge bases, mak-ing it difficult to capture goal entity and relation involved in a question. In order to settle these issues, firstly, a state-of-the-art sequence tagging model (BiLSTM-CRF) is adopted to detect the entity mention in a question. Then, we propose a n-gram match (NGM) algorithm with Chinese-specific rules and an attention-based siamese bidirectional long-short term memory (ASBLSTM) model to measure the lexical and se-mantic similarity between questions and candidate facts. Our whole method requires no hand-crafted template or feature engineering. In addition, character-level models are proved to be effective in solving the out of vocabulary (OOV) issue and improving the accuracy in Chinese KBQA task. Experiment results show that our system outperforms the best system with deep learning models in the KBQA share task of the Confer-ence on Natural Language Processing and Chinese Compu-ting (NLPCC) 2016 and our system achieves an AverageF1 measure of 80.97% and 37.18% on test dataset in NLPCC 2016 and 2017 respectively.
机译:单一关系事实类问题解答(QA)受到知识库(KB)丰富的事实来源的有力支持。但是,问题中有许多不相关的信息,知识库中的事实太多,这使得捕获问题中涉及的目标实体和关系变得困难。为了解决这些问题,首先,采用最新的序列标签模型(BiLSTM-CRF)来检测问题中提到的实体。然后,我们提出了一种具有中文特定规则的n-gram匹配(NGM)算法和一个基于注意力的暹罗双向长期短期记忆(ASBLSTM)模型,用于测量问题和候选事实之间的词汇和语义相似度。我们的整个方法不需要手工制作的模板或特征工程。此外,字符级模型被证明可有效解决词汇量不足(OOV)问题并提高中文KBQA任务的准确性。实验结果表明,在2016年自然语言处理和中文计算会议(KBC)的KBQA共享任务中,我们的系统优于具有深度学习模型的最佳系统,并且我们的系统实现了80.97 \%和37.18的AverageF1测度\%分别在NLPCC 2016和2017中的测试数据集上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号