...
首页> 外文期刊>ACM SIGIR FORUM >Experiments with Convolutional Neural Network Models for Answer Selection
【24h】

Experiments with Convolutional Neural Network Models for Answer Selection

机译:卷积神经网络模型用于答案选择的实验

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In recent years, neural networks have been applied to many textrnprocessing problems. One example is learning a similarity functionrnbetween pairs of text, which has applications to paraphrasernextraction, plagiarism detection, question answering, and ad hocrnretrieval. Within the information retrieval community, the convolutionalrnneural network model proposed by Severyn and MoschiŠirnin a SIGIR 2015 paper has gained prominence. Œis paper focusesrnon the problem of answer selection for question answering: wernaŠempt to replicate the results of Severyn and MoschiŠi using theirrnopen-source code as well as to reproduce their results via a de novorn(i.e., from scratch) implementation using a completely di‚erentrndeep learning toolkit. Our de novo implementation is instructivernin ascertaining whether reported results generalize across toolkits,rneach of which have their idiosyncrasies. We were able to successfullyrnreplicate and reproduce the reported results of Severynrnand MoschiŠi, albeit with minor di‚erences in e‚ectiveness, butrnarming the overall design of their model. Additional ablation experimentsrnbreak down the components of the model to show theirrncontributions to overall e‚ectiveness. Interestingly, we €nd thatrnremoving one component actually increases e‚ectiveness and thatrna simpli€ed model with only four word overlap features performsrnsurprisingly well, even beŠer than convolution feature maps alone
机译:近年来,神经网络已应用于许多文本处理问题。一个示例是学习成对的文本之间的相似性功能,该功能可应用于释义提取,抄袭检测,问题回答和临时检索。在信息检索社区中,Severyn和MoschiŠirnin在SIGIR 2015论文中提出的卷积神经网络模型获得了广泛关注。这是本文的重点吗?不是回答问题的答案选择问题:您可以使用开源代码复制Severyn和MoschiŠi的结果,也可以使用从头开始(即从头开始)的实现来复制他们的结果完全不同的深度学习工具包。我们的从头实施具有指导意义,可以确定报告的结果是否可以在工具箱中普遍使用,每个工具箱都有其特质。我们能够成功复制和重现SeverynrnandMoschiŠi的报告结果,尽管在有效性上存在微小差异,但仍能验证其模型的总体设计。附加的消融实验分解了模型的各个组成部分,以显示它们对整体有效性的贡献。有趣的是,我们发现删除一个组成部分实际上会提高效果,只有四个单词重叠特征的那个简化的模型的效果令人惊讶,甚至比仅卷积特征图还好

著录项

  • 来源
    《ACM SIGIR FORUM》 |2017年第cd期|1217-1220|共4页
  • 作者

    Jinfeng Rao; Hua He; Jimmy Lin;

  • 作者单位

    Department of Computer Science University of Maryland;

    Department of Computer Science University of Maryland;

    David R. Cheriton School of Computer Science University of Waterloo;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号