首页> 外文期刊>ACM Transactions on Management Information Systems >Putting Question-Answering Systems into Practice: Transfer Learning for Efficient Domain Customization
【24h】

Putting Question-Answering Systems into Practice: Transfer Learning for Efficient Domain Customization

机译:将问题解答系统付诸实践:有效学习域定制的转移学习

获取原文
获取原文并翻译 | 示例
           

摘要

Traditional information retrieval (such as that offered by web search engines) impedes users with information overload from extensive result pages and the need to manually locate the desired information therein. Conversely, question-answering systems change how humans interact with information systems: users can now ask specific questions and obtain a tailored answer-both conveniently in natural language. Despite obvious benefits, their use is often limited to an academic context, largely because of expensive domain customiza-tions, which means that the performance in domain-specific applications often fails to meet expectations. This article proposes cost-efficient remedies: (ⅰ) we leverage metadata through a filtering mechanism, which increases the precision of document retrieval, and (ⅱ) we develop a novel fuse-and-oversample approach for transfer learning to improve the performance of answer extraction. Here, knowledge is inductively transferred from related, yet different, tasks to the domain-specific application, while accounting for potential differences in the sample sizes across both tasks. The resulting performance is demonstrated with actual use cases from a finance company and the film industry, where fewer than 400 question-answer pairs had to be annotated to yield significant performance gains. As a direct implication to management, this presents a promising path to better leveraging of knowledge stored in information systems.
机译:传统的信息检索(例如由Web搜索引擎提供的信息检索)使用户无法从大量结果页面中获取过多的信息,并且需要在其中手动定位所需的信息。相反,问答系统改变了人类与信息系统的交互方式:用户现在可以用自然语言方便地提出特定问题并获得量身定制的答案。尽管有明显的好处,但它们的使用通常仅限于学术环境,这主要是因为昂贵的领域定制,这意味着特定领域应用程序中的性能通常无法达到期望。本文提出了具有成本效益的补救措施:(ⅰ)我们通过过滤机制利用元数据,从而提高了文档检索的准确性,并且(ⅱ)我们开发了一种新颖的融合和过采样方法来进行转移学习,以提高答案的性能萃取。在这里,知识被归纳地从相关但又不同的任务转移到特定于领域的应用程序中,同时考虑了这两个任务中样本量的潜在差异。财务公司和电影行业的实际用例证明了所产生的性能,在这些案例中,注释不超过400个问答对才能获得显着的性能提升。作为对管理的直接暗示,这为更好地利用信息系统中存储的知识提供了一条有希望的途径。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号