【24h】

Natural Language Business Intelligence Question Answering Through SeqtoSeq Transfer Learning

机译:通过SeqtoSeq转移学习进行自然语言商务智能问答

获取原文

摘要

Enterprise data is usually stored in the form of relational databases. Question Answering systems provides an easier way so that business analysts can get data insights without struggling with the syntax of SQL. However, building a supervised machine learning based question answering system is a challenging task involving large manual annotations for a specific domain. In this paper we explore the problem of transfer learning for neural sequence taggers, where a source task with plentiful annotations (e.g., Training samples (NL questions) on IT enetr-prize domain) is used to improve performance on a target task with fewer available annotations (e.g., Training samples (NL questions) on pharmaceutical domain). We examine the effects of transfer learning for deep recurrent networks across domains and show that significant improvement can often be obtained. Our question answering framework is based on a set of machine learning models that create an intermediate sketch from a natural language query. Using the intermediate sketch, we generate a final database query over a large knowledge graph. Our framework supports multiple queries such as aggregation, self joins, factoid and transnational.
机译:企业数据通常以关系数据库的形式存储。问答系统提供了一种更简便的方法,使业务分析人员可以获取数据见解,而不必担心SQL的语法。但是,构建基于监督的机器学习的问答系统是一项艰巨的任务,涉及针对特定领域的大型手动注释。在本文中,我们探讨了神经序列标记器的转移学习问题,其中使用带有大量注释的源任务(例如,IT渗透域上的训练样本(NL问题))来提高目标任务的性能,而可用的更少注释(例如,有关药物领域的培训样本(NL问题))。我们研究了跨域深层递归网络的迁移学习的效果,并表明可以经常获得重大改进。我们的问题解答框架基于一组机器学习模型,这些模型从自然语言查询创建中间草图。使用中间的草图,我们在一个大型知识图上生成最终的数据库查询。我们的框架支持多种查询,例如聚合,自连接,事实和跨国查询。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号