首页> 外文期刊>Journal of computer sciences >SQL Generation from Natural Language: A Sequence-to-Sequence Model Powered by the Transformers Architecture and Association Rules
【24h】

SQL Generation from Natural Language: A Sequence-to-Sequence Model Powered by the Transformers Architecture and Association Rules

机译:来自自然语言的SQL生成:由变压器架构和关联规则提供支持的序列到序列模型

获取原文
           

摘要

Using Natural Language (NL) to interacting with relational databases allows users from any background to easily query and analyze large amounts of data. This requires a system that understands user questions and automatically converts them into structured query language such as SQL. The best performing Text-to-SQL systems use supervised learning (usually formulated as a classification problem) by approaching this task as a sketch-based slot-filling problem, or by first converting questions into an Intermediate Logical Form (ILF) then convert it to the corresponding SQL query. However, non-supervised modeling that directly converts questions to SQL queries has proven more difficult. In this sense, we propose an approach to directly translate NL questions into SQL statements. In this study, we present a Sequence-to-Sequence (Seq2Seq) parsing model for the NL to SQL task, powered by the Transformers Architecture exploring the two Language Models (LM): Text-To-Text Transfer Transformer (T5) and the Multilingual pre-trained Text-To-Text Transformer (mT5). Besides, we adopt the transformation-based learning algorithm to update the aggregation predictions based on association rules. The resulting model achieves a new state-of-the-art on the WikiSQL DataSet, for the weakly supervised SQL generation.
机译:使用自然语言(NL)与关系数据库进行交互允许用户从任何背景中易于查询和分析大量数据。这需要一个理解用户问题的系统,并自动将它们转换为结构化查询语言,例如SQL。通过将此任务作为基于草图的插槽填充问题接近此任务,或者首先将问题转换为中间逻辑表单(ILF),使用监督学习(通常为分类问题)使用监督学习(通常为分类问题),或者将问题转换为中间逻辑表单(ILF)然后转换它到相应的SQL查询。但是,直接将问题转换为SQL查询的非监督建模已经证明更加困难。从这个意义上讲,我们提出了一种方法可以将NL问题直接翻译成SQL陈述。在这项研究中,我们向NL到SQL任务的序列 - 序列(SEQ2Seq)解析模型,由探索两种语言模型(LM)的变压器架构供电:文本到文本传输变压器(T5)和多语言预培训的文本传感器(MT5)。此外,我们采用基于转换的学习算法来基于关联规则更新聚合预测。由此产生的模型在WikiSQL数据集上实现了新的最先进的,对于弱监督的SQL生成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号