首页> 外文会议>International conference on computational linguistics >Inducing Multilingual Text Analysis Tools Using Bidirectional Recurrent Neural Networks
【24h】

Inducing Multilingual Text Analysis Tools Using Bidirectional Recurrent Neural Networks

机译:使用双向递归神经网络诱导多语言文本分析工具

获取原文

摘要

This work focuses on the rapid development of linguistic annotation tools for resource-poor languages. We experiment several cross-lingual annotation projection methods using Recurrent Neural Networks (RNN) models. The distinctive feature of our approach is that our multilingual word representation requires only a parallel corpus between source and target languages. More precisely, our method has the following characteristics: (a) it does not use word alignment information, (b) it does not assume any knowledge about foreign languages, which makes it applicable to a wide range of resource-poor languages, (c) it provides truly multilingual taggers. We investigate both uni- and bi-directional RNN models and propose a method to include external information (for instance low level information from Part-Of-Speech tags) in the RNN to train higher level taggers (for instance, super sense taggers). We demonstrate the validity and genericity of our model by using parallel corpora (obtained by manual or automatic translation). Our experiments are conducted to induce cross-lingual POS and super sense taggers.
机译:这项工作的重点是针对资源贫乏的语言的语言注释工具的快速开发。我们使用递归神经网络(RNN)模型对几种跨语言注释投影方法进行了实验。我们的方法的独特之处在于,我们的多语言单词表示仅需要源语言和目标语言之间的平行语料库。更准确地说,我们的方法具有以下特征:(a)它不使用单词对齐信息,(b)它不假设任何有关外语的知识,这使其可用于多种资源贫乏的语言,(c ),它提供了真正的多语言标记器。我们研究了单向和双向RNN模型,并提出了一种在RNN中包括外部信息(例如,来自词性标签的低级信息)的方法,以训练更高级别的标记器(例如,超常感标记器)。我们通过使用并行语料库(通过手动或自动翻译获得)证明了我们模型的有效性和通用性。进行我们的实验以诱导跨语言POS和超常标签。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号