首页> 外文期刊>Mathematical Problems in Engineering >A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website
【24h】

A Novel Deep Learning Method for Obtaining Bilingual Corpus from Multilingual Website

机译:一种从多语言网站获取双语语料库的新型深度学习方法

获取原文
获取原文并翻译 | 示例
       

摘要

Machine translation needs a large number of parallel sentence pairs to make sure of having a good translation performance. However, the lack of parallel corpus heavily limits machine translation for low-resources language pairs. We propose a novel method that combines the continuous word embeddings with deep learning to obtain parallel sentences. Since parallel sentences are very invaluable for low-resources language pair, we introduce cross-lingual semantic representation to induce bilingual signals. Our experiments show that we can achieve promising results under lacking external resources for low-resource languages. Finally, we construct a state-of-the-art machine translation system in low-resources language pair.
机译:机器翻译需要大量的并行句子对,以确保具有良好的翻译性能。但是,缺乏并行语料库严重限制了资源较少的语言对的机器翻译。我们提出了一种将连续单词嵌入与深度学习相结合以获得平行句子的新颖方法。由于平行句子对于资源匮乏的语言对非常宝贵,因此我们引入了跨语言语义表示来诱导双语信号。我们的实验表明,在缺乏外部资源的情况下,对于低资源语言,我们可以取得可喜的结果。最后,我们以资源匮乏的语言对构建了最新的机器翻译系统。

著录项

  • 来源
    《Mathematical Problems in Engineering》 |2019年第1期|7495436.1-7495436.7|共7页
  • 作者单位

    Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China|Key Lab Speech Language Informat Proc Xinjiang, Urumqi, Peoples R China|Univ Chinese Acad Sci, Beijing, Peoples R China;

    Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China|Key Lab Speech Language Informat Proc Xinjiang, Urumqi, Peoples R China;

    Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China|Key Lab Speech Language Informat Proc Xinjiang, Urumqi, Peoples R China;

    Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China|Key Lab Speech Language Informat Proc Xinjiang, Urumqi, Peoples R China;

    Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China|Key Lab Speech Language Informat Proc Xinjiang, Urumqi, Peoples R China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-18 04:19:19

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号