首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Deep Learning-based Roman-Urdu to Urdu Transliteration
【24h】

Deep Learning-based Roman-Urdu to Urdu Transliteration

机译:基于深度学习的罗马 - 核武器到Urdu音译

获取原文
获取原文并翻译 | 示例
       

摘要

Attention-based encoder-decoder models have superseded conventional techniques due to their unmatched performance on many neural machine translation problems. Usually, the encoders and decoders are two recurrent neural networks where the decoder is directed to focus on relevant parts of the source language using attention mechanism. This data-driven approach leads to generic and scalable solutions with no reliance on manual hand-crafted features. To the best of our knowledge, none of the modern machine translation approaches has been applied to address the research problem of Urdu machine transliteration. Ours is the first attempt to apply the deep neural network-based encoder-decoder using attention mechanism to address the aforementioned problem using Roman-Urdu and Urdu parallel corpus. To this end, we present (i) the first ever Roman-Urdu to Urdu parallel corpus of 1.1 million sentences, (ii) three state of the art encoder-decoder models, and (iii) a detailed empirical analysis of these three models on the Roman-Urdu to Urdu parallel corpus. Overall, attention-based model gives state-of-the-art performance with the benchmark of 70 BLEU score. Our qualitative experimental evaluation shows that our models generate coherent transliterations which are grammatically and logically correct.
机译:由于其在许多神经机翻译问题上,基于注意力的编码器 - 解码器模型具有取代的传统技术。通常,编码器和解码器是两个经常性神经网络,其中解码器被引导用于使用注意机制专注于源语言的相关部分。此数据驱动方法导致通用和可扩展的解决方案,无需依赖手动手工制作的功能。据我们所知,已应用现代化的机器翻译方法均未解决Urdu机器音译的研究问题。我们的首次尝试使用注意机制应用深度神经网络的编码器解码器来解决上述问题,以解决上述问题,并使用roman-urdu和urdu并行语料库来解决上述问题。为此,我们展示(i)第一个罗马 - 乌尔都语到乌尔都语并联语料库110万句,(ii)第三态的octoder-decoder模型,和(iii)对这三种模型的详细实证分析罗马-URDU到乌尔都语并行语料库。总体而言,基于注意力的模型提供了最先进的性能,具有70个BLEU分数的基准。我们的定性实验评估表明,我们的模型产生了语法和逻辑上的相干音译。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号