首页> 外文会议>Federated Conference on Computer Science and Information Systems >Exploration for Polish-* bi-lingual translation equivalents from comparable and quasi-comparable corpora
【24h】

Exploration for Polish-* bi-lingual translation equivalents from comparable and quasi-comparable corpora

机译:从可比和准可比语料库中探索波兰语*双语翻译对等物

获取原文

摘要

In contemporary world, translation becomes a critical need of the time. Parallel dictionaries have now become a most accessible source by humans, but confines are there as they do not offer good quality translation function, because of neologisms and words that are out of vocabulary. To overcome this problem in the usage of statistical translation systems is becoming more and more important in maintaining the eminence and quantity of the training data. But due to the limitations in these systems they have very limited availability for few languages and very limited narrow text areas. The purpose of this research is to bring calculation time up gradation via GPU acceleration, tuning script introduction and the enhancement and improvements in the methodologies of the contemporary comparable corpora mining through re-implementation of analogous algorithms through Needleman-Wunch algorithm. Experiments have been conducted on multiple language data which were extracted on numerous domains from Wikipedia. For the sake of Wikipedia, multiple cross-lingual contrasts and comparison were established. Optimistic impact on the both quantity and quality of mined data was observed due to such changes and adaptation. The solution is language independent and highly practical especially for under-resourced languages.
机译:在当代世界中,翻译变成了时代的危急需要。并行词典现在已成为人类最无障碍的来源,但由于新闻和词汇,因此不提供优质的翻译功能,因此不提供良好的质量翻译功能。为了克服这个问题,在统计翻译系统的使用中,在维护培训数据的卓越和数量时变得越来越重要。但由于这些系统中的局限性,它们具有很少的可用性对于几种语言和非常有限的狭义文本区域。本研究的目的是通过GPU加速,调整脚本介绍以及通过通过针对针对算法重新实施类似的算法的当代可比较的Corpora挖掘方法来实现计算时级。在多语言数据上进行了实验,这些数据在维基百科的许多域中提取。为维基百科的缘故,建立了多种交叉对比和比较。由于此类变化和适应,观察到对在挖掘数据的两种数量和质量的乐观影响。该解决方案是语言独立,非常实用,特别适用于资源不足的语言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号