COLLECTING POLISH-GERMAN PARALLEL CORPORA IN THE WEB

MONIKA ROSINSKA

首页> 外文期刊>Systems Science >COLLECTING POLISH-GERMAN PARALLEL CORPORA IN THE WEB

【24h】

COLLECTING POLISH-GERMAN PARALLEL CORPORA IN THE WEB

机译：在网络中收集波兰语-德语平行语

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Parallel corpus has recently become an indispensable resource in multilingual natural language processing. Manual preparation of a bilingual corpus is a laborious task. Therefore methods for the automated creation of parallel corpus are currently a topic of concern for many researchers. A number of sophisticated and effective algorithms for collecting parallel texts from the Web have already been created. Unfortunately, none of them have been used in the process of Polish-German corpus creation. That is why the aim of the research has been to verify the efficiency of existing algorithms for the collecting of Polish-German parallel corpus, intended as a reference source for a Machine Translation system, to propose a new algorithm and present results achieved by the new algorithm.

机译：并行语料库最近已成为多语言自然语言处理中必不可少的资源。手动准备双语语料库是一项艰巨的任务。因此，自动创建平行语料库的方法目前是许多研究人员关注的话题。已经创建了许多用于从Web收集并行文本的复杂有效的算法。不幸的是，在波兰语-德语语料库的创建过程中都没有使用它们。因此，本研究的目的是验证现有的用于收集波兰语-德语并行语料库的算法的效率，该算法旨在作为机器翻译系统的参考源，以提出一种新算法并介绍新算法所取得的结果算法。

著录项

来源
《Systems Science》 |2008年第4期|41-45|共5页
作者
MONIKA ROSINSKA;
展开▼
作者单位

Adam Mickiewicz University in Poznan, Faculty of Mathematics and Computer Science;

展开▼
收录信息美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
corpus; parallel;

机译：语料库平行;
入库时间 2022-08-17 23:10:40

相似文献

外文文献
中文文献
专利

1. Extracting Parallel Sentences from Nonparallel Corpora Using Parallel Hierarchical Attention Network [J] . Shaolin Zhu, Yong Yang, Chun Xu Computational intelligence and neuroscience . 2020,第4期

机译：使用并行分层注意网络从非平行语料库中提取并行句子
2. An Efficient Framework for Extracting Parallel Sentences from Non-Parallel Corpora [J] . Cuong Hoang, Anh-Cuong Le, Phuong-Thai Nguyen, Fundamenta Informaticae . 2014,第2期

机译：从非平行语料库中提取平行句子的有效框架
3. Voice Conversion Using Dynamic Frequency Warping With Amplitude Scaling, for Parallel or Nonparallel Corpora [J] . Godoy E., Rosec O., Chonavel T. Audio, Speech, and Language Processing, IEEE Transactions on . 2012,第4期

机译：针对并行或非并行语料库，使用具有幅度缩放比例的动态频率规整的语音转换
4. Evaluating different methods for automatically collecting large general corpora for Basque from the web [C] . Igor LETURIA International conference on computational linguistics . 2012

机译：评估从网络上自动收集巴斯克大型通用语料库的不同方法
5. Parallel automatic term extraction from large Web corpora. [D] . Zhang, Lingyan. 2004

机译：从大型Web语料库中并行自动提取术语。
6. Extracting Parallel Sentences from Nonparallel Corpora Using Parallel Hierarchical Attention Network [O] . Shaolin Zhu, Yong Yang, Chun Xu 2020

机译：使用并行分层注意网络从非平行语料库中提取并行句子
7. Automatic parallel corpora and bilingual terminology extraction from parallel WebSites [O] . Almeida J. J., Simões Alberto 2010

机译：从并行网站自动提取并行语料库和双语术语

COLLECTING POLISH-GERMAN PARALLEL CORPORA IN THE WEB

摘要

著录项

相似文献

相关主题

期刊订阅