首页> 外国专利> Discovery of parallel text portions in comparable collections of corpora and training using comparable texts

Discovery of parallel text portions in comparable collections of corpora and training using comparable texts

机译：在可比较的语料库中发现平行文本部分，并使用可比较的文本进行训练

页面导航

摘要
著录项
相似文献

摘要

A translation training device which extracts from two nonparallel Corpora a set of parallel sentences. The system finds parameters between different sentences or phrases, in order to find parallel sentences. The parallel sentences are then used for training a data-driven machine translation system. The process can be applied repetitively until sufficient data is collected or until the performance of the translation system stops improving.

机译：一种翻译训练设备，它从两个不平行的语料库中提取出一组平行的句子。该系统在不同句子或短语之间找到参数，以便找到平行句子。然后，平行句子用于训练数据驱动的机器翻译系统。可以重复应用该过程，直到收集到足够的数据或翻译系统的性能停止提高为止。

著录项

公开/公告号US8296127B2

专利类型
公开/公告日2012-10-23

原文格式PDF
申请/专利权人 DANIEL MARCU;DRAGOS STEFAN MUNTEANU;
展开▼

申请/专利号US20050087376
发明设计人 DANIEL MARCU;DRAGOS STEFAN MUNTEANU;
展开▼

申请日2005-03-22
分类号G06F17/28;
国家 US
入库时间 2022-08-21 17:30:14

相似文献

专利
外文文献
中文文献