首页> 外国专利> Bilingual corpus creation method, apparatus and program, and machine translation system

Bilingual corpus creation method, apparatus and program, and machine translation system

机译:双语语料库的创建方法,装置和程序以及机器翻译系统

摘要

A translation corpus creation method of the present disclosure includes generating plural paraphrasing candidate sentences for a first original sentence in a first language by paraphrasing one or plural fragments among plural fragments included in the first original sentence into other expressions in the first language by a paraphrasing candidate sentence generation unit, identifying one or plural paraphrasing candidate sentences in the same meaning as the meaning of the first original sentence from the plural paraphrasing candidate sentences as one or plural paraphrasing sentences by a paraphrasing sentence identification unit, and generating a new set of sentences by setting the one or plural identified paraphrasing sentences and a second original sentence translated from the first original sentence as a set of sentences to create a translation corpus with the generated and new set of sentences by a translation corpus creation unit.
机译:本公开的翻译语料库创建方法包括:通过将第一原始句子中包括的多个片段中的一个或多个片段释义为第一语言中的第一语言的其他表达,来生成第一语言的第一原始句子的多个释义候选句子。句子生成单元,通过复述句子识别单元从复述候选词中识别与第一原始句子的含义相同的一个或多个复述候选句子作为复述句子中的一个或多个复述句子,并通过以下方式生成新的句子集合:将一个或多个识别的释义句子和从第一原始句子翻译的第二原始句子设置为句子集合,以由翻译语料库创建单元使用生成的句子和新的句子集来创建翻译语料库。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号