SYSTEMS AND METHODS FOR IDENTIFYING PARALLEL DOCUMENTS AND SENTENCE FRAGMENTS IN MULTILINGUAL DOCUMENT COLLECTIONS
Abstract
Systems, computer programs, and methods for identifying parallel documents and/or fragments in a bilingual collection are provided. The method for identifying parallel sub-sentential fragments in a bilingual collection comprises translating a source document from a bilingual collection. The method further includes querying a target library associated with the bilingual collection using the translated source document, and identifying one or more target documents based on the query. Subsequently, a source sentence associated with the source document is aligned to one or more target sentences associated with the one or more target documents. Finally, the method includes determining whether a source fragment associated with the source sentence comprises a parallel translation of a target fragment associated with the one or more target sentences.
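The steps named in the abstract (translate, query, identify target documents, align sentences, test fragments) can be illustrated with a minimal Python sketch. This is not the patented method. It assumes a word-by-word gloss in place of real translation, token overlap in place of a real retrieval engine and sentence aligner, and a longest-contiguous-run heuristic in place of the actual fragment test. The lexicon `LEXICON` and the functions `retrieve`, `align`, and `parallel_fragment` are hypothetical names introduced only for illustration.

```python
from collections import Counter

# Hypothetical toy lexicon (source word -> target word); an assumption, not from the patent.
LEXICON = {"maison": "house", "rouge": "red", "grande": "big",
           "chat": "cat", "dort": "sleeps"}

def tokenize(text):
    # Lowercase and split on whitespace, treating periods as separators.
    return text.lower().replace(".", " ").split()

def split_sentences(doc):
    # Naive sentence splitter: break on periods.
    return [s.strip() for s in doc.split(".") if s.strip()]

def translate(tokens):
    # Word-by-word gloss; words missing from the lexicon are dropped.
    return [LEXICON[t] for t in tokens if t in LEXICON]

def overlap(a, b):
    # Count of shared tokens (multiset intersection).
    return sum((Counter(a) & Counter(b)).values())

def retrieve(source_doc, target_library, top_k=1):
    # Query the target library with the glossed source document.
    query = translate(tokenize(source_doc))
    ranked = sorted(target_library,
                    key=lambda d: overlap(query, tokenize(d)), reverse=True)
    return [d for d in ranked[:top_k] if overlap(query, tokenize(d)) > 0]

def align(source_sentence, target_docs):
    # Keep target sentences that share at least one glossed word.
    query = translate(tokenize(source_sentence))
    candidates = [s for d in target_docs for s in split_sentences(d)]
    return [s for s in candidates if overlap(query, tokenize(s)) > 0]

def parallel_fragment(source_sentence, target_sentence, min_len=2):
    # Longest contiguous run of source tokens whose gloss occurs in the target.
    target = set(tokenize(target_sentence))
    best, run = [], []
    for tok in tokenize(source_sentence):
        if LEXICON.get(tok) in target:
            run.append(tok)
            if len(run) > len(best):
                best = list(run)
        else:
            run = []
    return best if len(best) >= min_len else None

source_doc = "La grande maison rouge. Le chat dort."
target_library = [
    "The cat sleeps on the mat. A big red house stands there.",
    "Stocks fell today.",
]
docs = retrieve(source_doc, target_library)
sentences = align("Le chat dort", docs)
fragment = parallel_fragment("Le chat dort", sentences[0])
```

In this toy run, the unrelated finance document is filtered out at the query step, the source sentence pairs with the one target sentence that shares glossed words, and the fragment test returns only the source tokens with a counterpart in that sentence. A real system would likely replace each stage with a statistical translation model, an indexed search engine, and a probabilistic fragment classifier.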