首页>
外国专利>
METHOD AND APPARATUS FOR EXPANDING DATA OF BILINGUAL CORPUS AND STORAGE MEDIUM
METHOD AND APPARATUS FOR EXPANDING DATA OF BILINGUAL CORPUS AND STORAGE MEDIUM
展开▼
机译:双语语料库和存储介质的数据扩展方法和装置
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method and apparatus for expanding data in a bilingual corpus is disclosed. The method of expanding data of a bilingual corpus comprises: querying at least one first central language phrase matching a word of a first source language phrase in a source language-centric language corpus; Querying at least one second source language phrase matching a word of each first central language phrase in a source language-backbone language corpus and constructing a source language phrase set with each second source language phrase; Querying at least one first target language phrase matching a word of each first central language phrase in a central language-target language corpus and constructing a target language phrase set with each first target language phrase; Forming at least one pair of phrases in which a source language phrase and a target language phrase are matched by combining a second source language phrase in the source language phrase set and a first target language phrase in the target language phrase set; And storing at least one pair of phrases in the source language-target language corpus where the phrases of the source language phrase and the target language phrase are matched. The problem of data scarcity in the bilingual corpus is solved by expanding the data in the bilingual corpus.
展开▼