首页>
外国专利>
Constructing a translation lexicon from comparable, non-parallel corpora
Constructing a translation lexicon from comparable, non-parallel corpora
展开▼
机译:从可比的非平行语料库构建翻译词典
展开▼
页面导航
摘要
著录项
相似文献
摘要
A machine translation system may use non-parallel monolingual corpora to generate a translation lexicon. The system may identify identically spelled words in the two corpora, and use them as a seed lexicon. The system may use various clues, e.g., context and frequency, to identify and score other possible translation pairs, using the seed lexicon as a basis. An alternative system may use a small bilingual lexicon in addition to non-parallel corpora to learn translations of unknown words and to generate a parallel corpus.
展开▼