Identification of source languages for terms
Abstract
To prevent typos in electronic documents, quality checks are performed to identify misspelled portions of the document. A major source of false positives in these checks is words that have been transliterated from other languages into English. The quality control system described herein identifies transliterated words in electronic documents. The system may be trained to evaluate a set of words and to determine, for each word, the likelihood that it is transliterated (i.e., not native to the language and/or alphabet of the electronic document). In some embodiments, the model may be further configured to identify the language and/or alphabet from which an individual word was transliterated.
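The abstract does not say what kind of model the system uses. As an illustration only, the following minimal sketch assumes a character-bigram naive Bayes scorer with add-one smoothing and a uniform prior over labels. The class name `BigramScorer`, the labels, and the toy word lists are all hypothetical and far too small for real use. It shows how a trained scorer could return both a likelihood that a word is transliterated and a guess at its source language.

```python
import math
from collections import Counter

# Hypothetical toy training data; a real system would need far larger lists.
TOY_DATA = {
    "native": ["document", "quality", "check", "spelling",
               "language", "system", "model", "training"],
    "japanese": ["sushi", "karaoke", "tsunami", "origami", "samurai", "kimono"],
    "hindi": ["chai", "yoga", "guru", "karma", "dharma", "mantra"],
}


def char_bigrams(word):
    """Return character bigrams of a word, with start (^) and end ($) markers."""
    padded = f"^{word.lower()}$"
    return [padded[i:i + 2] for i in range(len(padded) - 1)]


class BigramScorer:
    """Assumed model: naive Bayes over character bigrams, uniform prior."""

    def __init__(self, labeled_words):
        # Count bigrams separately for each label.
        self.counts = {label: Counter() for label in labeled_words}
        for label, words in labeled_words.items():
            for word in words:
                self.counts[label].update(char_bigrams(word))
        self.vocab = set()
        for counter in self.counts.values():
            self.vocab.update(counter)
        self.totals = {lbl: sum(c.values()) for lbl, c in self.counts.items()}

    def log_likelihood(self, word, label):
        counter = self.counts[label]
        # Add-one smoothing; the extra +1 reserves mass for unseen bigrams.
        denom = self.totals[label] + len(self.vocab) + 1
        return sum(math.log((counter[b] + 1) / denom) for b in char_bigrams(word))

    def probabilities(self, word):
        """Normalize per-label likelihoods into probabilities that sum to 1."""
        scores = {lbl: self.log_likelihood(word, lbl) for lbl in self.counts}
        top = max(scores.values())
        exps = {lbl: math.exp(s - top) for lbl, s in scores.items()}
        norm = sum(exps.values())
        return {lbl: v / norm for lbl, v in exps.items()}

    def transliterated_likelihood(self, word):
        """Probability that the word is not native to the document language."""
        return 1.0 - self.probabilities(word)["native"]

    def source_language(self, word):
        """Most probable non-native label for the word."""
        probs = self.probabilities(word)
        return max((l for l in probs if l != "native"), key=probs.get)


scorer = BigramScorer(TOY_DATA)
```

In a spell-checking pipeline, a word flagged as misspelled could be passed to `scorer.transliterated_likelihood(word)`, and the flag suppressed when the value exceeds a chosen threshold. The threshold and the choice of features are design decisions that the abstract leaves open.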