SYSTEMATIC MASS NORMALIZATION OF INTERNATIONAL TITLES
Abstract
A system for generating a database of labeled foreign canonical titles includes an interface and a processor. The interface is to receive a title in a second language. The processor is to 1) store a set of n-grams in a first language in a first database; 2) sanitize the title into a sanitized title in the second language; 3) translate the sanitized title into a translated title in the first language; 4) break the translated title into n-grams; 5) determine labels for the n-grams using the first database; and 6) determine a label to associate with the title.
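The six processor steps can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `NGRAM_LABELS` table stands in for the first database, `TOY_GLOSSARY` stands in for a real translation service, the sanitization rules are guesses, and the final label is chosen by a simple majority vote because the abstract does not say how the title-level label is determined.

```python
import re
from collections import Counter

# Step 1 (assumed data): first-language n-grams and their labels.
NGRAM_LABELS = {
    "software engineer": "engineering",
    "engineer": "engineering",
    "senior": "senior",
    "sales manager": "sales",
}

# Hypothetical stand-in for a machine-translation service.
TOY_GLOSSARY = {
    "ingénieur logiciel senior": "senior software engineer",
}


def sanitize(title):
    """Step 2: lowercase, drop punctuation and digits, collapse whitespace."""
    cleaned = re.sub(r"[^\w\s]|\d", " ", title.lower())
    return " ".join(cleaned.split())


def translate(sanitized_title):
    """Step 3: look up a translation; fall back to the input if unknown."""
    return TOY_GLOSSARY.get(sanitized_title, sanitized_title)


def ngrams(text, max_n=3):
    """Step 4: all word n-grams of length 1..max_n."""
    tokens = text.split()
    return [
        " ".join(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    ]


def label_title(title):
    """Steps 2-6: return the most frequent n-gram label, or None."""
    grams = ngrams(translate(sanitize(title)))
    # Step 5: keep labels for n-grams present in the database.
    labels = [NGRAM_LABELS[g] for g in grams if g in NGRAM_LABELS]
    if not labels:
        return None
    # Step 6 (assumed rule): majority vote across n-gram labels.
    return Counter(labels).most_common(1)[0][0]


result = label_title("  Ingénieur Logiciel Senior!! ")
```

In this toy run, "engineer" and "software engineer" both vote for the same label, which outweighs the single vote from "senior". A production system would likely weight longer n-grams more heavily or use a trained classifier, but the abstract leaves that choice open.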