首页>
外国专利>
METHOD AND DEVICE FOR REGISTERING UNKNOWN WORD WITH NOUN THESAURUS AND RECORDING MEDIUM WITH UNKNOWN WORD REGISTRATION PROGRAM RECORDED THEREIN
METHOD AND DEVICE FOR REGISTERING UNKNOWN WORD WITH NOUN THESAURUS AND RECORDING MEDIUM WITH UNKNOWN WORD REGISTRATION PROGRAM RECORDED THEREIN
展开▼
机译:用名词词库注册未知单词并用其中记录的未知单词注册程序记录媒体的方法和装置
展开▼
页面导航
摘要
著录项
相似文献
摘要
PROBLEM TO BE SOLVED: To statistically strictly decide a node of a noun thesaurus having a multinomial distribution being close to the multinomial distribution of unknown words as an unknown work registration node by using a Bayesian estimator logically having guarantee under a limited sample instead of a cooccurrence frequency and Kullback-Leibler information quantity(KL information quantity) being an inter-distribution distance in probability distribution space instead of a cosine between vectors in vector space. ;SOLUTION: This device consists of a means 100 which calculates the cooccurrence frequency of an unknown word and each verb in document data in corpus 120 and the cooccurrence frequency of each node and each verb of a noun thesaurus 130 in the document data in the corpus, a means 200 which uses the cooccurrence frequency information and calculates the Bayesian estimator of a multinominal distribution in which the unknown word co-occurs with each verb and the Bayesian estimator of a multinomial distribution in which each node of the noun thesaurus co-occurs with each verb and a means 300 which uses the Bayesian estimators and outputs a node of the noun thesaurus having a multinomial distribution being the closest to the unknown word as an unknown work registration node with Kullback-Leibler information quantity as a standard.;COPYRIGHT: (C)2000,JPO
展开▼