EXTRACTING PROCESSING SYSTEM FOR CHARACTERISTIC VOCABULARY IN JAPANESE OBJECT SENTENCE
Machine translation: Processing system for extracting characteristic vocabulary from Japanese object sentences
Abstract
PURPOSE: To automatically extract the characteristic vocabulary of a Japanese object sentence by converting the Japanese document into character type code strings, extracting candidates for characteristic vocabulary from those code strings, narrowing the candidates to those of higher accuracy on the basis of language information, and outputting only the vocabulary that is not registered in the Japanese dictionary used for analysis.

CONSTITUTION: For an input Japanese document, a code string expanding part 1 generates plural types of character type code strings covering every character in the document. A characteristic vocabulary candidate extracting part 2 extracts every character string whose code string matches an entry in an extracting character type string prescribing table 7 as a candidate for characteristic vocabulary in the Japanese object sentence, and classifies the candidates according to the conditions in a classifying table 8. Next, a characteristic vocabulary language processing part 3 consults a language information table 9, processes each candidate, and retains the candidates of higher accuracy. A characteristic vocabulary language selecting part 4 then searches a dictionary 10 for analysis, using the written form of each candidate received from the processing part 3 as the key. A candidate that is already registered in the dictionary 10 is removed from the candidate set. A candidate that is not yet registered is regarded as characteristic vocabulary of the Japanese object sentence, sent to a registering part, and written to a file 6.
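The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes a single character type code per character (the abstract speaks of plural types of code strings), a hypothetical three-entry pattern table standing in for table 7 and classifying table 8, and a plain set standing in for dictionary 10. The language processing step of part 3 is omitted because the abstract does not say what the language information table 9 contains. All function and table names are illustrative.

```python
import re


def char_type(ch: str) -> str:
    """Map one character to a one-letter character type code (assumed scheme)."""
    code = ord(ch)
    if 0x3040 <= code <= 0x309F:
        return "H"  # hiragana
    if 0x30A0 <= code <= 0x30FF:
        return "K"  # katakana, including the long-vowel mark
    if 0x4E00 <= code <= 0x9FFF:
        return "C"  # CJK unified ideographs (kanji)
    if ch.isascii() and ch.isalpha():
        return "A"  # Latin letters
    if ch.isdigit():
        return "N"  # digits
    return "O"      # anything else


def to_code_string(text: str) -> str:
    """Stand-in for part 1: one code per character, so indices stay aligned."""
    return "".join(char_type(ch) for ch in text)


# Hypothetical stand-in for tables 7 and 8: each key is the class label,
# each value is a pattern over the code string, not over the text itself.
EXTRACTION_PATTERNS = {
    "katakana": re.compile(r"K{2,}"),
    "kanji": re.compile(r"C{2,}"),
    "latin": re.compile(r"A{2,}"),
}


def extract_candidates(text: str) -> list[tuple[str, str]]:
    """Stand-in for part 2: collect (class, string) pairs matching a pattern."""
    codes = to_code_string(text)
    candidates: list[tuple[str, str]] = []
    for label, pattern in EXTRACTION_PATTERNS.items():
        for match in pattern.finditer(codes):
            candidate = (label, text[match.start():match.end()])
            if candidate not in candidates:  # drop repeats, keep first-seen order
                candidates.append(candidate)
    return candidates


def select_unregistered(candidates, dictionary):
    """Stand-in for part 4: keep only candidates absent from the dictionary."""
    return [(label, word) for label, word in candidates if word not in dictionary]
```

For example, `select_unregistered(extract_candidates(text), {"形態素解析"})` keeps only those runs of katakana, kanji, or Latin letters in `text` that the dictionary set does not already contain. Matching patterns against the code string rather than the raw text is what lets a small table describe whole families of candidate shapes.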