首页>
外国专利>
A Method for the N-gram Language Modeling Based on Keyword
A Method for the N-gram Language Modeling Based on Keyword
展开▼
机译:基于关键词的N元语法建模方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
PURPOSE: A method for constructing a keyword-based N-gram language model is provided to define a part of speech necessary for meaning transmission as a keyword part of speech, extract a corpus composed as the keyword part of speech from a large-scale corpus, and construct the keyword-based N-gram language model. CONSTITUTION: A text corpus is preprocessed to include only Hangul characters(S201). A part tagging of speech is performed with respect to a morpheme composing a paragraph of the preprocessed text corpus, and the morpheme is analyzed(S202). The morpheme is merged by a pseudo-morpheme unit(S203). A sentence corpus composed as a keyword part of speech is extracted from the processed text corpus(S204). A keyword vocabulary dictionary is written using the extracted sentence corpus(S205). A keyword-based N-gram language model and a keyword pronunciation dictionary are constructed through the keyword vocabulary dictionary(S206,S207).
展开▼