Method for multi-phase category assignment on text categorization system
Abstract
1. Technical field of the invention
The present invention relates to a multi-category assignment method in an automatic document classification system, and to a computer-readable recording medium on which a program for realizing the method is recorded.

2. Technical problem to be solved by the invention
The invention aims to provide a multi-category assignment method that assigns the appropriate categories to each document in an automatic document classification system, together with a computer-readable recording medium on which a program for realizing the method is recorded.

3. Summary of the solution of the invention
The multi-category assignment method in an automatic document classification system comprises four steps: a first step of selecting words capable of predicting a category and constructing a list of category/word pairs; a second step of referring to the list of category/word pairs, expressing each learning document as the corresponding words and their importance, and storing the learning documents in an inverted index file; a third step of selecting, from the learning document set, the example documents most similar to the document to be newly classified; and a fourth step of calculating, for each new document, the probability of each category it may belong to, selecting only the category with the highest probability, one at a time, in each phase, and assigning that category to the new document at each phase.

4. Important uses of the invention
The invention is used for the automatic classification of documents.
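The four steps summarized above can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the abstract does not state the word-selection criterion, the importance weighting, the similarity measure, or the stopping rule, so the sketch assumes a P(category | word) threshold, TF-IDF weights, a dot-product similarity, and a probability cutoff. All function names, parameters (`min_score`, `k`, `threshold`, `max_phases`), and the toy `TRAINING` data are hypothetical.

```python
import math
from collections import Counter, defaultdict

def select_category_words(docs, min_score=0.5):
    """Step 1: keep (category, word) pairs whose word predicts the category.
    Assumed criterion: share of documents containing the word that carry the category."""
    df, pair_df = Counter(), Counter()
    for tokens, cats in docs:
        for w in set(tokens):
            df[w] += 1
            for c in cats:
                pair_df[(c, w)] += 1
    return {(c, w) for (c, w), n in pair_df.items() if n / df[w] >= min_score}

def build_inverted_index(docs, pairs):
    """Step 2: word -> {doc_id: importance}, using only the selected words.
    Importance is an assumed length-normalized TF-IDF weight."""
    vocab = {w for _, w in pairs}
    n_docs = len(docs)
    df = Counter(w for tokens, _ in docs for w in set(tokens) if w in vocab)
    index = defaultdict(dict)
    for doc_id, (tokens, _) in enumerate(docs):
        tf = Counter(w for w in tokens if w in vocab)
        weights = {w: f * (math.log(n_docs / df[w]) + 1.0) for w, f in tf.items()}
        norm = math.sqrt(sum(v * v for v in weights.values())) or 1.0
        for w, v in weights.items():
            index[w][doc_id] = v / norm
    return index

def most_similar(index, tokens, k=3):
    """Step 3: the k learning documents most similar to the new document."""
    scores = defaultdict(float)
    for w, f in Counter(tokens).items():
        for doc_id, weight in index.get(w, {}).items():
            scores[doc_id] += f * weight
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:k]

def assign_categories(docs, neighbours, threshold=0.3, max_phases=3):
    """Step 4: in each phase, assign only the most probable remaining category.
    Probability is the assumed share of neighbour similarity supporting a category."""
    total = sum(sim for _, sim in neighbours)
    if total == 0:
        return []
    prob = defaultdict(float)
    for doc_id, sim in neighbours:
        for c in docs[doc_id][1]:
            prob[c] += sim / total
    assigned = []
    for _ in range(max_phases):
        remaining = {c: p for c, p in prob.items() if c not in assigned}
        if not remaining:
            break
        best = max(sorted(remaining), key=remaining.get)
        if remaining[best] < threshold:
            break  # assumed stopping rule: no remaining category is likely enough
        assigned.append(best)
    return assigned

# Hypothetical learning documents: (tokens, set of categories).
TRAINING = [
    (["goal", "match", "team"], {"sports"}),
    (["election", "vote", "party"], {"politics"}),
    (["match", "team", "vote", "minister"], {"sports", "politics"}),
    (["stock", "market", "price"], {"finance"}),
]

def classify(tokens, docs=TRAINING):
    pairs = select_category_words(docs)
    index = build_inverted_index(docs, pairs)
    return assign_categories(docs, most_similar(index, tokens))

print(classify(["team", "match", "vote"]))
```

Assigning one category per phase, rather than every category above a cutoff at once, leaves room for the probabilities to be re-estimated between phases; this sketch keeps them fixed for simplicity.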