首页>
外国专利>
TEXT INFORMATION CLUSTERING METHOD AND TEXT INFORMATION CLUSTERING SYSTEM
TEXT INFORMATION CLUSTERING METHOD AND TEXT INFORMATION CLUSTERING SYSTEM
展开▼
机译:文本信息聚类方法和文本信息聚类系统
展开▼
页面导航
摘要
著录项
相似文献
摘要
A text information clustering method and system. The clustering method comprises the following steps: performing word segmentation on each of multiple pieces of text information, so as to form multiple words (S101); performing initial clustering on the multiple pieces of text information on which word segmentation has been performed, so as to form multiple first-level subjects, each first-level subject comprising at least two pieces of text information (S102); determining the number of second-level subjects under each first-level subject according to the number of pieces of text information under each first-level subject (S103); and performing secondary clustering on at least two pieces of text information comprised in each first-level subject according to the number of second-level subjects under each first-level subject, so as to form multiple second-level subjects (S104). By using the layered clustering method, the total number of first-level subjects is decreased in initial clustering, thereby accelerating the computing efficiency; in secondary clustering, the number of second-level subjects is dynamically determined according to the number of pieces of text information, thereby accelerating the computing speed of the second-level subjects.
展开▼