首页>
外国专利>
A SYSTEM AND METHOD FOR DECISION DRIVEN HYBRID TEXT CLUSTERING
A SYSTEM AND METHOD FOR DECISION DRIVEN HYBRID TEXT CLUSTERING
展开▼
机译:一种决策驱动的混合文本聚类系统与方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention discloses a method and a system for clustering of short and long text documents. The documents are input through an input module and a pre-processing module overtakes the documents from the input module. The pre-processing module refines the documents and removes unwanted text from the documents. Then a decision driven hybrid text clustering algorithm is applied via different modules to achieve clustering of the documents. Firstly, a context module computes a moment value of a feature signifying at least one feature importance value of the feature for the documents. The moment value is used by a decision module to calculate a decision score. Basis the decision score the documents are split into two sets. A clustering module then forms clusters of the two sets of documents basis n-tuple word distribution. Finally, a convergence module congregates the clusters in a final set of documents.
展开▼