With the rapid growth of the Web, the need to classify large volumes of text is steadily increasing. Texts represented as bags of words usually have very high dimensionality in the input space, and often also in the output space when they are labeled with many categories. Neural classifiers must therefore be adapted to such large-scale data. We present a scalable extension to the fuzzy Adaptive Resonance Associative Map (ARAM) neural network, developed specifically for high-dimensional and large data. The extension aims to increase classification speed by adding an extra ART layer that groups the learned prototypes into large clusters. The activation of all prototypes can then be replaced by the activation of a small fraction of them, which significantly reduces classification time. The extension can be especially useful for multi-label classification tasks.
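The speed-up idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how the extra ART layer is trained or how many clusters are searched. The sketch assumes the standard fuzzy ART choice function, cluster weights formed as the element-wise minimum of member prototypes, and a fixed prototype-to-cluster assignment. All function names and the `top_k` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple


def fuzzy_activation(x: Sequence[float], w: Sequence[float], alpha: float = 0.001) -> float:
    """Fuzzy ART choice function |x AND w| / (alpha + |w|), with AND = element-wise min."""
    overlap = sum(min(xi, wi) for xi, wi in zip(x, w))
    return overlap / (alpha + sum(w))


def build_cluster_weights(prototypes: List[List[float]],
                          members: List[List[int]]) -> List[List[float]]:
    """Assumption: a cluster weight is the element-wise min of its member prototypes."""
    return [[min(col) for col in zip(*(prototypes[j] for j in group))]
            for group in members]


def flat_search(x: Sequence[float], prototypes: List[List[float]]) -> int:
    """Baseline: activate every prototype and return the index of the winner."""
    return max(range(len(prototypes)),
               key=lambda j: fuzzy_activation(x, prototypes[j]))


def clustered_search(x: Sequence[float],
                     prototypes: List[List[float]],
                     cluster_weights: List[List[float]],
                     members: List[List[int]],
                     top_k: int = 1) -> Tuple[int, int]:
    """Activate cluster nodes first, then only the prototypes in the top_k clusters.

    Returns (winning prototype index, number of prototypes actually activated).
    """
    ranked = sorted(range(len(cluster_weights)),
                    key=lambda c: fuzzy_activation(x, cluster_weights[c]),
                    reverse=True)
    candidates = [j for c in ranked[:top_k] for j in members[c]]
    best = max(candidates, key=lambda j: fuzzy_activation(x, prototypes[j]))
    return best, len(candidates)


# Toy data: four prototypes grouped into two clusters (illustrative values only).
prototypes = [
    [0.9, 0.8, 0.0, 0.1],
    [0.8, 0.9, 0.1, 0.0],
    [0.0, 0.1, 0.9, 0.8],
    [0.1, 0.0, 0.8, 0.9],
]
members = [[0, 1], [2, 3]]
cluster_weights = build_cluster_weights(prototypes, members)

x = [0.9, 0.7, 0.0, 0.2]
winner_flat = flat_search(x, prototypes)
winner_clustered, n_activated = clustered_search(x, prototypes, cluster_weights, members)
print(winner_flat, winner_clustered, n_activated)
```

In this toy case the clustered search activates two cluster nodes and two prototypes instead of four prototypes. Any real saving depends on having many prototypes per cluster, and a small `top_k` may miss the true winner when clusters overlap.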