Text documents are often represented as high-dimensional and sparse vectors using words as features in a multidimensional space. These vectors require a large number of computer resources and it is difficult to capture underlying concepts referred to by the terms. In this paper, we propose to use the technique of dimensionality reduction using Concept Vectors Based on PDDP as a way of solving these problems in the vector space information retrieval model. we give experimental results of the dimensionality reduction by using this method and show that this method is an improvement over conventional vector space model.
展开▼