Feature selection is an essential part of text categorization, which can effectively improve classification precision and efficiency. With some drawbacks proposed from traditional IG approach, an optimized approach that takes concentration and distribution into account is proposed for improving IG approach. The experimental results show that the improved IG approach is superior to traditional IG approach in feature selection.
展开▼