首页> 外文会议>The 2nd International Conference on Software Engineering and Data Mining >Study on feature selection algorithm in topic tracking
【24h】

Study on feature selection algorithm in topic tracking

机译:主题跟踪中的特征选择算法研究

获取原文

摘要

Text classification is the key technology for topic tracking, and vector space model (VSM) is one of the most simple and effective model for topics representation. Feature selection algorithm in VSM is an important means of data pre-processing, and it can reduce vector space dimension and improve the generalization ability of the algorithm. Therefore, it is necessary for feature selection algorithms to be in-depth and extensive research. So we study how feature space dimension and feature selection algorithm affect topic tracking. Then we get the variation law that they affect topic tracking, and add up their optimal values in topic tracking. Finally, TDT evaluation methods prove that optimal topic tracking performance based on weight of evidence for text increases by 8.762% more than mutual information.
机译:文本分类是主题跟踪的关键技术,向量空间模型(VSM)是主题表示最简单有效的模型之一。 VSM中的特征选择算法是数据预处理的重要手段,它可以减小向量空间维数,提高算法的泛化能力。因此,有必要对特征选择算法进行深入而广泛的研究。因此,我们研究了特征空间维数和特征选择算法如何影响主题跟踪。然后我们得到它们影响主题跟踪的变异定律,并在主题跟踪中将其最佳值相加。最后,TDT评估方法证明,基于文本证据权重的最佳主题跟踪性能比互信息增加了8.762%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号