首页> 外文会议>International Conference on Software Engineering and Data Mining >Study on Feature Selection Algorithm in Topic Tracking
【24h】

Study on Feature Selection Algorithm in Topic Tracking

机译:主题跟踪特征选择算法研究

获取原文

摘要

Text classification is the key technology for topic tracking, and vector space model (VSM) is one of the most simple and effective model for topics representation. Feature selection algorithm in VSM is an important means of data pre-processing, and it can reduce vector space dimension and improve the generalization ability of the algorithm. Therefore, it is necessary for feature selection algorithms to be in-depth and extensive research. So we study how feature space dimension and feature selection algorithm affect topic tracking. Then we get the variation law that they affect topic tracking, and add up their optimal values in topic tracking. Finally, TDT evaluation methods prove that optimal topic tracking performance based on weight of evidence for text increases by 8.762% more than mutual information.
机译:文本分类是主题跟踪的关键技术,传染媒介空间模型(VSM)是主题表示最简单有效的模型之一。 VSM中的特征选择算法是数据预处理的重要手段,可以减少矢量空间尺寸并提高算法的泛化能力。因此,需要选择算法是深入和广泛的研究。因此,我们研究了功能空间维度和特征选择算法如何影响主题跟踪。然后我们得到了它们影响主题跟踪的变体法,并在主题跟踪中加起了最佳值。最后,TDT评估方法证明了基于文本证据重量的最佳主题跟踪性能增加了8.762%,而不是相互信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号