【24h】

Study on key technology of topic tracking based on VSM

机译:基于VSM的话题跟踪关键技术研究

获取原文

摘要

Text classification is the key technology for topic tracking, and vector space model (VSM) is one of the most simple and effective models for topics representation. On the basis of information gain algorithm and chi square χ2 in VSM, we have studied how feature selection algorithm and feature dimension in VSM affect topic tracking. And then we get the variation law that they affect topic tracking, and add up their optimal values in topic tracking. Finally, TDT evaluation method proves that their optimal values can make topic tracking gain very good tracking performance. In addition, we also prove in the experiment that chi square χ2 in VSM has better performance for topic tracking than information gain algorithm.
机译:文本分类是主题跟踪的关键技术,向量空间模型(VSM)是主题表示最简单有效的模型之一。基于VSM中的信息增益算法和卡方χ 2 ,我们研究了VSM中的特征选择算法和特征量如何影响主题跟踪。然后,我们得到了它们影响主题跟踪的变异定律,并在主题跟踪中将它们的最佳值相加。最后,TDT评估方法证明了它们的最佳值可以使主题跟踪获得非常好的跟踪性能。此外,我们还通过实验证明,VSM中的卡方χ 2 具有比信息增益算法更好的主题跟踪性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号