首页> 外文会议>International Conference on Advanced Materials and Information Technology Processing >User interest based text classification: a novel feature extraction method
【24h】

User interest based text classification: a novel feature extraction method

机译:基于用户兴趣的文本分类:一种新颖的特征提取方法

获取原文

摘要

Text communication is an important means for network and mobile terminal users. Spam text brings great distress to the user and the service provider. Traditional text classification method failed to consider the influence of user interest toward text classification and spam filtering. It brutally analyzed and classified the content of text communication of the user. The paper proposed a user interest feature extraction algorithm access frequency-inverse class frequency (AF-ICF) based on TF-IDF algorithm. It utilizes operate behaviour of the user of the text communication to summarize the degree of a text item that the user concerns. Compared with the existing text classification method, AF-ICF effectively described the interest distribution of the user. It provides an important reference for other text classification algorithms and has very high practical value.
机译:文本通信是网络和移动终端用户的重要手段。垃圾邮件文本给用户和服务提供商带来了极大的痛苦。传统文本分类方法未能考虑用户兴趣对文本分类和垃圾邮件过滤的影响。它粗暴地分析并分类了用户的文本通信的内容。本文提出了一种基于TF-IDF算法的用户兴趣特征提取算法访问频率 - 逆等级频率(AF-ICF)。它利用文本通信的用户的操作行为来总结用户问题的文本项目的程度。与现有文本分类方法相比,AF-ICF有效地描述了用户的利息分布。它为其他文本分类算法提供了一个重要的参考,具有非常高的实用价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号