首页> 外文会议>International Conference on Multimedia Information Networking and Security >A Categorization Algorithm for Harmful Text Information Filtering
【24h】

A Categorization Algorithm for Harmful Text Information Filtering

机译:有害文本信息过滤的分类算法

获取原文

摘要

Harmful text information filtering is a typical pattern recognition problem of small sample, the prediction result of classifier was biased towards the class with more samples, because of the samples that including the harmful information were difficult to gain. Construct virtual samples is an effective means to solve the problem of pattern recognition in the small sample, using the up-sampling method to construct virtual samples in the data layer, the traditional KNN algorithm has been improved: a small sample set is divided into clusters by using the K-means clustering, the virtual samples are generated and verified the validity in the cluster. The experimental results show that this method can construct the virtual samples which are similar to the real sample characteristics, and expand the small sample collection in order to effectively identify the harmful text information.
机译:有害文本信息滤波是小样本的典型模式识别问题,分类器的预测结果与更多样本偏向课程,因为包括有害信息难以获得的样本。 构造虚拟样本是解决小型样本中模式识别问题的有效手段,使用上采样方法构建数据层中的虚拟样本,传统的KNN算法已经提高:小样本集被分成簇 通过使用K-means群集,生成虚拟样本并验证群集中的有效性。 实验结果表明,该方法可以构建类似于真实样本特性的虚拟样本,并扩展小样本收集,以有效地识别有害文本信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号