A Categorization Algorithm for Harmful Text Information Filtering

机译：有害文本信息过滤的分类算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Harmful text information filtering is a typical pattern recognition problem of small sample, the prediction result of classifier was biased towards the class with more samples, because of the samples that including the harmful information were difficult to gain. Construct virtual samples is an effective means to solve the problem of pattern recognition in the small sample, using the up-sampling method to construct virtual samples in the data layer, the traditional KNN algorithm has been improved: a small sample set is divided into clusters by using the K-means clustering, the virtual samples are generated and verified the validity in the cluster. The experimental results show that this method can construct the virtual samples which are similar to the real sample characteristics, and expand the small sample collection in order to effectively identify the harmful text information.

机译：有害文本信息滤波是小样本的典型模式识别问题，分类器的预测结果与更多样本偏向课程，因为包括有害信息难以获得的样本。构造虚拟样本是解决小型样本中模式识别问题的有效手段，使用上采样方法构建数据层中的虚拟样本，传统的KNN算法已经提高：小样本集被分成簇通过使用K-means群集，生成虚拟样本并验证群集中的有效性。实验结果表明，该方法可以构建类似于真实样本特性的虚拟样本，并扩展小样本收集，以有效地识别有害文本信息。

著录项

来源
《International Conference on Multimedia Information Networking and Security》|2012年||共4页
会议地点
作者
Du Juan; Yi Zhi an;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP37-53;
关键词
Harmful information filtering; Network information security; Small sample pattern recognition; Virtual sample;

机译：有害信息过滤;网络信息安全;小样本模式识别;虚拟样本;

相似文献

外文文献
中文文献
专利

1. Contextual Text Categorization: An Improved Stemming Algorithm to Increase the Quality of Categorization in Arabic Text [J] . Gadri Said, Moussaoui Abdelouahab The international arab journal of information technology . 2017,第6期

机译：上下文文本分类：一种改进的词干算法，可提高阿拉伯文本分类的质量
2. An intelligent news recommender agent for filtering and categorizing large volumes of text corpus [J] . Chiang JH., Chen YC. International Journal of Intelligent Systems . 2004,第3期

机译：一个智能的新闻推荐代理，用于对大量文本语料库进行过滤和分类
3. Improving performance of text categorization by combining filtering and support vector machines [J] . Irene Diaz, Jose Ranilla, Elena Montanes, Journal of the American Society for Information Science and Technology . 2004,第7期

机译：通过结合使用过滤器和支持向量机来提高文本分类的性能
4. A Categorization Algorithm for Harmful Text Information Filtering [C] . Du Juan, Yi Zhi an 2012 Fourth International Conference on Multimedia Information Networking and Security. . 2012

机译：一种有害文本信息过滤的分类算法
5. Study of feature selection algorithms for text-categorization. [D] . Dave, Kandarp. 2011

机译：用于文本分类的特征选择算法的研究。
6. Prospective Validation of Text Categorization Filters for Identifying High-Quality Content-Specific Articles in MEDLINE. [O] . Y. Aphinyanaphongs, C.F. Aliferis 2006

机译：文本分类过滤器的前瞻性验证用于识别MEDLINE中的高质量特定内容的文章。
7. Answer filtering via Text Categorization in Question Answering Systems [O] . Alessandro Moschitti 2015

机译：通过问答系统中的文本分类来回答过滤
8. Categorization of Survey Text Utilizing Natural Language Processing and Demographic Filtering. [R] . Cairoli, C. M. 2017

机译：利用自然语言处理和人口过滤对调查文本进行分类。

A Categorization Algorithm for Harmful Text Information Filtering

摘要

著录项

相似文献

相关主题

期刊订阅