首页> 外文会议>IEEE International Conference on Data Mining Workshops >Text Classification Models for Web Content Filtering and Online Safety
【24h】

Text Classification Models for Web Content Filtering and Online Safety

机译:Web内容过滤和在线安全的文本分类模型

获取原文

摘要

Living in an era of anywhere anytime connectedness for the great mass, safety and security on the web presents enormous challenges. There is a great need for better content detection systems that can more accurately identify excessively offensive and harmful websites. Web classification models in the early days are limited by the methods and data available. Today advanced developments in computing methodologies and technology have brought us many new and better means for text content analysis, for example new methods for topic extraction, topic modeling and sentiment analysis. Our recent studies suggested the promising potential of combing topic analysis and sentiment analysis in web content classification. This paper further explores new classification models for better classification performance, especially to enhance precision and reduce false positives, by incorporation of semantics in developing classification models and by examination and handling of the issues with the dataset reliability, class imbalance and covariate shift.
机译:生活在一个随时随地的时代,网络上的庞大,安全性和安全性提出了巨大的挑战。迫切需要一种更好的内容检测系统,它可以更准确地识别过度冒犯性和有害的网站。早期的Web分类模型受到可用方法和数据的限制。如今,计算方法和技术的先进发展为我们带来了许多新的更好的文本内容分析方法,例如用于主题提取,主题建模和情感分析的新方法。我们最近的研究表明,在网页内容分类中结合主题分析和情感分析具有广阔的潜力。本文进一步探索了新的分类模型,以通过在开发分类模型中纳入语义并检查和处理与数据集可靠性,类不平衡和协变量偏移有关的问题,从而更好地实现分类性能,特别是提高精度并减少误报。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号