Conference: 2013 International Conference on Computing, Networking and Communications

TFD: A multi-pattern matching algorithm for large-scale URL filtering

Abstract

During the past decade, URL filtering systems have been widely deployed to prevent users from browsing undesirable or malicious websites. However, key URL filtering methods, such as URL blacklist filtering, remain challenging because of the limited performance of existing multi-pattern matching algorithms. In this paper, we propose TFD, a multi-pattern matching algorithm for large-scale, high-speed URL filtering. TFD combines Two-phase hash, Finite state machine, and Double-array storage to eliminate the performance bottleneck of the blacklist filter. Experimental results show that TFD outperforms existing work in matching speed, preprocessing time, and memory usage. In particular, on large-scale URL pattern sets (over 10 million URLs), TFD's single-threaded matching speed exceeds 100 Mbps on a general-purpose x86 platform.
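The abstract does not describe TFD's internals, so the sketch below is not TFD. It only illustrates the general idea of blacklist filtering with a finite-state-machine multi-pattern matcher, using the classical Aho-Corasick automaton with plain dictionary transitions. It omits the two-phase hash and the double-array storage entirely. The class name `BlacklistMatcher`, its methods, and the example patterns and URLs are all hypothetical.

```python
from collections import deque


class BlacklistMatcher:
    """Classical Aho-Corasick automaton (an illustration, not TFD)."""

    def __init__(self, patterns):
        # State 0 is the root. goto[s] maps a character to the next state.
        self.goto = [{}]
        self.fail = [0]   # failure link of each state
        self.out = [[]]   # patterns that end at each state
        for pattern in patterns:
            self._insert(pattern)
        self._build_failure_links()

    def _insert(self, pattern):
        # Walk the trie, creating states as needed.
        state = 0
        for ch in pattern:
            nxt = self.goto[state].get(ch)
            if nxt is None:
                nxt = len(self.goto)
                self.goto.append({})
                self.fail.append(0)
                self.out.append([])
                self.goto[state][ch] = nxt
            state = nxt
        self.out[state].append(pattern)

    def _build_failure_links(self):
        # Breadth-first traversal; depth-1 states keep failure link 0.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for ch, nxt in self.goto[state].items():
                queue.append(nxt)
                f = self.fail[state]
                while f and ch not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(ch, 0)
                # Inherit matches that end at the failure target.
                self.out[nxt] = self.out[nxt] + self.out[self.fail[nxt]]

    def search(self, url):
        """Return (start_index, pattern) for every pattern found in url."""
        state = 0
        hits = []
        for i, ch in enumerate(url):
            while state and ch not in self.goto[state]:
                state = self.fail[state]
            state = self.goto[state].get(ch, 0)
            for pattern in self.out[state]:
                hits.append((i - len(pattern) + 1, pattern))
        return hits

    def is_blocked(self, url):
        return bool(self.search(url))


# Hypothetical blacklist entries and URLs, for illustration only.
matcher = BlacklistMatcher(["bad.example/malware", "ads.example", "example/mal"])
print(matcher.is_blocked("http://bad.example/malware.exe"))
print(matcher.is_blocked("http://good.example/index.html"))
```

Each input character is processed once regardless of the number of patterns, which is why automaton-based matchers are a common starting point for blacklist filters. A dictionary per state is memory-hungry at the scale the abstract mentions (over 10 million URLs), which is presumably the kind of cost a compact storage scheme such as a double array is meant to reduce.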
