首页> 外文期刊>Journal of Information Recording >Breaking Text-Based CAPTCHAs using Average Vertical Partition
【24h】

Breaking Text-Based CAPTCHAs using Average Vertical Partition

机译:使用平均垂直分区打破基于文本的验证码

获取原文
获取原文并翻译 | 示例
       

摘要

CAPTCHA, which stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart, has been widely used as a security mechanism to defend against automated registration, spam and malicious bot programs. There have been many successful attacks on CAPTCHAs deployed by popular websites, e.g., Google, Yahoo!, and Microsoft. However, most of these methods are ad hoc, and they have lost efficacy with the evolution of CAPTCHA. In this paper, we propose a simple but effective attack on text-based CAPTCHA that uses machine learning to solve the segmentation and recognition problems simultaneously. The method first divides a CAPTCHA image into average blocks and attempts to combine adjacent blocks to form individual characters. A modified K-Nearest Neighbor (KNN) engine is used to recognize these combinations, and using a Dynamic Programming (DP) graph search algorithm, the most likely combinations are selected as the final result. We tested our attack on the popular CAPTCHAs deployed by the top 20 Alexa ranked websites. The success rates range from 5.0% to 74.0%, illustrating the effectiveness and universality of our method. We also tested the applicability of our method on three well-known CAPTCHA schemes. Our attack casts serious doubt on the security of existing text-based CAPTCHAs; therefore, guidelines for designing better text-based CAPTCHAs are discussed at the end of this paper.
机译:CAPTCHA代表完全自动化的公共Turing测试,以告诉计算机和人类以外的人,已被广泛用作一种安全机制,以防御自动注册,垃圾邮件和恶意bot程序。流行的网站(例如Google,Yahoo!和Microsoft)部署了许多对CAPTCHA的成功攻击。但是,这些方法大多数都是临时性的,并且随着CAPTCHA的发展而失去了功效。在本文中,我们针对基于文本的CAPTCHA提出了一种简单而有效的攻击,该攻击使用机器学习同时解决了分割和识别问题。该方法首先将CAPTCHA图像划分为平均块,然后尝试组合相邻的块以形成单个字符。修改后的K最近邻(KNN)引擎用于识别这些组合,并使用动态编程(DP)图形搜索算法,选择最可能的组合作为最终结果。我们测试了对排名前20位的Alexa网站部署的流行验证码的攻击。成功率从5.0%到74.0%,说明了我们方法的有效性和普遍性。我们还测试了我们的方法在三个著名的验证码方案上的适用性。我们的攻击使现有基于文本的验证码的安全性受到严重怀疑;因此,本文末尾讨论了设计更好的基于文本的验证码的指南。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号