【24h】

Zipf’s Law in Passwords

机译:齐普夫密码定律

获取原文
获取原文并翻译 | 示例
       

摘要

Despite three decades of intensive research efforts, it remains an open question as to what is the underlying distribution of user-generated passwords. In this paper, we make a substantial step forward toward understanding this foundational question. By introducing a number of computational statistical techniques and based on 14 large-scale data sets, which consist of 113.3 million real-world passwords, we, for the first time, propose two Zipf-like models (i.e., PDF-Zipf and CDF-Zipf) to characterize the distribution of passwords. More specifically, our PDF-Zipf model can well fit the popular passwords and obtain a coefficient of determination larger than 0.97; our CDF-Zipf model can well fit the entire password data set, with the maximum cumulative distribution function (CDF) deviation between the empirical distribution and the fitted theoretical model being 0.49%~4.59% (on an average 1.85%). With the concrete knowledge of password distributions, we suggest a new metric for measuring the strength of password data sets. Extensive experimental results show the effectiveness and general applicability of the proposed Zipf-like models and security metric.
机译:尽管经过三十年的深入研究,对于用户生成的密码的基本分布是什么仍然是一个悬而未决的问题。在本文中,我们朝着理解这个基本问题迈出了实质性的一步。通过引入多种计算统计技术并基于14个大规模数据集(其中包含1.133亿个现实世界的密码),我们首次提出了两个类似Zipf的模型(即PDF-Zipf和CDF- Zipf)来表征密码的分布。更具体地说,我们的PDF-Zipf模型可以很好地适应流行的密码,并且确定系数大于0.97;我们的CDF-Zipf模型可以很好地拟合整个密码数据集,经验分布与拟合的理论模型之间的最大累积分布函数(CDF)偏差为0.49%〜4.59%(平均为1.85%)。有了密码分配的具体知识,我们建议了一种用于衡量密码数据集强度的新指标。大量的实验结果证明了拟议的Zipf类模型和安全度量的有效性和普遍适用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号