首页> 外文期刊>International journal of information security and privacy >Analysis and Text Classification of Privacy Policies From Rogue and Top-100 Fortune Global Companies
【24h】

Analysis and Text Classification of Privacy Policies From Rogue and Top-100 Fortune Global Companies

机译:来自流氓和前100家财富全球公司的隐私政策分析和文本分类

获取原文
获取原文并翻译 | 示例
       

摘要

In the present article, the authors investigate to what extent supervised binary classification can be used to distinguish between legitimate and rogue privacy policies posted on web pages. 15 classification algorithms are evaluated using a data set that consists of 100 privacy policies from legitimate websites (belonging to companies that top the Fortune Global 500 list) as well as 67 policies from rogue websites. A manual analysis of all policy content was performed and clear statistical differences in terms of both length and adherence to seven general privacy principles are found. Privacy policies from legitimate companies have a 98% adherence to the seven privacy principles, which is significantly higher than the 45% associated with rogue companies. Out of the 15 evaluated classification algorithms, Naïve Bayes Multinomial is the most suitable candidate to solve the problem at hand. Its models show the best performance, with an AUC measure of 0.90 (0.08), which outperforms most of the other candidates in the statistical tests used.
机译:在本文中,作者调查了监督二进制分类在多大程度上可用于区分在网页上发布的合法和流氓隐私政策。使用由合法网站(属于财富全球500个列表的公司属于最新的公司)以及来自Rogue网站的67个政策,使用数据集进行分类算法进行评估。对所有策略内容进行了手动分析,并在长度和遵守七项普通隐私原则方面清除统计差异。合法公司的隐私政策有98%的遵守七项隐私原则,该原则明显高于与流氓公司相关的45%。在15个评估的分类算法中,Naïve贝叶斯多行之妇是解决手头问题最合适的候选者。其型号显示出最佳性能,AUC措施为0.90(0.08),这占据了所使用的统计测试中的大多数其他候选者。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号