...
首页> 外文期刊>Annals of the American Thoracic Society >Algorithm of Web Page Similarity Comparison Based on Visual Block
【24h】

Algorithm of Web Page Similarity Comparison Based on Visual Block

机译:基于可视块的网页相似比较算法

获取原文
获取原文并翻译 | 示例

摘要

Phishing often deceives users due to the relative similarity to the true pages on a layout and leads to considerable losses for the society. Consequently, detecting phishing sites has been an urgent activity. By researching phishing web pages using web page screenshots, we discover that this kind of web pages use numerous web page screenshots to achieve the close similarity to the true page and avoid the text and structure similarity detection. This study introduces a new similarity matching algorithm based on visual blocks. First, the RenderLayer tree of the web page is obtained to extract the visual block. Second, an algorithm that will settle the jumbled visual blocks, including the deletion of the small visual blocks and the emergence of the overlapping visual blocks, is designed. Finally, the similarity between the two web pages is assessed. The proposed algorithm sets different thresholds to achieve the optimal missing and false alarm rates.
机译:网络钓鱼经常欺骗用户由于布局上的真实页面的相对相似,并对社会带来相当大的损失。 因此,检测网络钓鱼位点是一种紧迫的活性。 通过使用Web页面屏幕截图研究网络钓鱼网页,我们发现这种网页使用众多网页屏幕截图来实现与真实页面的密切相似性,避免文本和结构相似度检测。 本研究介绍了一种基于视觉块的新的相似性匹配算法。 首先,获得网页的渲染层树以提取视觉块。 其次,设计了一种沉降混乱的视觉块的算法,包括删除小视块和重叠的视觉块的出现。 最后,评估了两个网页之间的相似性。 所提出的算法设置了不同的阈值以实现最佳缺失和误报率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号