首页> 外文会议>Second IIAI International Conference on Advanced Applied Informatics >Proposal of Seam Degree and Content Similarity for Web Page Segmentation
【24h】

Proposal of Seam Degree and Content Similarity for Web Page Segmentation

机译:关于网页细分的接缝度和内容相似度的建议

获取原文
获取原文并翻译 | 示例

摘要

Page segmentation has received great attention in recent years. However, most research has been based on some pre-defined heuristics or visual cues which may be not suitable for large-scale page segmentation. In this paper, we proposed two parameters: seam degree and content similarity, to indicate the coherent degree of a page block. Instead of analyzing pre-defined heuristics or visual cues, our method utilizes the visual and content features to determine whether a page block should be divided into smaller blocks. We also proposed a principled page segmentation method using these two parameters. An experiment was conducted to determine the relationship between the two parameters and the number of segment results. The empirical results also show that our segmentation method can effectively segment a page into different semantic parts.
机译:近年来,页面分割受到了极大的关注。但是,大多数研究都是基于一些预定义的启发式方法或视觉提示,这些提示可能不适用于大规模页面分割。在本文中,我们提出了两个参数:接缝度和内容相似度,以指示页面块的相干度。我们的方法不是分析预定义的启发式方法或视觉提示,而是利用视觉和内容功能来确定是否应将页面块划分为较小的块。我们还提出了使用这两个参数的原则页面分割方法。进行了一项实验,以确定两个参数与分段结果数之间的关系。实验结果还表明,我们的分割方法可以有效地将页面分割为不同的语义部分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号