首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Automatic Website Summarization by Image Content: A Case Study with Logo and Trademark Images
【24h】

Automatic Website Summarization by Image Content: A Case Study with Logo and Trademark Images

机译:通过图像内容自动进行网站摘要:带有徽标和商标图像的案例研究

获取原文
获取原文并翻译 | 示例

摘要

Image-based abstraction (or summarization) of a Web site is the process of extracting the most characteristic (or important) images from it. The criteria for measuring the importance of images in Web sites are based on their frequency of occurrence, characteristics of their content and Web link information. As a case study, this work focuses on logo and trademark images. These are important characteristic signs of corporate Web sites or of products presented there. The proposed method incorporates machine learning for distinguishing logo and trademarks from images of other categories (e.g., landscapes, faces). Because the same logo or trademark may appear many times in various forms within the same Web site, duplicates are detected and only unique logo and trademark images are extracted. These images are then ranked by importance taking frequency of occurrence, image content and Web link information into account. The most important logos and trademarks are finally selected to form the image-based summary of a Web site. Evaluation results of the method on real Web sites are also presented. The method has been implemented and integrated into a fully automated image-based summarization system which is accessible on the Web (www.intelligence.tuc.gr/websummarization)
机译:网站的基于图像的抽象(或摘要)是从网站中提取最具特征(或重要)的图像的过程。衡量网站中图像重要性的标准是基于图像的出现频率,内容的特征和Web链接信息。作为案例研究,这项工作着重于徽标和商标图像。这些是公司网站或此处提供的产品的重要特征标志。所提出的方法结合了机器学习,用于区分徽标和商标与其他类别的图像(例如风景,面部)。因为相同的徽标或商标可能以相同的形式多次出现在同一网站中,所以将检测到重复项,并且仅提取唯一的徽标和商标图像。然后,根据出现的频率,图像内容和Web链接信息,按照重要性对这些图像进行排名。最后选择最重要的徽标和商标,以形成基于图像的网站摘要。还介绍了该方法在真实网站上的评估结果。该方法已实现并集成到可在Web上访问的基于图像的全自动摘要系统(www.intelligence.tuc.gr/websummarization)

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号