Near-Duplicates Detection and Elimination Based on Web Provenance for Effective Web Search

Y. Syed Mudhasir; J. Deepika; S. Sendhilkumar; G.S. Mahalakshmi

首页> 外文期刊>International Journal on Internet and Distributed Computing Systems >Near-Duplicates Detection and Elimination Based on Web Provenance for Effective Web Search

【24h】

Near-Duplicates Detection and Elimination Based on Web Provenance for Effective Web Search

机译：基于Web来源的近重复检测和消除以实现有效的Web搜索

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Users of World Wide Web utilize search engines for information retrieval in web as search engines play a vital role in finding information on the web. However, the performance of a web search is greatly affected by flooding of search results with information that is redundant in nature i.e., existence of near-duplicates. Such near-duplicates holdup the other promising results to the users. Many of these near-duplicates are from distrusted websites and/or authors who host information on web. Such near-duplicates may be eliminated by means of Provenance. Thus, this paper proposes a novel approach to identify such near-duplicates based on provenance. In this approach a provenance model has been built using web pages which are the search results returned by existing search engine. The proposed model combines both content based and trust based factors for classifying the results as original or near-duplicates

机译：万维网的用户利用搜索引擎在网络中检索信息，因为搜索引擎在寻找网络信息方面起着至关重要的作用。但是，网络搜索的性能在很大程度上受到搜索结果泛滥的影响，这些信息本质上是多余的，即存在重复项。这样的重复几乎为用户带来了其他有希望的结果。这些近重复项中有许多来自不信任的网站和/或在网络上托管信息的作者。可以通过出处消除这种重复的现象。因此，本文提出了一种新颖的方法来基于来源鉴定这种近重复。在这种方法中，已使用网页建立了物源模型，这些网页是现有搜索引擎返回的搜索结果。提出的模型结合了基于内容和基于信任的因素，将结果分类为原始或近似重复

著录项

来源
《International Journal on Internet and Distributed Computing Systems》 |2011年第1期|共11页
作者
Y. Syed Mudhasir; J. Deepika; S. Sendhilkumar; G.S. Mahalakshmi;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类 TP311.13;
关键词

相似文献

外文文献
中文文献
专利

1. Concept-based near-duplicate video clip detection for novelty re-ranking of web video search results [J] . Chidansh A. Bhatt, Pradeep K. Atrey, Mohan S. Kankanhalli Multimedia Systems . 2012,第4期

机译：基于概念的近重复视频剪辑检测，可对网络视频搜索结果进行新颖性重新排名
2. Real-Time Near-Duplicate Elimination for Web Video Search With Content and Context [J] . Wu X., Ngo C.-W., Hauptmann A. G., IEEE transactions on multimedia . 2009,第2期

机译：具有内容和上下文的Web视频搜索的实时近乎重复消除
3. Mining Near-Duplicate Graph for Cluster-Based Reranking of Web Video Search Results [J] . Zl HUAN, BO HU, HONG CHENG, ACM Transactions on Information Systems . 2010,第4期

机译：挖掘近似重复的图，用于基于群集的Web视频搜索结果排名
4. Practical elimination of near-duplicates from web video search [C] . Xiao Wu, Alexander G. Hauptmann, Chong-Wah Ngo, Proceedings of the 15th international conference on Multimedia . 2007

机译：实际消除网络视频搜索中的重复项
5. Providing content by Web -based delivery methods: Using digital video, instructor -selected Websites, and search engines, to deliver information about the principles of behaviorism. [D] . Quinn, Andrew Stewart. 2004

机译：通过基于Web的传递方法提供内容：使用数字视频，讲师选择的网站和搜索引擎来传递有关行为主义原理的信息。
6. Intelligent Image-Based Railway Inspection System Using Deep Learning-Based Object Detection and Weber Contrast-Based Image Comparison [O] . Jinbeum Jang, Minwoo Shin, Sohee Lim, 2019

机译：基于深度学习的目标检测和基于Weber对比度的图像比较的基于图像的铁路智能检查系统
7. LARGE-SCALE NEAR-DUPLICATE WEB VIDEO SEARCH: CHALLENGE AND OPPORTUNITY [O] . Wan-lei Zhao, Song Tan, Chong-wah Ngo 2013

机译：大规模的近乎重复的网络视频搜索：挑战和机会

Near-Duplicates Detection and Elimination Based on Web Provenance for Effective Web Search

摘要

著录项

相似文献

相关主题

期刊订阅