Analyzing Information Retrieval Methods to Recover Broken Web Links

机译：分析信息检索方法以恢复断开的Web链接

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this work we compare different techniques to automatically find candidate web pages to substitute broken links. We extract information from the anchor text, the content of the page containing the link, and the cache page in some digital library. The selected information is processed and submitted to a search engine. We have compared different information retrieval methods for both, the selection of terms used to construct the queries submitted to the search engine, and the ranking of the candidate pages that it provides, in order to help the user to find the best replacement. In particular, we have used term frequencies, and a language model approach for the selection of terms; and cooccurrence measures and a language model approach for ranking the final results. To test the different methods, we have also defined a methodology which does not require the user judgments, what increases the objectivity of the results.

机译：在这项工作中，我们比较了不同的技术来自动查找候选网页来替换断开的链接。我们从锚文本，包含链接的页面内容以及某些数字图书馆中的缓存页面中提取信息。所选信息将被处理并提交给搜索引擎。我们比较了两种不同的信息检索方法，即用于构造提交给搜索引擎的查询的术语的选择以及它提供的候选页面的排名，以帮助用户找到最佳的替换方法。特别是，我们使用术语频率和语言模型方法来选择术语。并发措施和语言模型方法来对最终结果进行排名。为了测试不同的方法，我们还定义了一种方法，该方法不需要用户判断，从而增加了结果的客观性。

著录项

来源
《Advances in information retrieval》|2010年|p.26-37|共12页
会议地点 Milton Keynes(GB);Milton Keynes(GB)
作者
Juan Martinez-Romo; Lourdes Araujo;
展开▼
作者单位

NLP IR Group, UNED, Madrid 28040, Spain;

NLP IR Group, UNED, Madrid 28040, Spain;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词
information retrieval; link integrity; recommender system;

机译：信息检索；链接完整性；推荐系统;

相似文献

外文文献
中文文献
专利

1. Analyzing Broken Links on the Web of Data: An Experiment With DBpedia [J] . Enayat Rajabi, Salvador Sanchez-Alonso, Miguel-Angel Sicilia Journal of the American Society for Information Science and Technology . 2014,第8期

机译：分析数据网上的断开链接：DBpedia的实验
2. Recovering Traceability Links in Software Artifact Management Systems using Information Retrieval Methods [J] . ANDREA DE LUCIA, FAUSTO FASANO, ROCCO OLIVETO, ACM transactions on software engineering and methodology . 2007,第4期

机译：使用信息检索方法在软件工件管理系统中恢复可追溯性链接
3. Repairing broken RDF links in the web of data [J] . Mohammad Pourzaferani, Mohammad Ali Nematbakhsh International Journal of Web Engineering and Technology . 2013,第4期

机译：修复数据网络中损坏的RDF链接
4. Analyzing Information Retrieval Methods to Recover Broken Web Links [C] . Juan Martinez-Romo, Lourdes Araujo European Conference on Information Retrieval Research . 2010

机译：分析信息检索方法恢复虚拟Web链接
5. Combining text-, link-, and classification-based retrieval methods to enhance information discovery on the Web. [D] . Yang, Kiduk. 2002

机译：结合基于文本，链接和分类的检索方法，以增强Web上的信息发现能力。
6. An innovative method of retrieval of the broken suction cannula tip during medical termination of pregnancy using a fibrobronchoscope during the coronavirus disease 2019 pandemic [O] . Kamalpreet Singh, Binay Mitra, Kanwarjit Singh, 2021

机译：在冠状病毒疾病中使用纤维藻血症期间妊娠医学终止中的破碎吸入套管尖端的创新方法2019大流行病
7. Analyzing information retrieval methods to recover broken web links [O] . Juan Martinez-romo, Lourdes Araujo 2016

机译：分析信息检索方法以恢复损坏的Web链接
8. A Simulation Methodology for Analyzing Storage and Retrieval Systems. [R] . Morgan, R. W. 1975

机译：一种分析存储和检索系统的仿真方法。

Analyzing Information Retrieval Methods to Recover Broken Web Links

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅