Considering Hyper Documents and Context for indexing the Web

机译：考虑用于建立Web索引的超级文档和上下文

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The growth of the Web, with hundreds of millions users and billions of pages, gives new challenges to the Information Retrieval (IR). Most of current systems are based on a re-use of traditional models, which have been developed for textual, atomic and independents documents, and are not adapted to the Web. A promising research orientation consists in studying the impact of Web structure on indexing and querying. Some approaches use Web structure for IR, but most of them consider a "bag-of-links", modelling the Web as a graph with HTML pages as nodes and hypertext links as edges without taking into account the links types. The HyperDocument model presented in this article is based on essential aspects of information description and comprehension: contents, composition, linear or non-linear reading and context. We present the main aspects of our Structured IR System for the Web.

机译：随着拥有数亿用户和数十亿页面的Web的发展，对信息检索（IR）提出了新的挑战。当前大多数系统基于对传统模型的重用，这些传统模型是为文本，原子和独立文档开发的，并且不适合Web。有希望的研究方向在于研究Web结构对索引和查询的影响。有些方法将Web结构用于IR，但大多数方法都将其视为“链接袋”，将Web建模为以HTML页面作为节点，将超文本链接作为边缘的图形，而不考虑链接类型。本文介绍的HyperDocument模型基于信息描述和理解的基本方面：内容，组成，线性或非线性阅读以及上下文。我们介绍了Web的结构化IR系统的主要方面。

著录项

来源
《International Conference on Artificial Intelligence IC-AI'02 Vol.1, Jun 24-27, 2002, Las Vegas, Nevada, USA》|2002年|p.267-273|共7页
会议地点 Las Vegas NV(US);Las Vegas NV(US)
作者
Mathias Gery;
展开▼
作者单位

MRIM Team (Modeling and Multimedia Information Retrieval) CLIPS-IMAG Laboratory B.P. 53, 38041 Grenoble Cedex 9, France;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
web structure; HyperDocument; reading path; relevant area; context;

机译：网络结构；超级文件;阅读路径；有关领域；语境;

相似文献

外文文献
中文文献
专利

1. A novel approach to perform context-based automatic spoken document retrieval of political speeches based on wavelet tree indexing [J] . Gupta Anishka, Yadav Divakar Multimedia Tools and Applications . 2021,第14期

机译：基于小波树索引的基于语境的自动口语文献检索的新方法
2. Large-scale graph indexing using binary embeddings of node contexts for information spotting in document image databases [J] . Riba Pau, Llados Josep, Fornes Alicia, Pattern recognition letters . 2017,第FEBa1期

机译：使用节点上下文的二进制嵌入进行大规模图形索引，以在文档图像数据库中发现信息
3. Multi-Document Summarization using Phrase Context based Indexing and Geometric Model [J] . Kantu. Vijaya Kumar, Abburi. Venkatesh International Journal of Computer Trends and Technology . 2014,第5期

机译：使用基于短语上下文的索引和几何模型进行多文档摘要
4. Considering Hyper Documents and Context for indexing the Web [C] . Mathias Gery International conference on artificial intelligence . 2002

机译：考虑索引Web的超文档和上下文
5. From document clues to descriptive metadata: Document characteristics used by graduate students in judging the usefulness of Web documents. [D] . Lan, Wen-Chin. 2002

机译：从文档线索到描述性元数据：研究生在判断Web文档有用性时使用的文档特征。
6. Prospective Indexing for Enhanced Retrieval of Medical Documents on the World Wide Web [O] . Catherine L. Schell, Richard J. Rathe 1997

机译：万维网上增强检索医疗文档的预期索引
7. Polytematický strukturovaný heslář a jeho potenciál v oblasti třídění a zpřístupňování webových dokumentů: Polythematic Structured Subject Heading System and its potential in the area of indexing and searching web documents 000000026 506__ $$aPublic [O] . 2011

机译：多主题结构化主题词及其在排序和使Web文档可访问性方面的潜力：多主题结构化主题词系统及其在对文档进行索引和搜索中的潜力000000026 506__ $$ aPublic

Considering Hyper Documents and Context for indexing the Web

摘要

著录项

相似文献

相关主题

期刊订阅