Extracting Information from Semi-Structured Web Pages by Considering User's Context



Abstract

Nowadays, many users rely on web search engines to find and gather information. Users face a growing number of diverse semi-structured information sources, so correlating, integrating, and presenting related information to them has become an important issue. When a user queries a search engine such as Yahoo or Google for specific information, the results indicate not only where the desired information is available but also other pages on which it is mentioned, and the number of returned pages is enormous. The performance, the overlap among results for the same queries, and the limitations of web search engines are therefore an important and large area of research. Extracting information from web data sources has also become very important because of the massive and increasing number of diverse semi-structured information sources available to users on the Internet, and the variety of web pages makes information extraction from the web a challenging problem. The problem is harder still because extracted information that is relevant to one user might not be relevant to other users. An information extraction approach that considers the user's context, and more specifically the user's preferences, would therefore provide better results to that user. This paper proposes a framework for extracting information from semi-structured web pages by considering the user's context.


Bibliographic details

Source: International Conference on Information Knowledge Engineering, 2010, 6 pages
Authors: Mahmoud Shaker; Hamidah Ibrahim; Ali Alwan; Aida Mustapha; Lili Nurliyana Abdullah
Original format: PDF
Chinese Library Classification: G20-53
Keywords: Extracting Information; Web Pages; User's Context

