首页> 外文会议>International Conference on Practical Applications of Agents and Multiagent Systems >Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser

【24h】

Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser

机译：使用Web浏览器中的HTML元素的功能挖掘网页

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Web is the largest repository of useful information available for human users, but it is usual that Web Pages do not provide an API to get access to its information automatically. In order to solve this problem, Information Extractors are developed. We present a new methodology to induce Information Extractors from the Web. It is based on rendering HTML elements in the Web browser. The methodology uses a KDD process to mining a dataset with features of the elements in the Web page. An experimentation over 10 web sites has been made and the results show the effectiveness of the methodology.

机译：Web是人类用户可用的最大信息存储库，但通常的网页不提供API以自动访问其信息。为了解决这个问题，开发了信息提取器。我们提出了一种新的方法来引导来自网络的信息提取器。它基于呈现Web浏览器中的HTML元素。该方法使用KDD进程来挖掘数据集，其中包含网页中元素的功能。已经进行了超过10个网站的实验，结果表明了方法的有效性。

著录项

来源
《International Conference on Practical Applications of Agents and Multiagent Systems 》|2010年||共8页
会议地点
作者
F.J. Fernandez; Jose L. Alvarez; Pedro J. Abad; Patricia Jimenez;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论 ;
关键词
Wrapper generation; web data extraction; data mining;

机译：包装器生成;网络数据提取;数据挖掘;

相似文献

外文文献
中文文献
专利

1. Developing a Tile-Based Rendering Method to Improve Rendering Speed of 3D Geospatial Data with HTML5 and WebGL [J] . Kang Seokchan, Lee Jiyeong Journal of Sensors . 2017 ,第Pta4期

机译：使用HTML5和WebGL开发基于地形渲染方法，提高3D地理空间数据的渲染速度
2. PubMed-EX: a web browser extension to enhance PubMed search with text mining features [J] . Tsai Richard Tzong-Han, Dai Hong-Jie, Lai Po-Ting, Bioinformatics . 2009 ,第22期

机译：PubMed-EX：一种网络浏览器扩展，可通过文本挖掘功能增强PubMed搜索
3. PubMed-EX: a web browser extension to enhance PubMed search with text mining features [J] . Richard Tzong-Han Tsai1* Hong-Jie Dai23 Po-Ting Lai1 and Chi-Hsin Huang2 Bioinformatics . 2009 ,第22期

机译：PubMed-EX：一种网络浏览器扩展，可通过文本挖掘功能增强PubMed搜索
4. Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser [C] . F.J. Fernandez, Jose L. Alvarez, Pedro J. Abad, Trends in practical applications of agents and multiagent systems . 2011

机译：使用Web浏览器中呈现HTML元素的功能来挖掘网页
5. Block-scoped Access Restriction Technique for HTML Content in Web Browsers [D] . Watt, Timothy 2012

机译：Web浏览器中HTML内容的块范围访问限制技术
6. Web Browser as Medical Educator/Researcher Using HTML JavaScript [O] . Craig W. Johnson, George Oser, Allan J. Abedor 1998

机译：使用HTML和JavaScript作为医学教育者/研究者的Web浏览器
7. PubMed-EX: a web browser extension to enhance PubMed search with text mining features [O] . Richard Tzong-Han Tsai, Hong-Jie Dai, Po-Ting Lai, 2009

机译：PubMed-ex：Web浏览器扩展，以增强文本挖掘功能的PubMed搜索

Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser

摘要

著录项

相似文献

相关主题

期刊订阅