Learning to Extract Symbolic Knowledge from the World Wide Web

Abstract

The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understandable knowledge base whose content mirrors that of the World Wide Web. Such a knowledge base would enable much more effective retrieval of Web information, and promote new uses of the Web to support knowledge based inference and problem solving. Our approach is to develop a trainable information extraction system that takes two inputs. The first is an ontology that defines the classes (e.g., Company, Person, Employee, Product) and relations (e.g., Employed.By, Produced.By) of interest when creating the knowledge base. The second is a set of training data consisting of labeled regions of hypertext that represent instances of these classes and relations. Given these inputs, the system learns to extract information from other pages and hyperlinks on the Web. This paper describes our general approach, several machine learning algorithms for this task, and promising initial results with a prototype system that has created a knowledge base describing university people, courses, and research projects.
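The two inputs described in the abstract can be pictured with a minimal Python sketch. The class and relation names come from the abstract; everything else is an assumption made for illustration: the `Ontology` and `LabeledPage` shapes, the domain and range assigned to each relation, the toy training texts, and the choice of a word-based naive Bayes classifier (the abstract mentions several learning algorithms without naming them). It is not the authors' implementation.

```python
import math
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass
class Ontology:
    """Classes and relations of interest (hypothetical container)."""
    classes: list
    relations: dict  # relation name -> (domain class, range class), assumed


@dataclass
class LabeledPage:
    """One training example: page text plus its class label (hypothetical)."""
    text: str
    label: str


class NaiveBayesPageClassifier:
    """Multinomial naive Bayes over page words, with Laplace smoothing."""

    def __init__(self, ontology):
        self.ontology = ontology
        self.word_counts = defaultdict(Counter)  # class -> word frequencies
        self.class_counts = Counter()            # class -> number of pages
        self.vocab = set()

    def train(self, pages):
        for page in pages:
            if page.label not in self.ontology.classes:
                raise ValueError(f"unknown class: {page.label}")
            words = page.text.lower().split()
            self.class_counts[page.label] += 1
            self.word_counts[page.label].update(words)
            self.vocab.update(words)

    def classify(self, text):
        """Return the class with the highest log posterior for the text."""
        words = text.lower().split()
        total_pages = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_pages in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_pages / total_pages)
            for word in words:
                score += math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Input 1: the ontology. Domains and ranges are assumed, not from the source.
ontology = Ontology(
    classes=["Company", "Person", "Employee", "Product"],
    relations={
        "Employed.By": ("Employee", "Company"),
        "Produced.By": ("Product", "Company"),
    },
)

# Input 2: labeled training data (toy examples invented for this sketch).
training_pages = [
    LabeledPage("home page of a professor research interests teaching", "Person"),
    LabeledPage("our products and services contact sales", "Company"),
]

classifier = NaiveBayesPageClassifier(ontology)
classifier.train(training_pages)
print(classifier.classify("professor teaching interests"))
```

The sketch covers only class instances. Extracting relation instances such as Employed.By would also need a learner over hyperlinks or page pairs, which is left out here.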

Bibliographic details

Authors
Craven, M.; McCallum, A.; DiPasquo, D.; Mitchell, T.; Freitag, D.
Year: 1998
Pages: 1-53
Total pages: 53
Original format: PDF
Language: English
Chinese Library Classification: Industrial technology
Keywords
Information retrieval; Internet; Hypertext; Algorithms; Data management; Information exchange; Computer communications; Learning machines; Man computer interface

Similar literature

1. A Machine Learning Method for Extracting Symbolic Knowledge from Recurrent Neural Networks [J]. A. Vahed, C.W. Omlin. Neural Computation. 2004, No. 1

2. Media Reviews: Center for Organizational Learning, Innovation and Knowledge website, Institute for Innovation and Knowledge Management website, and Learning in the Modern Workplace blog [J]. Teresa Rebelo. The Learning Organization. 2017, No. 4

3. Extracting knowledge from the World Wide Web [J]. Monika Henzinger, Steve Lawrence. Proceedings of the National Academy of Sciences of the United States of America. 2004, Supplement 1

4. Learning to Extract Symbolic Knowledge from the World Wide Web [C]. Mark Craven, Dan DiPasquo, Dayne Freitag. National Conference on Artificial Intelligence. 1999

5. Knowledge discovery in databases of Web use: Data mining for informetric and behavioral models of information seeking on the World Wide Web [D]. Turnbull, Donald R. 2002

6. Colloquium Paper: Mapping Knowledge Domains: Extracting knowledge from the World Wide Web [O]. Monika Henzinger, Steve Lawrence. 2004

7. Extracting knowledge from the World Wide Web [O]. Henzinger, Monika; Lawrence, Steve. 2004
