首页> 外文期刊>Decision support systems >Managing knowledge on the Web - Extracting ontology from HTML Web
【24h】

Managing knowledge on the Web - Extracting ontology from HTML Web

机译:在Web上管理知识-从HTML Web提取本体

获取原文
获取原文并翻译 | 示例

摘要

In recent years, the Internet has become one of the most important sources of information, and it is now imperative that companies are able to collect, retrieve, process, and manage information from the Web. However, due to the sheer amount of information available, browsing web content by searches using keywords is inefficient, largely because unstructured HTML web pages are written for human comprehension and not for direct machine processing. For the same reason, the degree of web automation is limited. It is recognized that semantics can enhance web automation, but it will take an indefinite amount of effort to convert the current HTML Web into the Semantic Web. This study proposes a novel ontology extractor, called OntoSpider, for extracting ontology from the HTML Web. The contribution of this work is the design and implementation of a six-phase process that includes the preparation, transformation, clustering, recognition, refinement, and revision for extracting ontology from unstructured HTML pages. The extracted ontology provides structured and relevant information for applications such as e-commerce and knowledge management that can be compared and analyzed more effectively. We give detailed information on the system and provide a series of experimental results that validate the system design and illustrate the effectiveness of OntoSpider.
机译:近年来,Internet已成为最重要的信息源之一,现在,公司必须能够从Web收集,检索,处理和管理信息。但是,由于可用的信息量很大,因此通过使用关键字进行的搜索来浏览Web内容的效率很低,这在很大程度上是因为编写非结构化HTML网页是为了使人理解而不是直接用于机器处理。由于相同的原因,Web自动化的程度受到限制。人们已经认识到语义可以增强Web自动化,但是要将当前的HTML Web转换为语义Web会花费无限的精力。这项研究提出了一种新颖的本体提取器,称为OntoSpider,用于从HTML Web提取本体。这项工作的目的是设计和实现一个六个阶段的过程,包括从非结构化HTML页面提取本体的准备,转换,聚类,识别,细化和修订。提取的本体为电子商务和知识管理等应用程序提供了结构化且相关的信息,可以更有效地进行比较和分析。我们将提供有关该系统的详细信息,并提供一系列实验结果,这些结果可验证系统设计并说明OntoSpider的有效性。

著录项

  • 来源
    《Decision support systems》 |2009年第4期|319-331|共13页
  • 作者

    Timon C. Du; Feng Li; Irwin King;

  • 作者单位

    Department of Decision Sciences and Managerial Economics. The Chinese University of Hong Kong, Hong Kong;

    School of Business Administration. South China University of Technology, China;

    Department of Computer Science and Engineering. The Chinese University of Hong Kong, Hong Kong;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    ontology; semantic web; knowledge management applications; intelligent web services;

    机译:本体语义网知识管理应用;智能网络服务;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号