首页> 外文OA文献 >A semi-automatic approach for building ontologies from a collection of structured web documents
【2h】

A semi-automatic approach for building ontologies from a collection of structured web documents

机译:一种从结构化Web文档集合中构建本体的半自动方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Many collections of structured documents are available on the web. The collection generally describes the characteristics of entities from a single type, where each page describes one entity. These documents are adequate knowledge sources for building ontologies. As they benefit from a strong and shared layout, they contain less well written text than plain text files but their architecture is very meaningful. Classical linguistic-based methods for identifying concepts and relations are no longer appropriate for analyzing them. The approach we propose in this paper exploits various properties of such documents, combining layout/formatting analysis and linguistic analysis, and using semantic annotation.
机译:网络上有许多结构化文档的集合。该集合通常从单一类型描述实体的特征,其中每个页面描述一个实体。这些文档是用于构建本体的足够的知识来源。由于它们受益于强大且共享的布局,因此与纯文本文件相比,它们包含的书写文字更少,但是它们的体系结构非常有意义。用于识别概念和关系的基于语言的经典方法不再适合于对其进行分析。我们在本文中提出的方法利用了此类文档的各种属性,将布局/格式分析与语言分析相结合,并使用了语义标注。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号