首页> 外文会议>Proceedings of the 2003 annual national conference on Digital government research >Extending metadata definitions by automatically extracting and organizing glossary definitions
【24h】

Extending metadata definitions by automatically extracting and organizing glossary definitions

机译:通过自动提取和组织词汇表定义来扩展元数据定义

获取原文
获取原文并翻译 | 示例

摘要

Metadata descriptions of database contents are required to build and use systems that access and deliver data in response to user requests. When numerous heterogeneous databases are brought together in a single system, their various metadata formalizations must be homogenized and integrated in order to support the access planning and delivery system. This integration is a tedious process that requires human expertise and attention. In this paper we describe a method of speeding up the formalization and integration of new metadata. The method takes advantage of the fact that databases are often described in web pages containing natural language glossaries that define pertinent aspects of the data. Given a root URL, our method identifies likely glossaries, extracts and formalizes aspects of relevant concepts defined in them, and automatically integrates the new formalized metadata concepts into a large model of the domain and associated conceptualizations.
机译:建立和使用响应于用户请求访问和传递数据的系统需要数据库内容的元数据描述。当将多个异构数据库整合到单个系统中时,必须将其各种元数据形式化统一并集成在一起,以支持访问计划和交付系统。这种集成是一个繁琐的过程,需要人类的专业知识和关注。在本文中,我们描述了一种加速新元数据的形式化和集成的方法。该方法利用了这样一个事实,即数据库通常在包含定义数据相关方面的自然语言词汇表的网页中描述。给定一个根URL,我们的方法将识别可能的词汇表,提取并形式化其中定义的相关概念的各个方面,然后将新的形式化元数据概念自动集成到领域和相关概念化的大型模型中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号