首页> 外文会议>Data Mining Workshops, ICDMW, 2008 IEEE International Conference on >Exploiting Data Semantics to Discover, Extract, and Model Web Sources
【24h】

Exploiting Data Semantics to Discover, Extract, and Model Web Sources

机译:利用数据语义来发现,提取和建模Web源

获取原文

摘要

We describe Deimos, a system that automatically discovers and models new sources of information.The system exploits four core technologies developed by our group that makes an end-to-end solution to this problem possible. First, given an example source, Deimos finds other similar sources online. Second, it invokes and extracts data from these sources. Third, given the syntactic structure of a source, Deimos maps its inputs and outputs to semantic types. Finally, it infers the source's semantic definition, i.e., the function that maps the inputs to the outputs. Deimos is able to successfully automate these steps by exploiting a combination of background knowledge and data semantics. We describe the challenges in integrating separate components into a unified approach to discovering, extracting and modeling new online sources. We provide an end-to-end validation of the system in two information domains to show that it can successfully discover and model new data sources in those domains.
机译:我们描述了Deimos,这是一个自动发现新信息并对其建模的系统。该系统利用了我们小组开发的四项核心技术,从而使该问题的端到端解决方案成为可能。首先,给出示例源,Deimos在网上找到其他类似的源。其次,它从这些来源中调用并提取数据。第三,给定源的句法结构,Deimos将其输入和输出映射到语义类型。最后,它推断出源的语义定义,即将输入映射到输出的函数。通过利用背景知识和数据语义的结合,Deimos能够成功地自动执行这些步骤。我们描述了将单独的组件集成到发现,提取和建模新的在线资源的统一方法中所面临的挑战。我们在两个信息域中提供了对系统的端到端验证,以表明它可以成功发现和建模那些域中的新数据源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号