首页> 外文会议> >Jedi: extracting and synthesizing information from the Web
【24h】

Jedi: extracting and synthesizing information from the Web

机译:绝地武士:从Web提取和综合信息

获取原文

摘要

Jedi (Java based Extraction and Dissemination of Information) is a lightweight tool for the creation of wrappers and mediators to extract, combine, and reconcile information from several independent information sources. For wrappers it uses attributed grammars, which are evaluated with a fault-tolerant parsing strategy to cope with ambiguous grammars and irregular sources. For mediation it uses a simple generic object-model that can be extended with Java-libraries for specific models such as HTML, XML or the relational model. This paper describes the architecture of Jedi, and then focuses on Jedi's wrapper generator.
机译:Jedi(基于Java的信息提取和传播)是一种轻量级工具,用于创建包装程序和中介程序,以从多个独立的信息源中提取,合并和协调信息。对于包装程序,它使用属性语法,并使用容错解析策略对其进行评估,以应对歧义语法和不规则来源。为了进行调解,它使用了一个简单的通用对象模型,该对象模型可以与Java库一起扩展为特定模型,例如HTML,XML或关系模型。本文描述了Jedi的体系结构,然后重点介绍了Jedi的包装器生成器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号