An architecture for web information agents

Abstract

Many authors are researching information extraction techniques that transform the semi-structured information in typical web pages into structured information. When a researcher devises a new technique, he or she has to validate it, which requires implementing it, running experiments, gathering precision and recall results, comparing it to other techniques, and drawing conclusions. This involves an array of details that are specific to the technique, but also many others that are shared with other proposals. Unfortunately, the literature does not provide a single up-to-date platform to guide software engineers and researchers in the design and implementation of information extractors. In this paper, we present a platform to design and implement learners of information extraction rules. Due to space constraints, we focus on the class of learners that learn hierarchical transducers. We have implemented our platform and validated it by means of three case studies.