SIGMOD Record

Introduction to the Special Issue on Managing Information Extraction

Abstract

The field of information extraction (IE) focuses on extracting structured data, such as person names and organizations, from unstructured text. This field has a long history. It attracted steady attention in the 1980s and 1990s, largely in the AI community.

In the past decade, however, spurred on by the explosion of unstructured data on the World-Wide Web, this attention has turned into a torrent, gathering the efforts of researchers in the AI, DB, WWW, KDD, Semantic Web and IR communities. New IE problems have been identified, new IE techniques developed, many workshops organized, tutorials presented, companies founded, academic and industrial products deployed, and open-source prototypes developed (e.g., [5, 4, 3, 1, 2]; see [5] for the latest survey). The next few years are poised to witness even more accelerated activity in these areas.

It is against this vibrant backdrop that we assemble this special issue. Our objective is threefold. First, we want to provide a glimpse into the current state of the field, highlighting in particular the wide range of IE problems. Second, we want to show that many IE problems can significantly benefit from the wealth of work on managing structured data in the database community. We believe therefore that our community can make a substantial contribution to the IE field. Finally, we hope that examining IE problems can in turn help us gain valuable insights into managing data in this Internet-centric world, a long-term goal of our community.

Keeping in mind the above goals, we end this introduction by briefly describing the nine papers assembled for the issue. These papers fall into four broad categories.
