首页> 外文会议>IEEE International Conference on Big Data >Towards a Flexible System Architecture for Automated Knowledge Base Construction Frameworks
【24h】

Towards a Flexible System Architecture for Automated Knowledge Base Construction Frameworks

机译:面向自动化知识库构建框架的灵活系统架构

获取原文

摘要

Although knowledge bases play an important role in many domains (including in archives, where they are sometimes used for entity extraction and semantic annotation tasks), it is challenging to build knowledge bases by hand. This is owing to a number of factors: Knowledge bases must be accurate, up-to-date, comprehensive, and as flexible and as efficient as possible. These requirements mean a large undertaking, in the form of extensive work by subject matter experts (such as scientists, programmers, archivists, and other information professionals). Even when successfully engineered, manually built knowledge bases are typically one-off, use-case-specific, non-standardized, hard-to-maintain solutions.Recent advances in the field of automated knowledge base construction (AKBC) offer a promising alternative. A knowledge base construction framework takes as input source documents (such as journal articles containing text, figures, and tables) and produces as output a database of the extracted information.An important motivation behind these frameworks is to relieve domain experts from having to worry about the complexity of building knowledge bases. Unfortunately, such frameworks fall short when it comes to scalability (ingesting and extracting information at scale), extensibility (ability to add or modify functionality), and usability (ability to easily specify information extraction rules). This is partly because these frameworks are often constructed with relatively limited consideration for architectural design, compared to the attention given to algorithmic performance and low-level optimizations.As knowledge bases will be increasingly relevant to many domains, we present a scalable, flexible, and extensible architecture for knowledge base construction frameworks. As a work in progress, we extend a specific framework to address some of its design limitations. The contributions presented in this short paper can shed a light on the suitability of using AKBC frameworks for computational use cases in this domain and provide future directions for building improved AKBC frameworks.
机译:尽管知识库在许多领域(包括在归档中,有时用于实体提取和语义注释任务)中都起着重要作用,但是手工建立知识库仍然是一个挑战。这是由于许多因素造成的:知识库必须是准确的,最新的,全面的,并且要尽可能灵活和高效。这些要求意味着一项艰巨的任务,其形式是主题专家(例如科学家,程序员,档案管理员和其他信息专业人员)的大量工作。即使成功地进行了工程设计,手动建立的知识库通常也是一次性的,针对特定用例的,非标准化的,难以维护的解决方案。自动知识库构建(AKBC)领域的最新进展提供了一种有希望的替代方案。知识库构建框架以输入源文档(例如包含文本,图形和表格的期刊文章)为输入,并输出所提取信息的数据库作为输出。这些框架背后的重要动机是使领域专家不必担心建立知识库的复杂性。不幸的是,当涉及到可伸缩性(大规模地摄取和提取信息),可扩展性(添加或修改功能的能力)以及可用性(易于指定信息提取规则的能力)时,这样的框架都不够。部分原因是,与对算法性能和低级优化的关注相比,构建这些框架时通常在架构设计上的考虑相对有限。随着知识库与许多领域的关系越来越密切,我们提出了一种可扩展,灵活且可扩展的框架。知识库构建框架的可扩展体系结构。作为一项正在进行的工作,我们扩展了一个特定的框架来解决其某些设计限制。这篇简短的论文中的贡献可以阐明在此领域中使用AKBC框架进行计算用例的适用性,并为构建改进的AKBC框架提供未来的方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号