首页> 外文会议>Workshop on open infrastructures and analysis frameworks for HLT 2014 >Integrated Tools for Query-driven Development of Light-weight Ontologies and Information Extraction Components
【24h】

Integrated Tools for Query-driven Development of Light-weight Ontologies and Information Extraction Components

机译:用于查询驱动的轻量级本体和信息提取组件开发的集成工具

获取原文
获取原文并翻译 | 示例

摘要

This paper reports on a user-friendly terminology and information extraction development environment that integrates into existing infrastructure for natural language processing and aims to close a gap in the UIMA community. The tool supports domain experts in data-driven and manual terminology refinement and refactoring. It can propose new concepts and simple relations and includes an information extraction algorithm that considers the context of terms for disambiguation. With its tight integration of easy-to-use and technical tools for component development and resource management, the system is especially designed to shorten times necessary for domain adaptation of such text processing components. Search support provided by the tool fosters this aspect and is helpful for building natural language processing modules in general. Specialized queries are included to speed up several tasks, for example, the detection of new terms and concepts, or simple quality estimation without gold standard documents. The development environment is modular and extensible by using Eclipse and the Apache UIMA framework. This paper describes the system's architecture and features with a focus on search support. Notably, this paper proposes a generic middleware component for queries in a UIMA based workbench.
机译:本文报告了一种用户友好的术语和信息提取开发环境,该环境已集成到用于自然语言处理的现有基础结构中,旨在缩小UIMA社区中的空白。该工具支持领域专家进行数据驱动和手动术语的完善与重构。它可以提出新的概念和简单的关系,并且包括一种考虑了术语歧义的信息提取算法。通过紧密集成易于使用的技术工具和用于组件开发和资源管理的技术工具,该系统特别设计为缩短了此类文本处理组件的域适应所需的时间。该工具提供的搜索支持促进了这一方面,并​​且总体上有助于构建自然语言处理模块。包括专门的查询,以加快一些任务的速度,例如,检测新术语和概念,或在没有黄金标准文件的情况下进行简单的质量估算。开发环境是模块化的,并且可以通过使用Eclipse和Apache UIMA框架进行扩展。本文介绍了系统的体系结构和功能,重点是搜索支持。值得注意的是,本文提出了一个通用的中间件组件,用于基于UIMA的工作台中的查询。

著录项

  • 来源
  • 会议地点 Dublin(IE)
  • 作者单位

    Department of Computer Science Ⅵ University of Wuerzburg, Am Hubland Wuerzburg, Germany;

    Department of Computer Science Ⅵ University of Wuerzburg, Am Hubland Wuerzburg, Germany;

    Department of Computer Science Ⅵ University of Wuerzburg, Am Hubland Wuerzburg, Germany;

    Department of Computer Science Ⅵ University of Wuerzburg, Am Hubland Wuerzburg, Germany,Comprehensive Heart Failure Center University of Wuerzburg, Straubmuehlweg 2a Wuerzburg, Germany;

    Department of Computer Science Ⅵ University of Wuerzburg, Am Hubland Wuerzburg, Germany;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号