...
首页> 外文期刊>Procedia Computer Science >Natural Language Processing Using Kepler Workflow System: First Steps
【24h】

Natural Language Processing Using Kepler Workflow System: First Steps

机译:使用开普勒工作流系统进行自然语言处理:第一步

获取原文
   

获取外文期刊封面封底 >>

       

摘要

Scientific community across many disciplines is exploring new ways to extract knowledge from all available sources. Historically, written manuscripts have been the media of choice for recording experimental findings. Many disciplines such as social science, medical science are exploring ways to automate knowledge discovery from a vast repository of published scientific work. This work attempts to accelerate the process of information extraction by extending Kepler, a graphical workflow management tool. Kepler provides a simple way of designing and executing complex workflows in the form of directed graphs. This work presents a scalable approach to convert published research as PDF documents into indexable XML documents using Kepler. This conversion is a critical step in the Natural Language Processing pipeline. Kepler's distributed data processing capability enables scientists to scale this critical computation by simply adding more computing resources over the cloud.
机译:许多学科的科学共同体正在探索从所有可用资源中提取知识的新方法。从历史上看,书面手稿一直是记录实验结果的首选媒体。社会科学,医学等许多学科都在探索从大量已发表的科学著作中自动进行知识发现的方法。这项工作试图通过扩展图形工作流程管理工具Kepler来加快信息提取的过程。开普勒提供了一种有向图形式的设计和执行复杂工作流程的简单方法。这项工作提出了一种可扩展的方法,可以使用开普勒将已发表的研究作为PDF文档转换为可索引的XML文档。这种转换是自然语言处理管道中的关键步骤。开普勒的分布式数据处理功能使科学家能够通过在云上简单地添加更多计算资源来扩展关键计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号