...
首页> 外文期刊>Journal of digital information management >Orchestrating the Natural Language Processing Software in the Cloud Computing Environment
【24h】

Orchestrating the Natural Language Processing Software in the Cloud Computing Environment

机译:在云计算环境中编排自然语言处理软件

获取原文
获取原文并翻译 | 示例

摘要

The most of natural language processing problems are data-intensive. An important step in the distributed orchestration of natural language processing software is a rational choice of the specific middleware. The middleware should solve the presented problem with minimal deployment, support and usage costs. It is necessary to run and use that software in the distributed cloud computing environment to achieve such advantages such as consolidation, isolation, and efficient use of the existent infrastructure. It is often impossible to modify the existent natural language processing software to integrate it into the cloud computing environment because of licensing or organizational issues. This paper studies various popular distributed data processing tools and evaluates the selected natural language processing tools on a relatively large document collection in distributed way using the Gearman framework. The document collection is a 10'000 sentences from the Russian news subcorpus of the Leipzig corpora. The benchmarks are presented and discussed.
机译:大多数自然语言处理问题都是数据密集型的。自然语言处理软件的分布式编排中的重要一步是对特定中间件的合理选择。中间件应以最小的部署,支持和使用成本解决提出的问题。必须在分布式云计算环境中运行和使用该软件,以实现诸如合并,隔离和有效利用现有基础架构之类的优势。由于许可或组织问题,通常不可能修改现有的自然语言处理软件以将其集成到云计算环境中。本文研究了各种流行的分布式数据处理工具,并使用Gearman框架以分布式方式在一个相对较大的文档集中评估了所选的自然语言处理工具。该文档集来自莱比锡语料库的俄罗斯新闻子库有10,000个句子。介绍并讨论了基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号