Training and Evaluating a Statistical Part of Speech Tagger for Natural Language Applications using Kepler Workflows

Doug Briesch; Reginald Hobbs; Claire Jaja; Brian Kjersten; Clare Voss

首页> 外文期刊>Procedia Computer Science >Training and Evaluating a Statistical Part of Speech Tagger for Natural Language Applications using Kepler Workflows

【24h】

Training and Evaluating a Statistical Part of Speech Tagger for Natural Language Applications using Kepler Workflows

机译：使用开普勒工作流为自然语言应用训练和评估语音标注器的统计部分

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A core technology of natural language processing (NLP) incorporated into many text processing applications is a part of speech (POS) tagger, a software component that labels words in text with syntactic tags such as noun, verb, adjective, etc. These tags may then be used within more complex tasks such as parsing, question answering, and machine translation (MT). In this paper we describe the phases of our work training and evaluating statistical POS taggers on Arabic texts and their English translations using Kepler workflows. While the original objectives for encapsulating our research code within Kepler workflows were driven by software engineering needs to document and verify the re usability of our software, our research benefitted as well: the ease of rapid retraining and testing enabled our researchers to detect reporting discrepancies, document their source, independently validating the correct results.

机译：集成到许多文本处理应用程序中的自然语言处理（NLP）的一项核心技术是语音（POS）标记器的一部分，该软件组件使用诸如名词，动词，形容词等语法标记来标记文本中的单词。这些标记可能然后用于更复杂的任务，例如解析，问题解答和机器翻译（MT）。在本文中，我们描述了我们的工作培训阶段以及使用开普勒工作流评估阿拉伯文及其英语翻译的统计POS标签的阶段。将研究代码封装在开普勒工作流程中的最初目标是由软件工程记录和验证我们软件的可重用性的需求所驱动的，我们的研究也从中受益：快速重新培训和测试的便捷性使研究人员能够发现报告差异，记录其来源，独立验证正确的结果。

著录项

来源
《Procedia Computer Science》 |2012年第1期|共7页
作者
Doug Briesch; Reginald Hobbs; Claire Jaja; Brian Kjersten; Clare Voss;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Natural Language Processing Using Kepler Workflow System: First Steps [J] . Ankit Goyal, Alok Singh, Shitij Bhargava, Procedia Computer Science . 2016,第1期

机译：使用开普勒工作流系统进行自然语言处理：第一步
2. A tree-based statistical language model for natural language speech recognition [J] . Bahl L.R., Brown P.F. IEEE Transactions on Acoustics, Speech, and Signal Processing . 1989,第7期

机译：用于自然语言语音识别的基于树的统计语言模型
3. Preliminary evaluation of a low-intensity parent training program on speech-language stimulation for children with language delay [J] . Rajesh Vipula, Venkatesh Lakshmi International journal of pediatric otorhinolaryngology . 2019,第期

机译：初步评估语言延迟儿童语言刺激的低强度父母培训计划
4. Training and Evaluating a Statistical Part-of-Speech Tagger for Natural Language Applications using Kepler Workflows [C] . Doug Briesch, Reginald Hobbs, Claire Jaja, International Conference on Computational Science . 2013

机译：使用Bepler工作流程为自然语言应用程序进行培训和评估统计部分语音标记器
5. Reducing pipeline error propagation in natural language processing: Part-of-speech tagging applied to clinical narratives [D] . Ferraro, Jeffrey Page 2013

机译：减少自然语言处理中的管道误差传播：应用于临床叙述的语音标记
6. The application of naturalistic conversation training to speech production in children with speech disabilities. [O] . S Camarata 1993

机译：自然主义会话训练在言语障碍儿童言语产生中的应用。
7. Training and Evaluating a Statistical Part of Speech Tagger for Natural Language Applications using Kepler Workflows [O] . Briesch Doug, Hobbs Reginald, Jaja Claire, 2012

机译：使用开普勒工作流程为自然语言应用训练和评估语音标注器的统计部分

Training and Evaluating a Statistical Part of Speech Tagger for Natural Language Applications using Kepler Workflows

摘要

著录项

相似文献

相关主题

期刊订阅