Paradigms of evaluation in natural language processing: Field linguistics for glass box testing.

Abstract

Although software testing has been well-studied in computer science, it has received little attention in natural language processing. Nonetheless, a fully developed methodology for glass box evaluation and testing of language processing applications already exists in the field methods of descriptive linguistics. This work lays out a number of experiments that in the aggregate demonstrate the feasibility of software testing or glass box evaluation for natural language processing, and in the process validates the claim that the techniques of descriptive linguistics and field methods are a sound methodological approach to doing such testing. Various chapters consider the issue from the perspectives of the application of fieldwork techniques to software testing, applications of linguistics-informed software engineering to NLP, applications of the descriptive linguistics concept of complementary distribution to problems in NLP, and applications of descriptive linguistics concepts to the problem of quality assurance for semantic representations in proposition banks.

In the experiment that most clearly shows the connection between linguistic fieldwork and software testing, a test suite that is constructed like a field linguist's elicitation schedule is used to find performance errors in five named entity recognition programs and to predict the performance of one program on several equivalence classes of named entities. In another experiment, from the software engineering perspective, a linguistically-informed fault model is used to isolate the source of a performance anomaly in a language processing application. In three subsequent experiments, a discovery procedure for minimal pairs and free variation is used to approach a problem in the normalization of named entities and a discovery procedure for complementary distribution is used to diagnose problematic semantic representations. The latter technique is applied to two corpora and two sets of predicate-argument structures; it is shown that the technique labels true positives with an accuracy of 69%.

Bibliographic record

  • Author: Cohen, Kevin Bretonnel
  • Author affiliation: University of Colorado at Boulder
  • Degree-granting institution: University of Colorado at Boulder
  • Subject: Language Linguistics; Computer Science; Biology Bioinformatics
  • Degree: Ph.D.
  • Year: 2010
  • Pages: 175
  • Original format: PDF
  • Language of text: English
