首页> 外文学位 >A computational study of lexicalized noun phrases in English.
【24h】

A computational study of lexicalized noun phrases in English.

机译:英语中词汇化名词短语的计算研究。

获取原文
获取原文并翻译 | 示例

摘要

Lexicalized noun phrases are noun phrases that function as words. In English, lexicalized noun phrases are usually realized as noun-noun compounds such as theater ticket and garbage man, or as adjective-noun phrases such as black market and high school. In specialized or technical subject domains, phrases such as urban planning, air traffic control, highway engineering and combinatorial mathematics represent conventional names for concepts that are just as important to the as single-word terms such as adsorbents, hydrology, or aerodynamics. Yet despite the fact that lexicalized noun phrases are frequent enough to be cited in dictionaries, book indexes, the traditional linguistic literature has failed to identify consistent and categorical formal criteria for identifying them.; This study develops and evaluates a linguistically natural computational method for recognizing lexicalized noun phrases in a large corpus of English-language engineering text by synthesizing the insights of studies in traditional linguistics and computational linguists. From the scholarship in theoretical linguistics, the analysis adopts the perspective that lexicalized noun phrases represent the names of concepts that are important to a community of speakers and have survived a single context of use. Theoretical linguists have also proposed diagnostic tests for identifying lexicalized noun phrases, many of which can be formalized in a computational study. From the scholarship in computational linguistics, the analysis incorporates the view that a linguistic investigation can be extended and verified by processing relevant evidence from a corpus of text, which can be evaluated using mathematical models that do not require categorical input.; In a engineering text, a small set of linguistic contexts, including professor of, department of or studies in, yields state machines, complex systems, computer graphics, and mathematical morphology. The study reported here identifies lexical and syntactic contexts that harbor lexicalized noun phrases and submits them to a machine-learning algorithm that classifies the lexical status of noun phrases extracted from the text. Results from several evaluations show that this evidence is relevant to the classification, and informal evidence from many other subject domains implies that the results can be generalized.
机译:词汇化名词短语是充当单词的名词短语。在英语中,词汇化名词短语通常被实现为名词名词化合物,例如 theater ticket garbage man ,或形容词名词短语例如 black market < / italic>和高中。在专业或技术主题领域,诸如城市规划,空中交通管制,公路工程组合数学之类的词组代表了对于单个概念同样重要的概念的常规名称。 -单词一词,例如吸附剂,水文 aerodynamics 。然而,尽管事实上词汇化的名词短语经常出现在字典,书目索引中经常被引用,但传统的语言文献仍未能找出一致的和分类的形式标准来加以识别。这项研究通过综合传统语言学和计算语言学家的研究见解,开发并评估了一种语言自然的计算方法,用于识别大型英语工程文本中的词汇化名词短语。从理论语言学方面的研究出发,分析采用了以下观点:词汇化名词短语代表对讲者社区很重要且在单个使用环境中幸存下来的概念名称。理论语言学家还提出了用于识别词汇化名词短语的诊断测试,其中许多可以在计算研究中形式化。从计算语言学的学术观点出发,分析纳入了一种观点,即可以通过处理来自文本语料库的相关证据来扩展和验证语言学研究,可以使用不需要分类输入的数学模型对其进行评估。在工程文本中,一小部分语言环境,包括教授,系或研究,产生了状态机,复杂系统,计算机图形学,和数学形态学。此处报告的研究确定了包含词汇化名词短语的词汇和句法语境,并将其提交给机器学习算法,该算法对从文本中提取的名词短语的词汇状态进行分类。几次评估的结果表明,该证据与分类有关,而来自其他许多学科领域的非正式证据则意味着可以对结果进行概括。

著录项

  • 作者

    Godby, Carol Jean.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Language Linguistics.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 144 p.
  • 总页数 144
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 语言学;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号