首页> 外文学位 >SISU: A PROGRAMMING LANGUAGE FOR COMPUTATIONAL LINGUISTICS, WITH AN APPLICATION TO THE ANALYSIS OF FINNISH.
【24h】

SISU: A PROGRAMMING LANGUAGE FOR COMPUTATIONAL LINGUISTICS, WITH AN APPLICATION TO THE ANALYSIS OF FINNISH.

机译:SISU:一种计算语言学的编程语言,用于芬兰语分析。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation describes SISU, a new programming language for Computational Linguistics (CL). The motivation for the creation of SISU is discussed in detail. The historical beginnings and development of CL are given, beginning with machine translation and going on through associations with artificial intelligence. The influences of linguistic research upon the field are shown. It is pointed out that in nearly all the literature about CL applications, very little is said about the programming details, or how the system was actually implemented on the computer. This observation leads to an examination of the languages and systems which have been used. It is noted that they either had been developed for other purposes and were adapted for natural-language processing, or that they were developed for only part of the CL problem domain. None of them are designed to meet the needs of the linguistic researcher who is inexperienced in programming. It is proposed that a language designed for the broad range of CL applications, which encourages structured programming techniques and which would be easy for the novice to learn and use, would be an advance in the state of the art of computer science.;A number of CL and natural-language processing application areas and examples are examined to develop a list of requirements for such a programming language.;The SISU language was developed to meet those requirements. It provides a large number of features to simplify the construction of CL programs. It is implemented as an embedded extension to the PL/I programming language by using a preprocessor. It provides powerful commands and data abstractions to facilitate handling blocks of text, sentences and words by referring to them as just that--text, sentences and words. There are also a number of commands which operate on individual words for the purpose of morphemic analysis, allowing the words to be easily taken apart, examined and altered. A facile dictionary-building and referencing technique is also included, as well as debugging and text output features.;As an example of its use, SISU is applied to the Finnish language. A brief description of Finnish is given. It is pointed out that through the process of agglutination, or the adding on of a number of inflectional endings, a single word can generate literally tens of variant forms; however, it is only the basic form which is carried in dictionaries. Moreover, the endings often cause changes in the basic form so that the remaining stem, after removal of the endings, must be changed back into the basic form before it can be found in a computer dictionary, which could not contain all the variant forms. Without this capability, computer analysis of Finnish would be very limited. This problem was analyzed and solved for the class of plural nouns. A general method was developed which allowed the extension to other word classes.;The resulting algorithm was found to be easily and quickly programmable in SISU. The program and the results are given.
机译:本文介绍了一种用于计算语言学(CL)的新型编程语言SISU。将详细讨论创建SISU的动机。给出了CL的历史起点和发展,从机器翻译开始,一直到与人工智能的关联。显示了语言学研究对该领域的影响。需要指出的是,在几乎所有有关CL应用程序的文献中,关于编程细节或系统在计算机上的实际实现方式的论述很少。这种观察导致对已使用的语言和系统的检查。应当指出,它们或者是为其他目的而开发的,并且适合于自然语言处理,或者它们只是为CL问题领域的一部分而开发的。它们都不是为了满足没有编程经验的语言研究人员的需求而设计的。有人提出了一种为广泛的CL应用而设计的语言,它鼓励了结构化的编程技术,并且对于新手来说很容易学习和使用,这将是计算机科学领域的一种进步。研究了CL和自然语言处理的应用领域和示例,以开发出对这种编程语言的要求列表。SISU语言是为了满足这些要求而开发的。它提供了大量功能来简化CL程序的构造。通过使用预处理器,它被实现为PL / I编程语言的嵌入式扩展。它提供了强大的命令和数据抽象,从而通过将文本,句子和单词称为文本,句子和单词来促进处理文本,句子和单词的块。为了进行词素分析,还有许多对单个单词进行操作的命令,使这些单词易于被分解,检查和更改。还包括一种简便的字典构建和引用技术,以及调试和文本输出功能。作为其用法的一个示例,SISU应用于芬兰语。简要介绍了芬兰语。需要指出的是,通过凝集或添加多个曲折结尾的过程,单个单词实际上可以生成数十种变体形式;但是,它只是字典中的基本形式。此外,结尾通常会导致基本形式的变化,因此,在删除结尾之后,必须将剩余的词干变回基本形式,然后才能在不能包含所有变体形式的计算机词典中找到它。如果没有此功能,芬兰语的计算机分析将非常有限。对于复数名词类别,分析并解决了该问题。开发了一种通用方法,该方法可以扩展到其他单词类别。;发现生成的算法可以在SISU中轻松,快速地编程。给出了程序和结果。

著录项

  • 作者

    WOOLF, IRVING DAVID.;

  • 作者单位

    The George Washington University.;

  • 授予单位 The George Washington University.;
  • 学科 Computer Science.
  • 学位 D.Sc.
  • 年度 1981
  • 页码 422 p.
  • 总页数 422
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号