首页> 外文学位 >An ontology for linguistics on the Semantic Web.
【24h】

An ontology for linguistics on the Semantic Web.

机译:语义网上的语言学本体。

获取原文
获取原文并翻译 | 示例

摘要

The current research presents an ontology for linguistics useful for an implementation on the Semantic Web. By adhering to this model, it is shown that data of the kind routinely collected by field linguists may be represented so as to facilitate automatic analysis and semantic search. The literature concerning typological databases, knowledge engineering, and the Semantic Web is reviewed. It is argued that the time is right for the integration of these three areas of research. Linguistic knowledge is discussed in the overall context of common-sense knowledge representation. A three-layer approach to meaning is assumed, one that includes conceptual, semantic, and linguistic levels of knowledge. In particular the level of semantics is shown to be crucial for a notional account of grammatical categories such as tense, aspect, and case. The level of semantic is viewed as an encoding of common-sense reality. To develop the ontology an upper model based on the Suggested Upper Merged Ontology (SUMO) is adopted, though elements from other ontologies are utilized as well. A brief comparison of available upper models is presented. It is argued that any ontology for linguistics should provide an account of at least (1) linguistic expressions, (2) mental linguistic units, (3) linguistic categories, and (4) discrete semantic units. The concepts and relations concerning these four domains are motivated as part of the ontology. Finally, an implementation for the Semantic Web is given by discussing the various data constructs necessary for markup (interlinear text, lexicons, paradigms, grammatical descriptions). It is argued that a characterization of the data constructs should not be included in the general ontology, but should be left up to the individual data provider to implement in XML Schema. A search scenario for linguistic data is discussed. It is shown that an ontology for linguistics provides the machinery for pure semantic search, that is, an advanced search framework whereby the user may use linguistic concepts, not just simple strings, as the search query.
机译:当前的研究提出了一种语言学的本体论,对在语义网上的实现很有用。通过遵守该模型,表明可以表示由现场语言学家常规收集的那种数据,以便于自动分析和语义搜索。评论有关类型数据库,知识工程和语义网的文献。有人认为,现在是整合这三个研究领域的时机。在常识性知识表示的整体上下文中讨论语言知识。假设了一种意义的三层方法,其中包括知识的概念,语义和语言水平。特别是,对于语法类别(例如时态,方面和大小写)的概念性说明,语义级别被证明是至关重要的。语义级别被视为常识现实的编码。为了开发本体,采用了基于建议的高级合并本体(SUMO)的较高的模型,尽管也利用了来自其他本体的元素。简要介绍了可用的上部模型。有人认为任何语言学本体论都应至少说明(1)语言表达,(2)心理语言单元,(3)语言类别和(4)离散语义单元。与这四个领域有关的概念和关系是本体的一部分。最后,通过讨论标记所需的各种数据结构(线性文本,词典,范例,语法说明),给出了语义网的实现。有人认为,数据构造的特征不应该包含在一般的本体中,而应该留给各个数据提供者以XML Schema来实现。讨论了语言数据的搜索方案。结果表明,语言学本体为纯粹的语义搜索提供了机制,也就是说,高级搜索框架使用户可以使用语言概念,而不仅仅是简单的字符串作为搜索查询。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号