Journal: Journal of Language Modelling

Slovak Morphosyntactic Tagset


Abstract

Morphological annotation is among the most essential, useful, and common kinds of linguistic information provided in corpora, especially for highly inflectional languages. The morphological tagset used in the Slovak National Corpus was designed with several goals in mind: the tags are compact and easily human-readable, without sacrificing their informational content. The tags consist of ASCII letters, numbers, and several other characters. In general, they have a variable number of symbols, but the order of the symbols is obligatory, and each category or specific feature is assigned a particular character, which can be shared among several parts of speech. The tagset is highly functional and pragmatic, although some allowances had to be made to accommodate the traditional analysis of Slovak morphology and part-of-speech categories. In particular, function words are classified according to their syntactic (and semantic) roles, which is one reason why the tagset is sometimes described as morphosyntactic.
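
The positional design described in the abstract (a part-of-speech symbol followed by feature symbols in a fixed order, with feature characters shared across parts of speech) can be illustrated with a minimal sketch. The symbol inventory below (`S`, `A`, `E`, the gender and number letters, the case digits) and the names `POS`, `FEATURES`, `ORDER`, and `decode` are assumptions made for illustration only; they are not taken from the abstract and should not be read as the published tagset.

```python
# Illustrative sketch only: every symbol and mapping here is a hypothetical
# stand-in, not the actual Slovak National Corpus tagset.

# First character of a tag: part of speech.
POS = {"S": "noun", "A": "adjective", "E": "preposition"}

# Feature characters; the same tables are reused by several parts of speech.
FEATURES = {
    "gender": {"m": "masculine", "f": "feminine", "n": "neuter"},
    "number": {"s": "singular", "p": "plural"},
    "case": {"1": "nominative", "2": "genitive", "4": "accusative"},
}

# Obligatory order of feature slots; tag length varies by part of speech.
ORDER = {
    "S": ["gender", "number", "case"],
    "A": ["gender", "number", "case"],
    "E": ["case"],
}


def decode(tag: str) -> dict:
    """Decode a positional tag into a dict of named features."""
    if not tag or tag[0] not in POS:
        raise ValueError(f"unknown part of speech in tag {tag!r}")
    slots = ORDER[tag[0]]
    symbols = tag[1:]
    if len(symbols) != len(slots):
        raise ValueError(f"tag {tag!r} needs {len(slots)} feature symbols")
    result = {"pos": POS[tag[0]]}
    for slot, symbol in zip(slots, symbols):
        if symbol not in FEATURES[slot]:
            raise ValueError(f"invalid {slot} symbol {symbol!r} in tag {tag!r}")
        result[slot] = FEATURES[slot][symbol]
    return result
```

Under these assumptions, `decode("Sfs1")` yields a feminine singular nominative noun, while `decode("E4")` yields a preposition governing the accusative; the case digit is interpreted through the same table in both tags, which is the kind of sharing across parts of speech that the abstract mentions.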
