首页> 外文会议>Workshop on statistical parsing of morphologically rich languages >Representation of Morphosyntactic Units and Coordination Structures in the Turkish Dependency Treebank
【24h】

Representation of Morphosyntactic Units and Coordination Structures in the Turkish Dependency Treebank

机译:土耳其语依赖树库中形态句法单位的表示形式和协调结构

获取原文

摘要

This paper presents our preliminary conclusions as part of an ongoing effort to construct a new dependency representation framework for Turkish. We aim for this new framework to accommodate the highly agglutinative morphology of Turkish as well as to allow the annotation of unedited web data, and shape our decisions around these considerations. In this paper, we firstly describe a novel syntactic representation for morphosyntactic sub-word units (namely inflectional groups (IGs) in Turkish) which allows inter-IG relations to be discerned with perfect accuracy without having to hide lexical information. Secondly, we investigate alternative annotation schemes for coordination structures and present a better scheme (nearly 11% increase in recall scores) than the one in Turkish Treebank (Oflazer et al., 2003) for both parsing accuracies and compatibility for colloquial language.
机译:本文介绍了我们的初步结论,这是为土耳其构建新的依赖关系表示框架的持续努力的一部分。我们的目标是建立一个新的框架,以适应土耳其语的高度凝集形态,并允许注释未编辑的Web数据,并根据这些考虑因素来决定我们的决策。在本文中,我们首先描述了一种新的句法句法子词单元(即土耳其语中的屈折词组(IGs))的句法表示形式,该句法表示法可以精确地识别IG之间的关系,而不必隐藏词汇信息。其次,我们研究了用于协调结构的替代注释方案,并提出了一种比土耳其树库中的方案更好的方案(召回分数提高了近11%)(Oflazer等人,2003年),无论是解析准确性还是口语语言的兼容性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号