The methods of lemmatization of bound case markers in modern Tibetan

Abstract

This paper discusses approaches to identifying bound case markers in modern Tibetan. The aim is to distinguish bound case markers attached to the preceding syllable from homographic endings that are part of the word itself. The approaches are: (1) build a table of words with (-r/-s) endings and match words from texts against it; (2) judge the nature of an ending form using information extracted from predicate verbs and their attributive table. However, our experimental results show that we still need to (3) further analyze the word-formation rules of nouns and adjectives, and pay more attention to lexicalized examples and specific words. In our project, all of these processing techniques are called lemmatization.
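Step (1) above, the table lookup, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `split_bound_marker`, the set `LEXICAL_ENDINGS`, and its four entries (given in Wylie transliteration) are assumptions chosen only for illustration, and the verb-based judgment of step (2) is left out entirely.

```python
# Hypothetical table of words whose final -r/-s belongs to the word itself
# (step 1 in the abstract). Entries are illustrative Wylie transliterations,
# not data from the paper.
LEXICAL_ENDINGS = {"dar", "chos", "gos", "las"}


def split_bound_marker(token):
    """Return (lemma, marker) for a transliterated token.

    marker is "-r" or "-s" when the ending is treated as a bound case
    marker, and None when the token is left unsplit.
    """
    # A table hit means the ending is part of the word, so do not split.
    if token in LEXICAL_ENDINGS:
        return token, None
    # Otherwise, provisionally treat a final -r/-s as a bound case marker.
    if len(token) > 1 and token[-1] in ("r", "s"):
        return token[:-1], "-" + token[-1]
    return token, None


if __name__ == "__main__":
    for tok in ["ngas", "ngar", "chos", "bod"]:
        print(tok, split_bound_marker(tok))
```

A lookup of this kind only separates the two cases for words already in the table. Homographs and unlisted words would still need the contextual information described in steps (2) and (3).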
