Conference: Workshop on cognitive aspects of the lexicon

The Power of Language Music: Arabic Lemmatization through Patterns

Abstract

The interaction between roots and patterns in Arabic has intrigued lexicographers and morphologists for centuries. While roots provide the consonantal building blocks, patterns provide the syllabic vocalic moulds. While roots provide abstract semantic classes, patterns realize these classes in specific instances. In this way, both roots and patterns are indispensable for understanding the derivational, morphological and, to some extent, cognitive aspects of the Arabic language. In this paper we perform lemmatization (a high-level lexical processing task) without relying on a lookup dictionary. We use a hybrid approach that consists of a machine learning classifier to predict the lemma pattern for a given stem, and mapping rules to convert stems to their respective lemmas with the vocalization defined by the pattern.
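
The two-stage idea in the abstract (predict a lemma pattern for a stem, then apply mapping rules that vocalize the stem according to that pattern) can be illustrated with a small sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Buckwalter-style transliteration, the three patterns, the `RULES` table, and the hand-written `predict_lemma_pattern` stub, which merely stands in for a trained classifier.

```python
import re

# Hypothetical mapping rules: lemma pattern -> unvocalized stem template.
# Digits mark root-consonant slots; other characters are fixed letters.
RULES = {
    "1a2a3": "123",      # ktb   -> katab
    "1A2i3": "1A23",     # kAtb  -> kAtib
    "ma12uw3": "m12w3",  # mktwb -> maktuwb
}


def predict_lemma_pattern(stem):
    """Stand-in for the machine learning classifier (assumption:
    a real system would predict the pattern from learned features)."""
    if len(stem) == 5 and stem[0] == "m" and stem[3] == "w":
        return "ma12uw3"
    if len(stem) == 4 and stem[1] == "A":
        return "1A2i3"
    return "1a2a3"


def extract_root(stem, template):
    """Align the stem with its template and read the root consonants
    off the numbered slots."""
    if len(stem) != len(template):
        raise ValueError("stem does not fit template")
    slots = {}
    for ch, t in zip(stem, template):
        if t.isdigit():
            slots[int(t)] = ch
        elif ch != t:
            raise ValueError("fixed letter mismatch")
    return "".join(slots[i] for i in sorted(slots))


def apply_pattern(root, pattern):
    """Fill the root consonants into the numbered slots of the pattern."""
    return re.sub(r"[1-9]", lambda m: root[int(m.group()) - 1], pattern)


def lemmatize(stem):
    """Predict the lemma pattern, then vocalize the stem through it."""
    pattern = predict_lemma_pattern(stem)
    root = extract_root(stem, RULES[pattern])
    return apply_pattern(root, pattern)


for stem in ("ktb", "kAtb", "mktwb"):
    print(stem, "->", lemmatize(stem))
```

In this toy setup no lemma dictionary is consulted: the lemma is produced entirely from the predicted pattern and the consonants recovered from the stem, which is the property the abstract emphasizes.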
