首页> 外文会议>43rd Annual Meeting of the Association for Computational Linguistics: Proceeding of the Conference >Memory-based morphological analysis generation andpart-of-speech tagging of Arabic
【24h】

Memory-based morphological analysis generation andpart-of-speech tagging of Arabic

机译:基于内存的阿拉伯语形态分析生成和词性标注

获取原文

摘要

We explore the application of memorybasedlearning to morphological analysisand part-of-speech tagging of writtenArabic, based on data from the ArabicTreebank. Morphological analysis – theconstruction of all possible analyses ofisolated unvoweled wordforms – is performedas a letter-by-letter operation predictiontask, where the operation encodessegmentation, part-of-speech, characterchanges, and vocalization. Part-of-speechtagging is carried out by a bi-modular taggerthat has a subtagger for known wordsand one for unknown words. We report onthe performance of the morphological analyzerand part-of-speech tagger. We observethat the tagger, which has an accuracyof 91.9% on new data, can be used toselect the appropriate morphological analysisof words in context at a precision of64.0 and a recall of 89.7.
机译:我们探索基于内存的应用 学习形态分析 和词性标记的书面 阿拉伯语,基于阿拉伯语的数据 树库。形态分析– 构造所有可能的分析 孤立的非元音字形-执行 作为逐个字母的操作预测 任务,操作进行编码 分割,词性,字符 变化和发声。词性 标记由双模块标记器执行 对已知字词有一个小写字母 一个代表未知词。我们报告 形态分析仪的性能 和词性标记器。我们观察 标记器,它具有准确性 占新数据的91.9%,可用于 选择合适的形态分析 上下文中的单词的精度为 64.0和89.7的召回率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号