首页> 外文期刊>Computer Science & Information Technology >Semantic Extraction of Arabic Multiword Expressions
【24h】

Semantic Extraction of Arabic Multiword Expressions

机译:阿拉伯多词表达的语义提取

获取原文
           

摘要

A considerable interest has been given to Multiword Expression (MWEs) identification andtreatment. The identification of MWEs affects the quality of results of different tasks heavilyused in natural language processing (NLP) such as parsing and generation. Differentapproaches for MWEs identification have been applied such as statistical methods whichemployed as an inexpensive and language independent way of finding co-occurrence patterns.Another approach relays on linguistic methods for identification, which employ informationsuch as part of speech (POS) filters and lexical alignment between languages is also used andproduced more targeted candidate lists. This paper presents a framework for extracting ArabicMWEs (nominal or verbal MWEs) for bi-gram using hybrid approach. The proposed approachstarts with applying statistical method and then utilizes linguistic rules in order to enhance theresults by extracting only patterns that match relevant language rule. The proposed hybridapproach outperforms other traditional approaches.
机译:人们对多字表达(MWE)的识别和治疗给予了极大的兴趣。 MWE的标识会影响在自然语言处理(NLP)中大量使用的不同任务(例如解析和生成)的结果质量。已经采用了不同的方法来识别MWE,例如采用统计方法来寻找共现模式的廉价且独立于语言的方法。另一种方法是基于语言学方法进行识别,该方法利用了诸如词性(POS)过滤器和词之间的词法对齐等信息。语言也被使用并产生了更有针对性的候选列表。本文提出了一种使用混合方法提取用于二元语法的阿拉伯语MWE(标称或言语MWE)的框架。所提出的方法从应用统计方法开始,然后利用语言规则以通过仅提取与相关语言规则匹配的模式来增强结果。提出的混合方法优于其他传统方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号