首页> 外文会议>International conference on computational linguistics >Automatic Extraction of Arabic Multiword Expressions
【24h】

Automatic Extraction of Arabic Multiword Expressions

机译:自动提取阿拉伯语多个表达式

获取原文

摘要

In this paper we investigate the automatic acquisition of Arabic Multiword Expressions (MWE). We propose three complementary approaches to extract MWEs from available data resources. The rst approach relies on the correspondence asymmetries between Arabic Wikipedia titles and titles in 21 different languages. The second approach collects English MWEs from Princeton WordNet 3.0, translates the collection into Arabic using Google Translate, and utilizes different search engines to validate the output. The third uses lexical association measures to extract MWEs from a large unannotated corpus. We experimentally explore the feasibility of each approach and measure the quality and coverage of the output against gold standards.
机译:在本文中,我们调查了阿拉伯多语表达式(MWE)的自动获取。我们提出了三种互补方法来从可用数据资源中提取MWE。第一个方法依赖于阿拉伯维基百科标题和21种不同语言的标题与标题的对应不对称。第二种方法从Princeton Wordnet 3.0收集英语MWE,将该集合转换为Arabic使用Google翻译,并利用不同的搜索引擎来验证输出。第三种使用词汇关联措施从大型未解压的语料库中提取MWE。我们通过实验探索各种方法的可行性,并衡量输出对金标准的质量和覆盖范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号