Journal: Behavior Research Methods

Aralex: A lexical database for Modern Standard Arabic

Abstract

In this article, we present Aralex, a new lexical database for Modern Standard Arabic. Based on a contemporary text corpus of 40 million words, Aralex provides information about (1) the token frequencies of roots and word patterns, (2) the type frequency, or family size, of roots and word patterns, and (3) the frequency of bigrams and trigrams in orthographic forms, roots, and word patterns. By supporting the selection of stimuli on the basis of precise frequency counts, Aralex is intended as a tool for studying the cognitive processing of Arabic. Researchers can also use it as a source of information for natural language processing, and it may serve an educational purpose by providing basic vocabulary lists. Aralex is distributed under a GNU-like license, allowing people to query it freely online or to download it from www.mrc-cbu.cam.ac.uk:8081/aralex.online/login.jsp.
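
The three kinds of counts named in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy corpus, the transliterated forms, the pattern labels, and the function names are hypothetical and are not taken from Aralex, and the sketch weights n-grams by token, which the abstract does not specify.

```python
from collections import Counter

# Hypothetical toy corpus: one (surface form, root, word pattern) tuple per token.
# Transliterations and pattern labels are illustrative only, not Aralex data.
corpus = [
    ("kataba", "ktb", "faEala"),
    ("kataba", "ktb", "faEala"),
    ("kitaab", "ktb", "fiEaal"),
    ("kaatib", "ktb", "faaEil"),
    ("darasa", "drs", "faEala"),
    ("daaris", "drs", "faaEil"),
]

ROOT, PATTERN = 1, 2  # tuple positions of the root and the word pattern


def token_frequency(tokens, field):
    """Count every occurrence (token) of each root or pattern."""
    return Counter(t[field] for t in tokens)


def family_size(tokens, field):
    """Count the distinct surface forms (types) sharing each root or pattern."""
    families = {}
    for t in tokens:
        families.setdefault(t[field], set()).add(t[0])
    return {key: len(forms) for key, forms in families.items()}


def ngram_frequency(strings, n):
    """Count character n-grams over a sequence of strings (one string per token)."""
    counts = Counter()
    for s in strings:
        for i in range(len(s) - n + 1):
            counts[s[i:i + n]] += 1
    return counts


root_tokens = token_frequency(corpus, ROOT)
root_families = family_size(corpus, ROOT)
root_bigrams = ngram_frequency((t[ROOT] for t in corpus), 2)
```

In this toy corpus the root "ktb" occurs in four tokens but only three distinct forms, which is the distinction between token frequency and family size that the abstract draws.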
