首页> 外文会议>2015 First International Conference on Arabic Computational Linguistics >Tunisian Arabic aeb Wordnet: Current State and Future Extensions
【24h】

Tunisian Arabic aeb Wordnet: Current State and Future Extensions

机译:突尼斯阿拉伯语aeb词网:现状和未来扩展

获取原文
获取原文并翻译 | 示例

摘要

Nowadays, Internet communication and especially informal Internet communication such as social networks, blogs, etc. is directing politic, economic, financial and social environments all over the world. Consequently, Internet monitoring is taking more and more scale particularly in Tunisia suffering from unsteadiness since the politic revolution in 2011. In a Tunisian context, Internet communication is characterized by the increasing use of aeb language (i.e. an Arabic dialect called Tunisian Arabic). Therefore, Tunisian Internet monitoring needs primarily aeb language processing tools, especially an aeb lexicon. However, few aeb lexicon were developed seen the lack of written resources. Some of these lexicons are created from Arabic lexicons. They cover aeb lexicon originally Arabic and ignore the large borrowed aeb lexicon. Others are build using the informal Web. In fact, they need a rigorous linguistic verification, correction and validation. In this case, we suggest building a standard, large and robust Wordnet taking in charge phonetic. Our Wordnet is created by the expand approach used for EuroWordnet building as in [12], based on the bilingual English-Tunisian Arabic dictionary Peace corps dictionary prepared by the linguists: R. Ben abdelkader, A. Ayed and A. Naouar [13], and the last version of Princeton Wordnet PWN 3.1. Moreover, it is modelized according to ISO-LMF by a switable Wordnet-LMF model for aeb language. In this paper, we present aeb wordnet building approach, describe its current state and propose extensions.
机译:如今,Internet通讯,尤其是非正式的Internet通讯,例如社交网络,博客等,正在指导全世界的政治,经济,金融和社交环境。因此,自2011年政治革命以来,Internet监控的规模越来越大,特别是在突尼斯,其不稳定。在突尼斯,互联网通信的特点是越来越多地使用aeb语言(即称为突尼斯阿拉伯语的阿拉伯方言)。因此,突尼斯Internet监视主要需要aeb语言处理工具,尤其是aeb词典。但是,由于缺乏书面资源,很少开发aeb词典。其中一些词典是根据阿拉伯语词典创建的。它们涵盖了最初为阿拉伯语的aeb词典,而忽略了大量借来的aeb词典。其他人则使用非正式的Web构建。实际上,他们需要严格的语言验证,更正和确认。在这种情况下,我们建议构建一个标准,大型且健壮的Wordnet来负责语音。我们的Wordnet是基于[12]中的EuroWordnet构建所使用的扩展方法而创建的,基于英语-突尼斯阿拉伯语双语词典,由以下语言学家编写的Peace corps词典:R. Ben abdelkader,A.Ayed和A.Naouar [13]以及Princeton Wordnet PWN 3.1的最新版本。此外,它是通过针对aeb语言的可转换Wordnet-LMF模型根据ISO-LMF建模的。在本文中,我们介绍了aeb词网构建方法,描述了其当前状态并提出了扩展方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号