首页> 外文会议>Workshop on NLP for similar languages, varieties and dialects >Processing Dialectal Arabic: Exploiting Variability and Similarity to Overcome Challenges and Discover Opportunities (invited talk)
【24h】

Processing Dialectal Arabic: Exploiting Variability and Similarity to Overcome Challenges and Discover Opportunities (invited talk)

机译:处理方言阿拉伯语:利用可变性和相似性克服挑战并发现机会(特邀演讲)

获取原文

摘要

We recently witnessed an exponential growth in dialectal Arabic usage in both textual data and speech recordings especially in social media. Processing such media is of great utility for alt kinds of applications ranging from information extraction to social media analytics for political and commercial purposes to building decision support systems. Compared to other languages, Arabic, especially the informal variety, poses a significant challenge to natural language processing algorithms since it comprises multiple dialects, linguistic code switching, and a lack of standardized orthographies, to top its relatively complex morphology. Inherently, the problem of processing Arabic in the context of social media is the problem of how to handle resource poor languages. In this talk I will go over some of our insights to some of these problems and show how there is a silver lining where we can generalize some of our solutions to other low resource language contexts.
机译:我们最近目睹了文本数据和语音记录中方言阿拉伯语用法的指数增长,特别是在社交媒体中。处理此类媒体对于从信息提取到社交媒体分析(出于政治和商业目的)到构建决策支持系统的各种应用具有巨大的实用性。与其他语言相比,阿拉伯语(尤其是非正式语言)对自然语言处理算法构成了重大挑战,因为阿拉伯语包含多种方言,语言代码转换,而且缺乏标准化的拼字法,因此其形态相对复杂。从本质上讲,在社交媒体环境中处理阿拉伯语的问题是如何处理资源贫乏的语言的问题。在本次演讲中,我将探讨我们对其中一些问题的见解,并展示如何一线希望可以将我们的一些解决方案推广到其他低资源语言环境。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号