Workshop on NLP for similar languages, varieties and dialects

Processing Dialectal Arabic: Exploiting Variability and Similarity to Overcome Challenges and Discover Opportunities (invited talk)

Abstract

We have recently witnessed exponential growth in dialectal Arabic usage in both textual data and speech recordings, especially on social media. Processing such media is of great utility for all kinds of applications, ranging from information extraction, to social media analytics for political and commercial purposes, to building decision support systems. Compared to other languages, Arabic, especially the informal variety, poses a significant challenge to natural language processing algorithms, since it comprises multiple dialects, linguistic code switching, and a lack of standardized orthographies, on top of its relatively complex morphology. Inherently, the problem of processing Arabic in the context of social media is the problem of how to handle resource-poor languages. In this talk I will go over some of our insights into these problems and show that there is a silver lining: we can generalize some of our solutions to other low-resource language contexts.