首页> 外文会议>Workshop on language in social media >Processing Informal, Romanized Pakistani Text Messages

【24h】

Processing Informal, Romanized Pakistani Text Messages

机译：处理非正式，罗马化的巴基斯坦短信

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Regardless of language, the standard character set for text messages (SMS) and many other social media platforms is the Roman alphabet. There are romanization conventions for some character sets, but they are used inconsistently in informal text, such as SMS. In this work, we convert informal, romanized Urdu messages into the native Arabic script and normalize non-standard SMS language. Doing so prepares the messages for existing downstream processing tools, such as machine translation, which are typically trained on well-formed, native script text. Our model combines information at the word and character levels, allowing it to handle out-of-vocabulary items. Compared with a baseline deterministic approach, our system reduces both word and character error rate by over 50%.

机译：无论语言如何，为短信（SMS）和许多其他社交媒体平台设置的标准字符是罗马字母表。有些字符集有罗马化约定，但它们在非正式文本中不一致地使用，例如短信。在这项工作中，我们将非正式的罗马核心邮件转换为原生阿拉伯语脚本并使非标准短信语言正常化。这样做准备了现有下游处理工具的消息，例如机器翻译，这些工具通常在良好的本机脚本文本上培训。我们的模型将信息与字符级别相结合，允许它处理词汇外项目。与基线确定性方法相比，我们的系统将单词和字符错误率降低超过50％。

著录项

来源
《Workshop on language in social media 》|2012年||共4页
会议地点
作者
Ann Irvine; Jonathan Weese; Chris Callison-Burch;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程 ;
关键词

相似文献

外文文献
中文文献
专利

1. Texting Islam: Text messages and religiosity among young Pakistanis [J] . Paul Rolliera* Contemporary South Asia . 2010 ,第4期

机译：发伊斯兰教短信：巴基斯坦年轻人中的短信和宗教信仰
2. Personalized Text Messages and Automated Calls for Improving Vaccine Coverage Among Children in Pakistan: Protocol for a Community-Based Cluster Randomized Clinical Trial [J] . Abdul Momin Kazi, Nazia Ahsan, Ayub Khan, JMIR Research Protocols . 2019 ,第5期

机译：个性化短信和自动通话以提高巴基斯坦儿童的疫苗覆盖率：基于社区的集群随机临床试验方案
3. Effect of Mobile Phone Text Message Reminders on Routine Immunization Uptake in Pakistan: Randomized Controlled Trial [J] . Abdul Momin Kazi, Khurram Zubair, Hussain Kalimuddin, JMIR public health and surveillance. . 2018 ,第1期

机译：巴基斯坦手机短信提醒对常规免疫摄入的影响：随机对照试验。
4. Processing Informal, Romanized Pakistani Text Messages [C] . Ann Irvine, Jonathan Weese, Chris Callison-Burch orkshop on Language in Social Media . 2013

机译：处理非正式，罗马化的巴基斯坦短信
5. I Text Therefore I Am: Message Interactivity vs. Message Exchange in Addictive Use of Instant Messaging [D] . Wu, Mu. 2016

机译：因此，我是：我是：消息交互性与消息交换在上瘾使用即时消息中
6. Monitoring polio supplementary immunization activities using an automated short text messaging system in Karachi Pakistan [O] . AM Kazi, A Murtaza, S Khoja, 2014

机译：在巴基斯坦卡拉奇使用自动短信息系统监控小儿麻痹症补充免疫活动
7. Evaluating the Effectiveness of Text Messaging and Phone Call Reminders to Minimize No Show at Pediatric Outpatient Clinics in Pakistan: Protocol for a Mixed-Methods Study [O] . Sana Saeed, Noureen Somani, Fatima Sharif, 2018

机译：评估文本消息传递和电话提醒的有效性，以最小化巴基斯坦小儿外科诊所的显示：用于混合方法研究的协议

Processing Informal, Romanized Pakistani Text Messages

摘要

著录项

相似文献

相关主题

期刊订阅