首页> 外文会议>orkshop on Language in Social Media >Analyzing Urdu Social Media for Sentiments using Transfer Learning with Controlled Translations
【24h】

Analyzing Urdu Social Media for Sentiments using Transfer Learning with Controlled Translations

机译:用转移学习与受控翻译的情绪分析乌尔都语社交媒体

获取原文

摘要

The main aim of this work is to perform sentiment analysis on Urdu blog data. We use the method of structural correspondence learning(SCL) to transfer sentiment analysis learning from Urdu newswire data to Urdu blog data. The pivots needed to transfer learning from newswire domain to blog domain is not trivial as Urdu blog data, unlike newswire data is written in Latin script and exhibits code-mixing and code-switching behavior. We consider two oracles to generate the pivots. 1. Transliteration oracle, to accommodate script variation and spelling variation and 2. Translation oracle, to accommodate code-switching and code-mixing behavior. In order to identify strong candidates for translation, we propose a novel part-of-speech tagging method that helps select words based on POS categories that strongly reflect code-mixing behavior. We validate our approach against a supervised learning method and show that the performance of our proposed approach is comparable.
机译:这项工作的主要目的是对URDU博客数据进行情感分析。我们使用结构对应学习(SCL)的方法将情绪分析从URDU NewsWire数据传输到Urdu博客数据。将从NewsWire域转移到博客域所需的竞争是王子博客数据的琐碎,与NewSWIRE数据不同以拉丁文脚本编写,并展示代码混合和代码切换行为。我们考虑两个令人讨厌的令人讨厌。 1.翻译Oracle,适应脚本变化和拼写变化和2.翻译Oracle,以适应代码切换和代码混合行为。为了识别强大的翻译候选人,我们提出了一种新的语音标记方法,帮助基于强烈反映码混合行为的POS类别选择单词。我们验证了我们对受监督的学习方法的方法,并表明我们所提出的方法的表现是可比的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号