首页> 外文会议>International Conference on Latest Advances in Machine Learning and Data Science >OTAWE-Optimized Topic-Adaptive Word Expansion for Cross Domain Sentiment Classification on Tweets
【24h】

OTAWE-Optimized Topic-Adaptive Word Expansion for Cross Domain Sentiment Classification on Tweets

机译:OTAWE优化的主题自适应词扩展,用于推文上的跨域情感分类

获取原文

摘要

The enormous growth of Internet usage, number of social interactions, and activities in social networking sites results in users adding their opinions on the products. An automated system, called sentiment classifier, is required to extract the sentiments and opinions from social media data. Classifier that is trained using the labeled tweets of one domain may not efficiently classify the tweets from another domain. This is a basic problem with the tweets as twitter data is very diverse. Therefore, Cross Domain Sentiment Classification is required. In this paper, we propose a semi-supervised domain-adaptive sentiment classifier with Optimized Topic-Adaptive Word Expansion (OTAWE) model on tweets. Initially, the classifier is trained on common sentiment words and mixed labeled tweets from various topics. Then, OTAWE algorithm selects more reliable unlabeled tweets from a particular domain and updates domain-adaptive words in every iteration. OTAWE outperforms existing domain-adaptive algorithms as it saves the feature weights after every iteration. This ensures that moderate sentiment words are not missed out and avoids the inclusion of weak sentiment words.
机译:互联网使用量,社交交互数量和社交网站的活动的巨大增长导致用户在产品上添加他们的意见。需要一种被称为情感分类器的自动系统,以从社交媒体数据中提取情绪和意见。使用标记为一个域的标记推文接受培训的分类器可能无法有效地将推文与另一个域的推文分类。这是推文的基本问题,因为Twitter数据非常多样化。因此,需要跨域情绪分类。在本文中,我们提出了一个半监督域 - 自适应情感分类器,具有关于推文的优化主题自适应词扩展(OTAWE)模型。最初,分类器受到常见情绪词语的培训,并从各种主题混合标记的推文。然后,OTAWE算法从特定域中选择更可靠的未标记推文,并在每次迭代中更新域自适应单词。 OTAWE优于现有的域自适应算法,因为它在每次迭代后保存要素权重。这确保了不容错过的中等情绪词语,避免包含弱情绪词语。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号