首页> 外文会议>International conference on Asian language processing >Valence-arousal ratings prediction of Chinese words using similarity measures based on Word2Vec
【24h】

Valence-arousal ratings prediction of Chinese words using similarity measures based on Word2Vec

机译:基于Word2VEC的相似措施使用相似度量的中文单词的价值 - 唤醒评级预测

获取原文

摘要

In this paper the systems submitted by the joint team of Dublin City University and National Taiwan University to the IALP 2016 Shared Task: Dimensional Sentiment Analysis for Chinese Words are presented. The systems learn the vector representation using Word2Vec algorithm for each Chinese word for sentiment analysis. The corpus used for the calculation of vector representation is 5 years (2006 to 2010) of the LDC Chinese Gigaword Fifth Edition corpus. The systems calculated similarities between a test Chinese word and each word in training corpus of the shared task with human annotation and took the valence-arousal ratings of the most similar words as the ratings of the test word. The performance of the submitted systems are around the same level of the shared task's baseline system. We will be looking at the performance gap with top-ranked systems in several aspects including corpus used for training and methodology.
机译:本文本文由都柏林城市大学联合团队提交的系统和国立台湾大学的IALP 2016年分享任务:提出了中国单词的尺寸情绪分析。系统使用Word2Vec算法来学习矢量表示,为每个汉字进行情感分析。用于计算载体表示的语料库是5年(2006年至2010年)的最不发达国家的第五版第五版语料库。该系统在具有人类注释的共享任务培训语料库中计算了测试中文词和每个单词之间的相似之处,并将最令人讨厌的评级作为测试词的评级作为测试词的评级。提交的系统的性能差别在共享任务的基线系统的相同级别。我们将在包括用于培训和方法的语料库中的若干方面来看看具有排名级的系统的性能差距。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号