首页> 外文会议>International Conference on Software, Knowledge Information Management and Applications >Multilingual author profiling using word embedding averages and SVMs
【24h】

Multilingual author profiling using word embedding averages and SVMs

机译:多语种作者分析使用Word嵌入平均值和SVMS

获取原文

摘要

This paper describes an experiment done to investigate author profiling of tweets in English and Spanish, particularly for cross genre evaluation. Profiling consists of age and gender classification. The training sets were taken from tweets while genres for evaluation come from blogs, hotel reviews, other tweets collected in a different time, as well as other social media. Comparisons were done between tfidf as a baseline and average of word vectors, using a Support Vector Machine algorithm. Results show that using average of word vectors outperforms tfidf in most cross genre problems for age and gender.
机译:本文介绍了一个实验,用于调查英语和西班牙语的发布作者分析,特别是对于交叉类型评估。分析包括年龄和性别分类。培训套装来自推文,而评估的流派来自博客,酒店评论,其他推文在不同的时间,以及其他社交媒体。使用支持向量机算法,TFIDF作为字线的基线和平均值在TFIDF之间进行比较。结果表明,使用单词矢量平均值优于TFIDF在大多数交叉类型的年龄和性别问题中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号