首页> 外文会议>International Conference on Software, Knowledge Information Management and Applications >Multilingual author profiling using word embedding averages and SVMs
【24h】

Multilingual author profiling using word embedding averages and SVMs

机译:使用单词嵌入平均值和SVM进行多语言作者分析

获取原文

摘要

This paper describes an experiment done to investigate author profiling of tweets in English and Spanish, particularly for cross genre evaluation. Profiling consists of age and gender classification. The training sets were taken from tweets while genres for evaluation come from blogs, hotel reviews, other tweets collected in a different time, as well as other social media. Comparisons were done between tfidf as a baseline and average of word vectors, using a Support Vector Machine algorithm. Results show that using average of word vectors outperforms tfidf in most cross genre problems for age and gender.
机译:本文介绍了一项实验,旨在调查作者在英语和西班牙语中发布推文的情况,特别是跨风格评估。分析由年龄和性别分类组成。培训集来自推文,而评估类型则来自博客,酒店评论,在不同时间收集的其他推文以及其他社交媒体。使用支持向量机算法,在tfidf作为基线与单词向量的平均值之间进行了比较。结果表明,在大多数年龄和性别的跨类型问题中,使用单词向量的平均值要胜过tfidf。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号