首页> 外文会议>International Conference on Social Informatics >Multimodal Analysis and Prediction of Latent User Dimensions
【24h】

Multimodal Analysis and Prediction of Latent User Dimensions

机译:多峰分析与潜在用户尺寸的预测

获取原文

摘要

Humans upload over 1.8 billion digital images to the internet each day, yet the relationship between the images that a person shares with others and his/her psychological characteristics remains poorly understood. In the current research, we analyze the relationship between images, captions, and the latent demographic/psychological dimensions of personality and gender. We consider a wide range of automatically extracted visual and textual features of images/captions that are shared by a large sample of individuals (N ≈ 1,350). Using correlational methods, we identify several visual and textual properties that show strong relationships with individual differences between participants. Additionally, we explore the task of predicting user attributes using a multimodal approach that simultaneously leverages images and their captions. Results from these experiments suggest that images alone have significant predictive power and, additionally, multimodal methods outperform both visual features and textual features in isolation when attempting to predict individual differences.
机译:人类每天向互联网上传超过18亿美元的数字图像,但图像与他人和他/她的心理特征的人士之间的关系仍然很清楚。在目前的研究中,我们分析了人格和性别的图像,标题和潜在人口统计/心理维度之间的关系。我们考虑广泛的自动提取的图像/标题的视觉和文本特征,这些标题是由大型个体样本共享的(n≈1,350)。使用相关方法,我们确定了几种视觉和文本属性,这些属性显示出与参与者之间的个人差异的强烈关系。另外,我们探索使用同时利用图像及其字幕的多模式方法来预测用户属性的任务。来自这些实验的结果表明,单独的图像具有显着的预测力,并且另外,多式联运方法在尝试预测各个差异时,多式联运方法在隔离时占据视觉特征和文本特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号