首页> 外文会议>Rough sets and current trends in computing >Learning Age and Gender Using Co-occurrence of Non-dictionary Words from Stylistic Variations
【24h】

Learning Age and Gender Using Co-occurrence of Non-dictionary Words from Stylistic Variations

机译:利用非字典词从文体变化中共现来学习年龄和性别

获取原文
获取原文并翻译 | 示例

摘要

This work attempts to report the stylistic differences in blogging for gender and age group variations using slang word co-occurrences. We have mainly focused on co-occurrence of non dictionary words across bloggers of different gender and age groups. For this analysis, we have focused on the feature use of slang words to study the stylistic variations of bloggers across various age groups and gender. We have modeled the co-occurrences of slang words used by bloggers as graph based model where nodes are slang words and edges represent the number of cooccurrences and studied the variations in predicting age groups and gender. We have used demographically tagged blog corpus from ICWSM Spinner dataset for these experiments and used Naive Bayes classifier with 10 fold cross validations. Preliminary results shows that the concurrence of of slang words could be a better choice for predicting age and gender.
机译:这项工作试图报告使用using语单词共现的博客中性别和年龄组变异的风格差异。我们主要关注非性别词在不同性别和年龄组的博客中的同时出现。在此分析中,我们集中于语单词的功能使用,以研究不同年龄段和性别的博客作者的风格变化。我们已将博主使用的lang语单词的共现建模为基于图的模型,其中节点是语单词,边表示共现的次数,并研究了预测年龄组和性别的差异。我们已使用来自ICWSM Spinner数据集的人口统计标记的博客语料库进行了这些实验,并使用了10倍交叉验证的Naive Bayes分类器。初步结果表明,the语单词的并发可能是预测年龄和性别的更好选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号