首页> 外文会议>European Starting AI Researcher Symposium >Use of Discourse and Syntactic Features for Gender Identification
【24h】

Use of Discourse and Syntactic Features for Gender Identification

机译:使用话语和句法特征进行性别识别

获取原文

摘要

Author profiling and Gender Identification have gained relevance in the last few years. The goal of the research in these fields is to extract certain demographic information on the authors of texts by analyzing their writing at several levels. In our work, we address the problem of the identification of the gender (male vs. female) of the authors of opinion pieces published online. Unlike the overwhelming majority of the proposals, we argue that the use of deeper linguistic features (i.e., syntactic and discourse structure), instead of mainly lexical features leads to a higher accuracy of gender identification. Using such features with supervised machine learning, we achieve very competitive results with accuracies over 84%.
机译:作者的分析和性别识别在过去几年中取得了相关性。 这些领域的研究的目标是通过在几个层面分析他们的写作来提取关于文本作者的某些人口统计信息。 在我们的工作中,我们解决了在线发布的意见作品的作者的身份证明的问题。 与绝大多数提案不同,我们认为使用更深层种语言特征(即句法和话语结构),而不是主要词汇功能导致性别识别的更高准确性。 使用这些功能具有监督机器学习,我们可以通过84%的精度实现非常竞争力的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号