首页> 外文会议>ANLP 2011 >Domain Independent Authorship Attribution without Domain Adaptation
【24h】

Domain Independent Authorship Attribution without Domain Adaptation

机译:域独立作者归属没有域适应

获取原文

摘要

Automatic authorship attribution, by its nature, is much more advantageous if it is domain (i. e., topic and/or genre) independent. That is, many real world problems that require authorship attribution may not have in-domain training data readily available. However, most previous work based on machine learning techniques focused only on in-domain text for authorship attribution. In this paper, we present comprehensive evaluation of various stylometric techniques for cross-domain authorship attribution. From the experiments based on the Project Gutenberg book archive, we discover that extremely simple techniques básed on stopwords are surprisingly robust against domain change, essentially ridding the need for domain adaptation when supplied with a large amount of data.
机译:自动作者归属,其性质是更有利的,如果是域名(即,主题和/或流派)独立。也就是说,许多要求Autheration attution的真实世界问题可能没有域名培训数据随时可用。但是,基于机器学习技术的最先前的工作仅集中在域名文本中,以获得Autheration归属。本文综合评价跨域作者归因的各种仪表技术。从基于项目的实验到古丁格书档案馆,我们发现在域内变化的稳定性上令人惊讶地对域变化的极其简单的技术,基本上在提供了大量数据时提供了对域适应的需求。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号