首页> 外文会议>European conference on IR research >Authorship Verification in the Absence of Explicit Features and Thresholds
【24h】

Authorship Verification in the Absence of Explicit Features and Thresholds

机译:缺少显式功能和阈值时的作者身份验证

获取原文

摘要

Enhancing information retrieval systems with the ability to take the writing style of people into account opens the door for a number of applications. For example, one can link articles by authorships that can help identifying authors who generate hoaxes and deliberate misinformation in news stories, distributed across different platforms. Authorship verification (AV) is a technique that can be used for this purpose. AV deals with the task to judge, whether two or more documents stem from the same author. The majority of existing AV approaches relies on machine learning concepts based on explicitly defined stylistic features and complex models that involve a fair amount of parameters. Moreover, many existing AV methods are based on explicit thresholds (needed to accept or reject a stated authorship), which are determined on training corpora. We propose a novel parameter-free AV approach, which derives its thresholds for each verification case individually and enables AV in the absence of explicit features and training corpora. In an experimental setup based on eight evaluation corpora (each one from another language) we show that our approach yields competitive results against the current state of the art and other noteworthy AV baselines.
机译:增强信息检索系统的能力考虑到考虑人的写作风格,为许多应用程序打开了门。例如,人们可以通过作者链接文章,这些作者可以帮助识别在不同平台上分布在新闻故事中产生恶作剧和故意错误信息的作者。作者验证(AV)是一种可用于此目的的技术。 AV处理任务来判断,是否有两个或更多文件从同一作者源。基于明确定义的风格特征和复杂模型的大多数现有的AV方法依赖于机器学习概念和涉及相当数量的参数的复杂模型。此外,许多现有的AV方法基于明确的阈值(需要接受或拒绝所说的作者),该阈值在培训语料库上确定。我们提出了一种新的无参数AV方法,其单独为每个验证案例获得其阈值,并在没有明确的功能和培训语料库的情况下启用AV。在基于八个评估基层(来自另一种语言的每一个)的实验设置中,我们表明我们的方法对现有技术和其他值得注意的AV基线产生竞争力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号