首页> 外文会议>International Florida Aritificial Intelligence Research Society Conference >Authorship Attribution Using Small Sets of Frequent Part-of-Speech Skip-Grams
【24h】

Authorship Attribution Using Small Sets of Frequent Part-of-Speech Skip-Grams

机译:使用小组频繁的言语跳克的作者归属

获取原文

摘要

Computer-supported authorship attribution provides tools for extracting stylistic features that can help verify or identify the author of text documents. In many situations finding the author of a document is very important, such as the detection of plagiarism for protecting copyrights and forensic support during criminal investigations. This paper, thus explores a novel stylistic feature with the aim of accurately characterizing an author's work. In particular, the use of part-of-speech skip-grams and an in-house top-k sequential pattern mining algorithm are considered for the task of authorship attribution. A study using a collection of 30 texts, written by 10 authors, consisting of 2, 615, 856 words and 99, 903 sentences, confirms that mining part-of-speech skip-grams in texts facilitates authorship inference.
机译:计算机支持的Authorive atture提供用于提取文档功能的工具,可以帮助验证或识别文本文档的作者。在许多情况下,发现文件的作者非常重要,例如检测在刑事调查中保护版权和法医支持的抄袭。本文探讨了一个新的风格特征,目的是准确地表征作者的工作。特别地,考虑使用言语部分的跳过和内部Top-K顺序模式挖掘算法,用于作者归因的任务。使用由10名作者编写的30个文本的集合,由2,615,856字和99,903个句子组成的一项研究证实,文本中的挖掘部分挖掘跳克促进了作者推理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号