首页> 美国卫生研究院文献>other >Using Shakespeares Sotto Voce to Determine True Identity From Text
【2h】

Using Shakespeares Sotto Voce to Determine True Identity From Text

机译:使用莎士比亚的自下而上的声音来确定文本的真实身份

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Little is known of the private life of William Shakespeare, but he is famous for his collection of plays and poems, even though many of the works attributed to him were published anonymously. Determining the identity of Shakespeare has fascinated scholars for 400 years, and four significant figures in English literary history have been suggested as likely alternatives to Shakespeare for some disputed works: Bacon, de Vere, Stanley, and Marlowe. A myriad of computational and statistical tools and techniques have been used to determine the true authorship of his works. Many of these techniques rely on basic statistical correlations, word counts, collocated word groups, or keyword density, but no one method has been decided on. We suggest that an alternative technique that uses word semantics to draw on personality can provide an accurate profile of a person. To test this claim, we analyse the works of Shakespeare, Christopher Marlowe, and Elizabeth Cary. We use Word Accumulation Curves, Hierarchical Clustering overlays, Principal Component Analysis, and Linear Discriminant Analysis techniques in combination with RPAS, a multi-faceted text analysis approach that draws on a writer's personality, or self to identify subtle characteristics within a person's writing style. Here we find that RPAS can separate the known authored works of Shakespeare from Marlowe and Cary. Further, it separates their contested works, works suspected of being written by others. While few authorship identification techniques identify self from the way a person writes, we demonstrate that these stylistic characteristics are as applicable 400 years ago as they are today and have the potential to be used within cyberspace for law enforcement purposes.
机译:威廉·莎士比亚的私人生活鲜为人知,但他以戏剧和诗歌收藏而闻名,尽管许多归因于他的作品都是匿名出版的。确定莎士比亚的身份已经使学者着迷了400年,对于一些有争议的作品,英国文学史上有四个重要人物可以作为莎士比亚的替代品:培根,德维尔,斯坦利和马洛。无数的计算和统计工具和技术已被用来确定其作品的真实作者身份。这些技术中的许多技术都依赖于基本的统计相关性,词计数,并置的词组或关键字密度,但尚未决定采用哪种方法。我们建议使用词语义来吸引人格的另一种技术可以提供一个人的准确档案。为了检验这一说法,我们分析了莎士比亚,克里斯托弗·马洛和伊丽莎白·卡里的作品。我们将词积累曲线,层次聚类叠加,主成分分析和线性判别分析技术与RPAS结合使用,RPAS是一种基于作者个性或自身的多方面文本分析方法,可以识别人的写作风格中的细微特征。在这里,我们发现RPAS可以将莎士比亚的已知著作与Marlowe和Cary分开。此外,它还将他们有争议的作品(怀疑是他人撰写的作品)分开。尽管很少有作者身份识别技术可以从人们的写作方式中识别自己,但我们证明,这些风格特征在400年前和今天一样适用,并且有可能在网络空间内用于执法目的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号