Annual Meeting of the Association for Computational Linguistics (ACL 2011)

Unsupervised Decomposition of a Document into Authorial Components


Abstract

We propose a novel unsupervised method for separating out distinct authorial components of a document. In particular, we show that, given a book artificially "munged" from two thematically similar biblical books, we can separate out the two constituent books almost perfectly. This allows us to automatically recapitulate many conclusions reached by Bible scholars over centuries of research. One of the key elements of our method is exploitation of differences in synonym choice by different authors.
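The abstract does not spell out the algorithm, so the following is only a toy sketch of the general idea of using synonym choice as an authorship signal, not the authors' method. It assumes a hand-made list of synonym groups (`SYNSETS`) and a simple seed-based two-way split; the function names, the scoring rule, and the example sentences are all hypothetical.

```python
from itertools import combinations

# Hypothetical synonym groups: each set holds words assumed interchangeable.
SYNSETS = [
    {"big", "large"},
    {"begin", "start"},
    {"quick", "fast"},
]

def synonym_profile(tokens, synsets=SYNSETS):
    """For each synonym group, record which of its members the unit uses."""
    vocab = set(tokens)
    return [vocab & group for group in synsets]

def agreement(p, q):
    """+1 for each group where both units share a synonym,
    -1 where both use the group but only with different members."""
    score = 0
    for a, b in zip(p, q):
        if a and b:
            score += 1 if a & b else -1
    return score

def split_in_two(units, synsets=SYNSETS):
    """Assign each text unit to one of two clusters (labels 0 and 1)."""
    profiles = [synonym_profile(u, synsets) for u in units]
    # Seeds: the pair of units whose synonym choices disagree the most.
    seed_a, seed_b = min(
        combinations(range(len(units)), 2),
        key=lambda ij: agreement(profiles[ij[0]], profiles[ij[1]]),
    )
    # Each unit joins the seed whose synonym choices it matches better.
    return [
        0 if agreement(p, profiles[seed_a]) >= agreement(p, profiles[seed_b]) else 1
        for p in profiles
    ]

units = [
    "the big army will begin the march".split(),
    "a large army will start the quick march".split(),
    "the big river will begin to rise fast".split(),
    "a large city can start a quick trade".split(),
]
labels = split_in_two(units)
```

In this toy input, units 0 and 2 prefer "big" and "begin" while units 1 and 3 prefer "large" and "start", so the two pairs land in different clusters. A real system would need a principled source of synonym groups and a more robust clustering step.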
