首页> 外文会议>International Conference on Modelling, Identification and Control >Towards an authorship analysis of two religious documents
【24h】

Towards an authorship analysis of two religious documents

机译:对两个宗教文献的作者分析

获取原文

摘要

In this paper, we try to make an author identification of two ancient Arabic religious books dating from the 6th century: The holy Quran and the Hadith. The authorship identification process is achieved through four phases which are: documents collection, text preprocessing, features extraction and classification model building. Thus, two series of experiments are undergone and commented. The first experiment deals with authorship identification of the two books using a Manhattan centroid distance and SMO-SVM classifier. Whereas, in the second experiment a Hierarchical Clustering is employed to distinguish the different segments belonging to the two books. For that purpose, three types of original NLP features are combined before the classification process. The results show good authorship identification performances with an accuracy of 100%. In fact, all the results of this investigation correspond to a clear authorship distinction between the two religious books.
机译:在本文中,我们试图确定两位可追溯到6世纪的古代阿拉伯宗教书籍:古兰经和圣训。作者身份识别过程通过四个阶段完成:文档收集,文本预处理,特征提取和分类模型构建。因此,经历和评论了两个系列的实验。第一个实验使用曼哈顿重心距离和SMO-SVM分类器确定这两本书的作者身份。而在第二个实验中,采用层次聚类法来区分属于这两本书的不同部分。为此,在分类过程之前将三种类型的原始NLP功能合并在一起。结果表明,作者身份识别性能良好,准确度为100%。实际上,这项调查的所有结果都与两本宗教书籍之间明显的作者区分相对应。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号