Using Statistical Properties for Author Identification

Abstract

Languages in general are highly redundant, which makes text highly compressible. In this paper, English language redundancy is exploited to predict the author of English text. The method starts by training the system on texts with known authors. The distinct blocks in the texts written by each author are determined. Those blocks are then filtered to produce, for each author, a set of unique blocks that occur in that author's writings but not in other authors' texts. In the normal operation mode, the text to be categorized is processed to determine the distinct blocks in that text. Comparing this set of distinct blocks with each author's set of unique blocks results in correct author categorization. The method described in this paper was shown to work successfully in text classification and author categorization, and it has the potential to be a universal method since it was tested on English and Arabic texts.
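The training, filtering, and comparison steps described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation. The abstract does not say what a "block" is or how the comparison is scored, so the code assumes that a block is a sliding window of `block_size` characters and that the predicted author is the one whose unique set shares the most blocks with the input text. The function names `distinct_blocks`, `train`, and `classify`, the default block size, and the toy corpus are all hypothetical.

```python
def distinct_blocks(text, block_size=4):
    """Return the set of distinct fixed-length character blocks in `text`.

    Assumption: a "block" is a sliding window of `block_size` characters over
    lower-cased, whitespace-normalized text. The abstract does not define it.
    """
    normalized = " ".join(text.lower().split())
    return {
        normalized[i:i + block_size]
        for i in range(len(normalized) - block_size + 1)
    }


def train(corpus, block_size=4):
    """Map each author to the blocks found only in that author's texts."""
    # Training: pool the distinct blocks of all texts by each author.
    pooled = {
        author: set().union(*(distinct_blocks(t, block_size) for t in texts))
        for author, texts in corpus.items()
    }
    # Filtering: drop every block that also occurs in another author's texts.
    unique = {}
    for author, blocks in pooled.items():
        others = set().union(*(b for a, b in pooled.items() if a != author))
        unique[author] = blocks - others
    return unique


def classify(text, unique_blocks, block_size=4):
    """Return the author whose unique blocks overlap most with `text`.

    Assumption: overlap count is the comparison rule; ties go to the author
    listed first. Use the same `block_size` as in training.
    """
    blocks = distinct_blocks(text, block_size)
    scores = {
        author: len(blocks & uniq) for author, uniq in unique_blocks.items()
    }
    return max(scores, key=scores.get)


# Toy corpus with made-up author names, for illustration only.
corpus = {
    "alice": ["the quick brown fox jumps over the lazy dog"],
    "bob": ["pack my box with five dozen liquor jugs"],
}
unique = train(corpus)
print(classify("a quick brown dog", unique))  # alice
```

Sets keep the filtering step to a single difference per author. A real system would need far more training text per author than this toy corpus, and a block definition matched to the one used in the paper.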

Bibliographic details

Source
World multi-conference on systemics, cybernetics and informatics | 2018 | 188 p. | 5 pages
Authors
Ziad Osman; Lama Hamandi; Rached Zantout
Original format
PDF
Chinese Library Classification
Computing technology, computer technology
Keywords
distinct blocks; categorization; classification; English language properties

Similar literature

1. On divergence-based author obfuscation: An attack on the state of the art in statistical authorship verification [J]. Information Technology. 2020, No. 2
2. Statistical relationships between corresponding authorship, international co-authorship and citation impact of national research systems [J]. de Moya-Anegon Felix, Guerrero-Bote Vicente P., Lopez-Illescas Carmen. Journal of informetrics. 2018, No. 4
3. Research: Adequate statistical power in clinical trials is associated with the combination of a male first author and a female last author [J]. Willem M Otte, Joeri K Tijdink, Paul L Weerheim. eLife journal. 2018, June issue
4. Using Statistical Properties for Author Identification [C]. Ziad Osman, Lama Hamandi, Rached Zantout. 2018
5. Statistical estimation of two-body hydrodynamic properties using system identification [D]. Xie, Chen. 2009
6. Author Correction: GePMI: a statistical model for personal intestinal microbiome identification [O]. Zicheng Wang, Huazhe Lou, Ying Wang. 2018
7. Isolation, and Identification of Diesel Oil Degrading Bacteria in Water Contamination Site and Preliminary analysis with Potential Bacterial Gordonia terrae [O]. Ainun Ramadhani Tri Wahyuni, Endang Yuli Herawati, Andi Kurniawan. 2019
