Similarity Ranking as an Attribute for Machine Learning Approach to Authorship Identification

机译：相似性排名作为作者身份识别的机器学习方法的一种属性

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the authorship identification task, examples of short writings of N authors and an anonymous document written by one of these JV authors are given. The task is to determine the authorship of the anonymous text. Practically all approaches solved this problem with machine learning methods. The input attributes for the machine learning process are usually formed by stylistic or grammatical properties of individual documents or a defined similarity between a document and an author. In this paper, we present the results of an experiment to extend the machine learning attributes by ranking the similarity between a document and an author: we transform the similarity between an unknown document and one of the JV authors to the order in which the author is the most similar to the document in the set of JV authors. The comparison of similarity probability and similarity ranking was made using the Support Vector Machines algorithm. The results show that machine learning methods perform slightly better with attributes based on the ranking of similarity than with previously used similarity between an author and a document.

机译：在作者身份识别任务中，给出了N位作者的简短著述和这些合资企业作者之一撰写的匿名文档的示例。任务是确定匿名文本的作者身份。实际上，所有方法都使用机器学习方法解决了这个问题。机器学习过程的输入属性通常由单个文档的样式或语法属性或文档与作者之间定义的相似性形成。在本文中，我们通过对文档和作者之间的相似性进行排名来展示扩展机器学习属性的实验结果：我们将未知文档和一位合资企业作者之间的相似性转换为作者是与合资企业中的文档最相似。使用支持向量机算法对相似度概率和相似度等级进行比较。结果表明，与基于作者和文档的先前使用的相似性相比，基于相似性排序的属性的机器学习方法的性能稍好。

著录项

来源
《International conference on language resources and evaluation》|2012年|726-729|共4页
会议地点
作者
Jan Rygl; Ales Horak;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
authorship identification; machine learning; similarity ranking;

机译：作者身份证明;机器学习相似度排名;

相似文献

外文文献
中文文献
专利

1. Exploiting feature representations through similarity learning, post-ranking and ranking aggregation for person re-identification [J] . Jacques Julio C. S. Junior, Baro Xavier, Escalera Sergio Image and Vision Computing . 2018,第NOVa期

机译：通过相似性学习，后排名和排名聚合来利用特征表示，以进行人员重新识别
2. A machine learning approach for the identification of the deceptive reviews in the hospitality sector using unique attributes and sentiment orientation [J] . Martinez-Torres M. R., Toral S. L. Tourism management . 2019,第DECa期

机译：一种机器学习方法，使用独特的属性和情感取向来识别酒店业中的欺骗性评论
3. A machine learning approach for the identification of the deceptive reviews in the hospitality sector using unique attributes and sentiment orientation [J] . Martinez-Torres M. R., Toral S. L. Tourism management . 2019,第Deca期

机译：一种机器学习方法，使用独特的属性和情感取向来识别酒店业中的欺骗性评论
4. Similarity Ranking as an Attribute for Machine Learning Approach to Authorship Identification [C] . Jan Ryg1, Ale? Horák LREC-2012 . 2012

机译：相似性排名为机器学习方法的属性识别
5. A Natural Language Processing and Machine-Learning Based Approach to Authorship Attribution of Tweets [D] . Day, Siobahn Caroline. 2018

机译：基于自然语言处理和机器学习的推文作者身份归属方法
6. A machine learning approach for ranking clusters of docked protein‐protein complexes by pairwise cluster comparison [O] . Erik Pfeiffenberger, Raphael A.G. Chaleil, Iain H. Moal, -1

机译：通过成对聚类比较对停靠的蛋白质-蛋白质复合物的聚类进行排序的机器学习方法
7. Exploiting feature representations through similarity learning, post-ranking and ranking aggregation for person re-identification [O] . Julio C.S. Jacques, Xavier Baró, Sergio Escalera 2018

机译：通过相似性学习，排名和排名聚集的利用特征表示，用于人重新识别

Similarity Ranking as an Attribute for Machine Learning Approach to Authorship Identification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅