Pairwise Comparative Classification for Translator Stylometric Analysis

HEBA EL-FIQI; ELENI PETRAKI; HUSSEIN A. ABBASS

首页> 外文期刊>ACM transactions on Asian language information processing >Pairwise Comparative Classification for Translator Stylometric Analysis

【24h】

Pairwise Comparative Classification for Translator Stylometric Analysis

机译：笔势分析的成对比较分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article, we present a new type of classification problem, which we call Comparative Classification Problem (CCP), where we use the term data record to refer to a block of instances. Given a single data record with n instances for n classes, the CCP problem is to map each instance to a unique class. This problem occurs in a wide range of applications where the independent and identically distributed assumption is broken down. The primary difference between CCP and classical classification is that in the latter, the assignment of a translator to one record is independent of the assignment of a translator to a different record. In CCP, however, the assignment of a translator to one record within a block excludes this translator from further assignments to any other record in that block. The interdependency in the data poses challenges for techniques relying on the independent and identically distributed (iid) assumption.In the Pairwise CCP (PWCCP), a pair of records is grouped together. The key difference between PWCCP and classical binary classification problems is that hidden patterns can only be unmasked by comparing the instances as pairs. In this article, we introduce a new algorithm, PWC4.5, which is based on C4.5, to manage PWCCP. We first show that a simple transformation-that we call Gradient-Based Transformation (GBT)— can fix the problem of iid in C4.5. We then evaluate PWC4.5 using two real-world corpora to distinguish between translators on Arabic-English and French-English translations. While the traditional C4.5 failed to distinguish between different translators, GBT demonstrated better performance. Meanwhile, PWC4.5 consistently provided the best results over C4.5 and GBT.

机译：在本文中，我们提出了一种新的分类问题，称为比较分类问题（CCP），在这里我们使用术语数据记录来引用实例块。给定具有n个类的n个实例的单个数据记录，CCP问题是将每个实例映射到唯一的类。在分解独立且分布均匀的假设的广泛应用中，会出现此问题。 CCP与经典分类之间的主要区别在于，在后者中，译者对一个记录的分配独立于译者对不同记录的分配。但是，在CCP中，将翻译器分配给一个块中的一个记录会使该翻译器无法进一步分配给该块中的任何其他记录。数据中的相互依赖性对依赖独立且均匀分布（iid）假设的技术提出了挑战。 r n在成对CCP（PWCCP）中，一对记录被分组在一起。 PWCCP与经典二进制分类问题之间的主要区别在于，只有通过将实例成对比较才能隐藏隐藏模式。在本文中，我们介绍了一种基于C4.5的新算法PWC4.5，用于管理PWCCP。我们首先显示一个简单的转换-我们称为基于渐变的转换（GBT）-可以解决C4.5中的iid问题。然后，我们使用两个真实的语料库评估PWC4.5，以区分阿拉伯语-英语和法语-英语翻译的翻译者。传统的C4.5无法区分不同的翻译器，而GBT表现出更好的性能。同时，PWC4.5始终提供优于C4.5和GBT的最佳结果。

著录项

来源
《ACM transactions on Asian language information processing》 |2017年第1期|2.1-2.26|共26页
作者
HEBA EL-FIQI; ELENI PETRAKI; HUSSEIN A. ABBASS;
展开▼
作者单位

University of New South Wales, Australia;

University of Canberra, Australia;

University of New South Wales, Australia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Arabic translation; classification; translator stylometry;

机译：阿拉伯语翻译;分类;笔译仪;
入库时间 2022-08-18 04:03:42

相似文献

外文文献
中文文献
专利

1. Different Bilingual Experiences Might Modulate Executive Tasks Advantages: Comparative Analysis between Monolinguals, Translators, and Interpreters [J] . S??bastien Henrard, Agn?¨s Van Daele Frontiers in Psychology . 2017,第1期

机译：不同的双语经验可能会调节执行任务的优势：双语者，翻译者和口译者之间的比较分析
2. Two Nineteenth-Century English Translators of Chamisso's Peter Schlemihl: Sir John Bowring and Emilie de Rouillon. A Comparative Analysis [J] . Michael Haldane - Michael Haldane is affiliated with the University of Essex UK. English Studies . 2008,第6期

机译：Chamisso的Peter Schlemihl的两位19世纪英语翻译：John Bowring爵士和Emilie de Rouillon爵士。比较分析
3. Significance analysis for pairwise variable selection in classification [J] . XINGYE QIAO, YUFENG LIU, J. S. MARRON Statistics and Its Interface . 2014,第2期

机译：分类中成对变量选择的意义分析
4. Comparative study for Stylometric analysis techniques for authorship attribution [C] . Maryam A. Raafat, Rania Abdel-Fattah El-Wakil, Ayman Atia International Mobile, Intelligent, and Ubiquitous Computing Conference . 2021

机译：作者归因仪表分析技术的比较研究
5. Ideology, subversion and the translator's voice: A comparative analysis of the French and English translations of Guillermo Cabrera Infante's Tres Tristes Tigres. [D] . Modrea, Andreea. 2004

机译：意识形态，颠覆和译者的声音：吉列尔莫·卡布雷拉·因凡特的《 Tres Tristes Tigres》法语和英语翻译的比较分析。
6. Different Bilingual Experiences Might Modulate Executive Tasks Advantages: Comparative Analysis between Monolinguals Translators and Interpreters [O] . Sébastien Henrard, Agnès Van Daele -1

机译：不同的双语经验可能会调节执行任务的优势：双语者翻译者和口译者之间的比较分析
7. The Comparative Analysis of the Operational Unit Peculiarities of the Creative Potential Realization of Literary Texts Translators and Future Translators. [O] . Дячук Н. В. 2013

机译：文学文本译者和未来译者创造性潜能实现的运作单位特点比较分析。
8. Comparative Analysis of RF Emission Based Fingerprinting Techniques for ZigBee Device Classification. [R] . Coon, C. W. 2017

机译：基于射频发射的ZigBee设备分类指纹技术对比分析。

Pairwise Comparative Classification for Translator Stylometric Analysis

摘要

著录项

相似文献

相关主题

期刊订阅