Author Clustering Using SPATIUM

机译：使用SPATIUM进行作者聚类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the author clustering problem and compares it to related authorship attribution questions. The proposed model is based on a distance measure called Spatium derived from the Canberra measure (weighted version of L norm). The selected features consist of the 200 most frequent words and punctuation symbols. An evaluation methodology is presented and the test collections are extracted from the PAN CLEF 2016 evaluation campaign. In addition to those, we also consider two additional corpora reflecting the literature domain more closely. Based on four different languages, the evaluation measures demonstrate a high precision and F1 for all 20 test collections. A more detailed analysis provides reasons explaining some of the failures of the Spatium model.

机译：本文提出了作者聚类问题，并将其与相关的作者身份归属问题进行了比较。提出的模型基于堪培拉测度（L模的加权版本）衍生的距离测度Spatium。所选功能包括200个最常用的单词和标点符号。介绍了一种评估方法，并从PAN CLEF 2016评估活动中提取了测试集。除此之外，我们还考虑了另外两个语料库，它们更紧密地反映了文学领域。基于四种不同的语言，评估方法显示了所有20个测试集合的高精度和F1。更详细的分析提供了解释Spatium模型失败的原因。

著录项

来源
《2017 ACM/IEEE Joint Conference on Digital Libraries》|2017年|1-4|共4页
会议地点 Toronto(CA)
作者
Mirco Kocher; Jacques Savoy;
展开▼
作者单位

Comput. Sci. Dept., Univ. of Neuchatel, Neuchatel, Switzerland;

Comput. Sci. Dept., Univ. of Neuchatel, Neuchatel, Switzerland;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Computer science; Computational modeling; Weight measurement; Clustering algorithms; Principal component analysis; Standards; Feature extraction;

机译：计算机科学;计算建模;重量测量;聚类算法;主成分分析;标准;特征提取;

相似文献

外文文献
中文文献
专利

1. Flags of Convenience: A Mari Usque Ad Spatium?: The Case of Sea Launch [J] . Charles Brichet Zeitschrift fuer Luft- und Weltraumrecht(ZLW) . 2021,第2期

机译：便利旗帜：Mari USQUEAY AD Spatium？：海的案例
2. Reimplanting large section of free infected tibia which revascularised in spatium intermusculare to repair bone defect: A case report [J] . Zhou Mingwu, Zhang Xun, Li Yang, Injury . 2015,第11期

机译：大块游离游离胫骨再植入小肌间血管重建修复骨缺损的一例报告
3. Hematopoietic Stem and Progenitor Cells Can Be Enriched by Implanting Biomaterial into Spatium Intermusculare [J] . Jia-Bei Tong, Xiao-Yun Wu, Ge-Liuchang Jia, BioMed research international . 2015,第4期

机译：造血干细胞和祖细胞可以通过将生物材料植入斯坦率分钟来富集
4. Author Clustering Using SPATIUM [C] . Mirco Kocher, Jacques Savoy ACM/IEEE Joint Conference on Digital Libraries . 2017

机译：作者聚类使用spatium
5. Spanning boundaries: An interdisciplinary citation study based on literary studies author co-citation clusters. [D] . Greenberg, Hinda Feige. 1999

机译：跨越界限：基于文学研究作者共引文集群的跨学科引文研究。
6. Hominem Sine Opus Spatium: Where Do the Ideas Come from to Move the Brain Mind Behaviour and Neurosciences in Malaysia? [O] . Jafri Malin Abdullah 2018

机译：Hominem Sine Opus Spatium：这些想法从何而来从而在马来西亚发展大脑思维行为和神经科学？
7. "Exigua pars est vitae qua vivimus. Ceterum quidem omne spatium non vita sed tempus est” : divagazioni semantiche (e lessicali) su spatium e sui suoi esiti romanzi [O] . Blumenthal Peter, Espinosa Elorza Rosa María, Marello Carla, 2013

机译：“我们生活的一部分。此外，距离不是生命，而是时间。”：Divagazioni语义学（elessicali）与其到suoi esiti romanzi的距离

Author Clustering Using SPATIUM

摘要

著录项

相似文献

相关主题

期刊订阅