Authorship Attribution in Bengali Language

机译：孟加拉语作者身份归属

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe Authorship Attribution of Bengali literary text. Our contributions include a new corpus of 3,000 passages written by three Bengali authors, an end-to-end system for authorship classification based on character n-grams, feature selection for authorship attribution, feature ranking and analysis, and learning curve to assess the relationship between amount of training data and test accuracy. We achieve state-of-the-art results on held-out dataset, thus indicating that lexical n-gram features are unarguably the best discriminators for authorship attribution of Bengali literary text.

机译：我们描述孟加拉语文学著作的作者身份。我们的贡献包括由三位孟加拉语作者撰写的3,000个段落的新语料库，基于字符n-gram的端到端的作者身份分类系统，作者属性的特征选择，特征排名和分析以及评估关系的学习曲线在训练数据量和测试准确性之间。我们在保留的数据集上获得了最新的结果，从而表明词汇n-gram特征无疑是孟加拉语文学文本作者身份归因的最佳判别器。

著录项

来源
《International conference on natural language processing》|2015年|96-101|共6页
会议地点
作者
Shanta Phani; Shibamouli Lahiri; Arindam Biswas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A Supervised Learning Approach for Authorship Attribution of Bengali Literary Texts [J] . Phani Shanta, Lahiri Shibamouli, Biswas Arindam ACM transactions on Asian language information processing . 2017,第4期

机译：孟加拉语文学文本作者身份归属的有监督的学习方法
2. Authorship Attribution in Latin Languages using Stylometry [J] . Analysis and applications . 2020,第4期

机译：使用STYROMERY的拉丁语语言的作者归属
3. Authorship attribution, constructed languages, and the psycholinguistics of individual variation [J] . Patrick Juola Literary & linguistic computing . 2018,第2期

机译：作者身份，构造语言和个体变异的心理语言学
4. Authorship Attribution on Bengali Literature using Stylometric Features and Neural Network [C] . Md. Ashikul Islam, Md. Minhazul Kabir, Md. Saiful Islam, International Conference on Electrical Engineering and Information Communication Technology . 2018

机译：使用文体特征和神经网络的孟加拉文学著作权归属
5. A Natural Language Processing and Machine-Learning Based Approach to Authorship Attribution of Tweets [D] . Day, Siobahn Caroline. 2018

机译：基于自然语言处理和机器学习的推文作者身份归属方法
6. Cross-Domain Authorship Attribution Using Pre-trained Language Models [O] . Georgios Barlas, Efstathios Stamatatos -1

机译：使用预先训练的语言模型进行跨域作者归属
7. Authorship Attribution: A Comparative Study of Three Text Corpora and Three Languages [O] . Savoy, Jacques 2013

机译：作者身份归因：三种语料库和三种语言的比较研究
8. Military Typesetting Equipment and Systems for Indo-Aryan and Dravidian Languages (Hindi, Marathi, Bengali, Punjabi, Gujarati, Malayalam, Tamil, and Telugu) (1961-1963) [R] . Nitenson, E. 1964

机译：印度 - 雅利安语和德拉威语的军事排版设备和系统（印地语，马拉地语，孟加拉语，旁遮普语，古吉拉特语，马拉雅拉姆语，泰米尔语和泰卢固语）（1961-1963）

Authorship Attribution in Bengali Language

摘要

著录项

相似文献

相关主题

期刊订阅