首页> 外文会议>International Conference on Computer and Information Technology >A machine learning approach for stylometric analysis of Bangla literature
【24h】

A machine learning approach for stylometric analysis of Bangla literature

机译:孟加拉文学仪表分析机器学习方法

获取原文

摘要

The term Stylogenetics refers to the eloquent analysis of authors literary corpora which are based on clustering. While writing, a writer focuses on some frequent things subconsciously. We1 focused on these things and tried to detect the affinity and divergence of the writing of different authors. In this approach, our proposal is regarding on some particular features to distinguish authors individuality who writes and establishes their own viewpoint on similar issues. Here we assembled Bengali Blogs scripted by twenty Bangladeshi authors of two different fields e.g. Political, Educational and analyzed the corpus. Via our methodology, we evaluated some features such as negative Word frequency in particular position, Rapidity of use of highest length word and sentence, Suffix Count, Use of particular Punctuation, Common Recognizable word frequency, Classification of Parts of speech, Numeric words frequency and so on. First, we trained the system using these features and then distinguished from random data sets using two machine learning approaches, Support Vector Machines (SVM) and Naive Bayes classifier. This proposal provides more accuracy than previously established works as all the collected corpus here, are of different writers writing, on the analogous field.
机译:术语序列是指基于聚类的作者文学学习的雄辩分析。在写作时,作家潜意识地关注一些频繁的东西。我们 1 专注于这些东西,并试图检测不同作者写作的亲和力和分歧。在这种方法中,我们的提案是关于某些特定的功能,以区分写作并建立自己对类似问题的观点的作者个性。在这里,我们将孟加拉博客组成了二十个不同领域的二十个孟加拉国作者的繁荣博客。政治,教育和分析了语料库。通过我们的方法,我们在特定位置评估了一些特征,特别是负词频率,使用最高长度的单词和句子的速度,后缀计数,使用特定标点符号,常见的识别字频率,分类词语,数字单词频率和分类很快。首先,我们使用这些功能培训了系统,然后使用两种机器学习方法,支持向量机(SVM)和Naive Bayes分类器的随机数据集。该提案提供比以前建立的工作更准确,因为这里的所有收集的语料库都是不同的作家写作,在类似领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号