Sentence level text classification in the Kannada language - a classifier's perspective

R. Jayashree; K. Srikantamurthy; Basavaraj S. Anami

Abstract

Better information retrieval (IR) techniques are needed to address the problem of information explosion. A major portion of the data available online is text, which gives rise to a huge feature space; hence, structured organisation and retrieval are very important. Information retrieval in the context of Indian languages is not uncommon, but IR in the South Indian language Kannada is quite new. This work focuses on sentence level text classification in the Kannada language, which is a fine-grained approach to text classification; here, we look at the suitability of classifiers such as naive Bayesian, bag of words and support vector machine (SVM) for this task. Dimensionality reduction is carried out using two different approaches, minimum term frequency and stop word removal, and a performance analysis of the above-mentioned classifiers is reported.
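
As a rough illustration of the pipeline the abstract describes (stop word removal and a minimum term frequency cut-off, followed by a naive Bayesian classifier), the following is a minimal sketch and not the authors' implementation. The function names, the threshold value, the add-one smoothing, and the toy English sentences standing in for Kannada text are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def build_vocabulary(sentences, stop_words, min_term_freq):
    """Dimensionality reduction: drop stop words and rare terms."""
    counts = Counter(tok for s in sentences for tok in s.split())
    return {
        term
        for term, freq in counts.items()
        if freq >= min_term_freq and term not in stop_words
    }


def train_naive_bayes(sentences, labels, vocab):
    """Multinomial naive Bayes with add-one smoothing (an assumed choice)."""
    class_counts = Counter(labels)
    term_counts = defaultdict(Counter)
    for sentence, label in zip(sentences, labels):
        term_counts[label].update(t for t in sentence.split() if t in vocab)

    model = {}
    for label, n_sentences in class_counts.items():
        denom = sum(term_counts[label].values()) + len(vocab)
        log_prior = math.log(n_sentences / len(labels))
        log_likelihood = {
            t: math.log((term_counts[label][t] + 1) / denom) for t in vocab
        }
        model[label] = (log_prior, log_likelihood)
    return model


def classify(sentence, model, vocab):
    """Return the label with the highest log posterior score."""
    tokens = [t for t in sentence.split() if t in vocab]
    scores = {
        label: log_prior + sum(log_likelihood[t] for t in tokens)
        for label, (log_prior, log_likelihood) in model.items()
    }
    return max(scores, key=scores.get)


# Hypothetical toy data: English placeholders, not taken from the paper.
train_sentences = [
    "the team won the match",
    "the player scored in the match",
    "the bank raised the interest rate",
    "the market rate fell in the bank report",
]
train_labels = ["sports", "sports", "finance", "finance"]
stop_words = {"the", "in"}  # assumed stop word list

vocab = build_vocabulary(train_sentences, stop_words, min_term_freq=2)
model = train_naive_bayes(train_sentences, train_labels, vocab)
prediction = classify("the bank rate", model, vocab)
```

With the minimum term frequency set to 2, only terms seen at least twice that are not stop words survive, so the classifier works over a much smaller feature space than the raw vocabulary.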

Bibliographic details

Source
International Journal of Computational Vision and Robotics | 2015, Issue 3 | pp. 254-270 | 17 pages
Authors
R. Jayashree; K. Srikantamurthy; Basavaraj S. Anami
Author affiliations

Department of Computer Science, PES Institute of Technology, Bangalore, India;

Department of Computer Science, PES School of Engineering, Bangalore, India;

Department of Computer Science, KLE Institute of Technology, Hubli, India;

Original format: PDF
Language: English
Keywords
sentence level classification; Kannada text classification; naive Bayesian; bag of words; BOW; single label; multi label; SVM model;

Similar literature

1. Suitability of Naïve Bayesian Methods for Paragraph Level Text Classification in the Kannada Language using Dimensionality Reduction Technique [J]. Jayashree R, Srikantamurthy K, Basavaraj S Anami. International Journal of Artificial Intelligence & Applications (IJAIA), 2013, Issue 5
2. Euclidean Distance Based Classifier for Recognition and Generating Kannada Text Description from Live Sign Language Video [J]. Ramesh Mahadev Kagalkar, S V Gumaste. International Journal of Recent Contributions from Engineering, Science & IT, 2017, Issue 3
3. Effect of stop word removal on the performance of naive Bayesian methods for text classification in the Kannada language [J]. R. Jayashree, K. Srikanta Murthy, Basavaraj S. Anami. International Journal of Artificial Intelligence and Soft Computing, 2014, Issue 2a3
4. An analysis of sentence level text classification for the Kannada language [C]. Jayashree R, Srikanta Murthy K. 2011 International Conference of Soft Computing and Pattern Recognition, 2011
5. Automated biomedical text fragmentation in support of biomedical sentence fragment classification [D]. Salehi, Sara. 2009
6. Supporting the use of standardized nursing terminologies with automatic subject heading prediction: a comparison of sentence-level text classification methods [O]. Hans Moen, Kai Hakala, Laura-Maria Peltonen, 2020
7. Suitability of Naïve Bayesian Methods for Paragraph Level Text Classification in the Kannada Language using Dimensionality Reduction Technique [O]. Jayashree R, Srikantamurthy K, Basavaraj S Anami. 2013
8. Natural Language Text Classification and Filtering with Trigrams and Evolutionary Nearest Neighbour Classifiers. Software Engineering (SEN) [R]. Langdon, W. B. 2000
