An analysis of sentence level text classification for the Kannada language

机译：卡纳达语的句子级文本分类分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the rapid growth of internet, huge amount of data is available online. The ability to draw useful information from this digital data is quite challenging. The task of exploring and extracting information from native languages available on line is very much a useful task. The work presented here focuses on sentence level classification in the Kannada language. The most popular approaches in text categorization like Naïve Bayesian and Bag of Words (BOW) approaches are used in this work. It is evident that Bag of Words approach performs significantly better than Naïve Bayesian approach. The objective of the work is to find how sentence level classification works for Kannada Language, as it can be extended further to sentiment classification, Question Answering, Text Summarization and also for customer reviews in Kannada Blogs, because most user''s comments, queries, opinions etc are expressed using sentences, hence this sentence level Text Classification becomes a special task of Text Classification problem. The work though focuses on very basic approaches presently, can later be extended to other methods like SVM, KNN etc.

机译：随着互联网的快速发展，在线提供了大量数据。从此数字数据中提取有用信息的能力非常具有挑战性。从在线可用的本地语言中探索和提取信息的任务非常有用。这里介绍的工作着重于卡纳达语中的句子级别分类。这项工作使用了文本分类中最流行的方法，如朴素贝叶斯方法和单词袋（BOW）方法。显然，“言语袋”方法的性能要比朴素贝叶斯方法好得多。这项工作的目的是发现句子级分类对于卡纳达语是如何工作的，因为它可以扩展到情感分类，问答，文本摘要以及卡纳达语博客中的客户评论，因为大多数用户的评论，查询，意见等使用句子来表达，因此此句子级别的文本分类成为文本分类问题的一项特殊任务。尽管该工作目前集中在非常基本的方法上，但以后可以扩展到其他方法，例如SVM，KNN等。

著录项

来源
《2011 International Conference of Soft Computing and Pattern Recognition》|2011年|p.147-151|共5页
会议地点
作者
Jayashree R; Srikanta Murthy K;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类理论、方法;模式识别与装置;
关键词
Bag of Words; Naïve Bayesian; kannada text classification; sentence level classification; single label;

机译：单词袋;朴素贝叶斯; kannada文本分类;句子级别分类;单个标签;

相似文献

外文文献
中文文献
专利

1. Sentence level text classification in the kannada language - a classifier's perspective [J] . R. Jayashree, K. Srikantamurthy, Basavaraj S. Anami International journal of computational vision and robotics . 2015,第3期

机译：卡纳达语中的句子级文本分类-分类者的观点
2. Suitability of Na?ve Bayesian Methods for Paragraph Level Text Classification in the Kannada Language using Dimensionality Reduction Technique [J] . Jayashree R, Srikantamurthy K, Basavaraj S Anami International Journal of Artificial Intelligence & Applications (IJAIA) . 2013,第5期

机译：朴素贝叶斯方法对降维技术在卡纳达语中段落级文本分类中的适用性
3. Effect of stop word removal on the performance of naive Bayesian methods for text classification in the Kannada language [J] . R. Jayashree, K. Srikanta Murthy, Basavaraj S. Anami International journal of artificial intelligence and soft computing . 2014,第2a3期

机译：停用词移除对卡纳达语文本分类的朴素贝叶斯方法性能的影响
4. An analysis of sentence level text classification for the Kannada language [C] . Jayashree R, Srikanta Murthy K International Conference of Soft Computing and Pattern Recognition . 2011

机译：Kannada语言句子级文本分类分析
5. Automated biomedical text fragmentation in support of biomedical sentence fragment classification [D] . Salehi, Sara. 2009

机译：自动生物医学文本片段化，支持生物医学句子片段分类
6. Supporting the use of standardized nursing terminologies with automatic subject heading prediction: a comparison of sentence-level text classification methods [O] . Hans Moen, Kai Hakala, Laura-Maria Peltonen, 2020

机译：支持标准护理术语与自动主题预测的使用：句子级文本分类方法的比较
7. Suitability of Naïve Bayesian Methods for Paragraph Level Text Classification in the Kannada Language using Dimensionality Reduction Technique [O] . Jayashree R, Srikantamurthy K, Basavaraj S Anami 2013

机译：使用降维技术将纯朴素贝叶斯方法应用于卡纳达语中的段落级文本分类
8. Classification of Sentences Used in Natural Language Processing in the Military Services. [R] . Wittrock, M. 1989

机译：军事自然语言处理中的句子分类。

An analysis of sentence level text classification for the Kannada language

摘要

著录项

相似文献

相关主题

期刊订阅