Conference: International Conference of Soft Computing and Pattern Recognition

An analysis of sentence level text classification for the Kannada language

Abstract

With the rapid growth of the internet, a huge amount of data is available online, and drawing useful information from this digital data is challenging. Exploring and extracting information from online text written in native languages is a particularly useful task. The work presented here focuses on sentence-level classification in the Kannada language. Two of the most popular approaches to text categorization, Naïve Bayesian and Bag of Words (BOW), are used. In this work, the Bag of Words approach performs significantly better than the Naïve Bayesian approach. The objective is to find out how sentence-level classification works for Kannada, since it can be extended to sentiment classification, question answering, text summarization, and the analysis of customer reviews in Kannada blogs. Because most user comments, queries, and opinions are expressed as sentences, sentence-level text classification becomes a special task within the text classification problem. Although the work currently focuses on very basic approaches, it can later be extended to other methods such as SVM and KNN.
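The abstract does not describe the implementation, so the following is only a minimal sketch of what sentence-level classification over word counts could look like, not the authors' method. It assumes whitespace tokenization, a multinomial Naïve Bayes model with add-one smoothing, and a hypothetical toy dataset in which placeholder tokens stand in for Kannada words. The names `bag_of_words` and `NaiveBayesSentenceClassifier` are illustrative and do not come from the paper.

```python
import math
from collections import Counter, defaultdict


def bag_of_words(sentence):
    """Whitespace-tokenize a sentence and count its tokens (script-agnostic)."""
    return Counter(sentence.split())


class NaiveBayesSentenceClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing.

    Illustrative only: the paper's actual model details are not given.
    """

    def fit(self, sentences, labels):
        # Number of training sentences per class, used for the class prior.
        self.class_counts = Counter(labels)
        # Per-class token counts accumulated from bag-of-words vectors.
        self.word_counts = defaultdict(Counter)
        for sentence, label in zip(sentences, labels):
            self.word_counts[label].update(bag_of_words(sentence))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, sentence):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)  # log prior
            for word, freq in bag_of_words(sentence).items():
                if word in self.vocab:  # skip words never seen in training
                    score += freq * math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data: placeholder tokens stand in for Kannada words.
train_sentences = [
    "cricket match score",
    "team won match",
    "film actor song",
    "song film release",
]
train_labels = ["sports", "sports", "cinema", "cinema"]

model = NaiveBayesSentenceClassifier().fit(train_sentences, train_labels)
prediction = model.predict("match score today")
```

Because the tokenizer only splits on whitespace, the same sketch accepts Kannada script unchanged. A real system would likely also need stop-word handling and morphological normalization, which are not covered here.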
