...
首页> 外文期刊>International journal of computational vision and robotics >Sentence level text classification in the kannada language - a classifier's perspective
【24h】

Sentence level text classification in the kannada language - a classifier's perspective

机译:卡纳达语中的句子级文本分类-分类者的观点

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Better information retrieval techniques are needed to address the problem of information explosion. Major portion of data available online is text, which gives rise to huge feature space, hence, structured organisation and retrieval is very important. Information retrieval in the context of Indian languages is not uncommon, but IR in the South Indian language Kannada is quite new. This work focuses on sentence level text classification in the Kannada language, which is a fine grained approach to text classification; here, we look at the suitability of classifiers such as naive Bayesian, bag of words and support vector machine (SVM) for the same. The dimensionality reduction technique using two different approaches: minimum term frequency and stop word removal methods are carried out in this work and the performance analysis of the above mentioned classifiers are noted.
机译:需要更好的信息检索技术来解决信息爆炸的问题。联机可用数据的主要部分是文本,这引起了巨大的功能空间,因此,结构化的组织和检索非常重要。在印度语言环境中进行信息检索并不少见,但南印度语卡纳达语中的IR则很新。这项工作着重于用卡纳达语进行句子级文本分类,这是一种细粒度的文本分类方法。在这里,我们研究分类器的适用性,例如朴素贝叶斯,单词袋和支持向量机(SVM)。在这项工作中使用了两种不同方法的降维技术:最小词频和停用词去除方法,并对上述分类器的性能进行了分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号