首页> 外文会议>2017 20th International Conference of Computer and Information Technology >Question classification using support vector machine with hybrid feature extraction method
【24h】

Question classification using support vector machine with hybrid feature extraction method

机译:支持向量机的混合特征提取方法在问题分类中的应用

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an approach to categorizing Bangla language question into some predefined coarse-grained category that represents expected answer type of that particular question. Support vector machine was used with different kernel function to increase the accuracy of existing Bangla question classification system. Both predefined feature set and the stream of unigram based on the frequency of data set was considered to build feature matrix. For five cross validation average 89.14% accuracy was achieved using 380 top frequent words as the feature which outperformed existing single model based Bangla question classification system. For same cross validation, 88.62% accuracy was achieved with a combination of wh-word, wh-word position and question length as feature set.
机译:本文提出了一种将孟加拉语言问题分类为一些预定义的粗粒度类别的方法,该类别表示该特定问题的预期答案类型。支持向量机具有不同的内核功能,以提高现有孟加拉语问题分类系统的准确性。既考虑了预定义的特征集,又考虑了基于数据集频率的字母组合流来构建特征矩阵。对于五个交叉验证,使用380个最常见单词作为特征,其平均准确性达到89.14%,优于基于Bangla问题分类系统的现有单个模型。对于相同的交叉验证,wh-word,wh-word位置和问题长度作为特征集的组合实现了88.62%的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号