...
首页> 外文期刊>International journal of speech technology >Database development and automatic speech recognition of isolated Pashto spoken digits using MFCC and K-NN
【24h】

Database development and automatic speech recognition of isolated Pashto spoken digits using MFCC and K-NN

机译:使用MFCC和K-NN对孤立的普什图语语音进行数据库开发和自动语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Automatic recognition of isolated spoken digits is one of the most challenging tasks in the area of Automatic Speech Recognition. In this paper, Database Development and Automatic Speech Recognition of Isolated Pashto Spoken Digits from Sefer (0) to Naha (9) has been presented. A number of 50 individual Pashto native speakers (25 male and 25 female) of different ages, ranging from 18 to 60 years, were involved to utter from Sefer (0) to Naha (9) digits separately. Sony PCM-M 10 linear recorder is used for recoding purpose in the office and home in noise free environment. Adobe audition version 1.0 is used to split the audio of digits into individual digits and result is saved in wav format. Mel frequency cepstral coefficients is used to extract speech features. K nearest neighbor classifier is used for the first time up to author knowledge in Pashto language to classify the features of speech and compare its accuracy with linear discriminate analysis. The experimental results are evaluated, and the overall average recognition exactitude of 76.8 % is obtained.
机译:自动识别孤立的语音数字是自动语音识别领域最具挑战性的任务之一。本文介绍了从Sefer(0)到Naha(9)的孤立普什图语语音数字的数据库开发和自动语音识别。年龄在18至60岁之间的50位不同年龄的普什图人母语使用者(25位男性和25位女性)分别从塞弗(0)到那霸(9)的数字说话。 Sony PCM-M 10线性录音机用于在无噪音环境下的办公室和家庭中进行录音。 Adobe Audition 1.0版用于将数字音频分割为单个数字,并将结果以wav格式保存。梅尔频率倒谱系数用于提取语音特征。最近使用K最近邻分类器来编写普什图语的知识,以对语音的特征进行分类并将其准确性与线性判别分析进行比较。对实验结果进行评估,获得的总体平均识别准确度为76.8%。

著录项

  • 来源
    《International journal of speech technology》 |2015年第2期|271-275|共5页
  • 作者单位

    Institute of Business and Management Sciences, The University of Agricultural Peshawar, Peshawar, Pakistan;

    Universities of Engineering and Technology Peshawar, Peshawar, Pakistan;

    Department of Computer Science, Central University of Kerala, Nileshwar, India;

    Institute of Business and Management Sciences, The University of Agricultural Peshawar, Peshawar, Pakistan;

    Institute of Business and Management Sciences, The University of Agricultural Peshawar, Peshawar, Pakistan;

    Universities of Engineering and Technology Peshawar, Peshawar, Pakistan;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    KNN; MFCC; Pashto digits;

    机译:KNN;MFCC;普什图数字;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号