首页> 外文会议>International Conference on Artificial Intelligence and Evolutionary Algorithms in Engineering Systems >An Efficient Continuous Speech Recognition System for Dravidian Languages Using Support Vector Machine
【24h】

An Efficient Continuous Speech Recognition System for Dravidian Languages Using Support Vector Machine

机译:使用支持向量机的Dravidian语言的高效连续语音识别系统

获取原文

摘要

This paper mainly focuses on developing a novel speech recognition system for Dravidian languages such as Tamil, Malayalam, Telugu, and Kannada. This research work targets to afford a well-organized way for human to interconnect with computers absolutely for people with disabilities who fa?ade variety of stumbling blocks while using computers. This work would be very helpful to the native speakers in various applications. The proposed CSR system comprises of three steps namely preprocessing, feature extraction, and classification. In the preprocessing step, the input signal is preprocessed through the steps such as preemphasis filter, framing, windowing, and band stop filtering in order to remove the background noise and to enrich the signal. The best-filtered and the enriched signal from the preprocessing step is taken as the input for the further process of CSR system. The speech features being the most essential segment in speech recognition system. The most powerful and widely used short-term energy (STE) and zerocrossing rate (ZCR) are used for continuous speech segmentation, and Mel-frequency cepstral coefficients (MFCC) and shifted delta cepstrum (SDC) are used for recognition task. Feature vectors are given as the input to the classifier such as support vector machine (SVM) for classifying and recognizing Dravidian language speech. Experiments are carried out with real-time Dravidian speech signals, and the results reveal that the proposed method competes with the existing methods reported in literature.
机译:本文主要侧重于开发针对Dravidian语言的新型语音识别系统,如泰米尔,Malayalam,Telugu和Kannada。这项研究工作的目标是为人类提供良好的方式,为人类互连,绝对适用于有残疾人的人互动,在使用计算机的同时使用各种绊脚石。这项工作对各种应用中的母语人员非常有帮助。所提出的CSR系统包括三个步骤,即预处理,特征提取和分类。在预处理步骤中,输入信号通过诸如预处理滤波器,框架,窗口和带停止滤波的步骤预处理,以便去除背景噪声并丰富信号。来自预处理步骤的最佳过滤和富集信号被视为CSR系统进一步过程的输入。语音特征是语音识别系统中最重要的段。最强大且广泛使用的短期能量(STE)和ZeroCross率(ZCR)用于连续语音分割,而熔融频率谱系数(MFCC)和移位的Delta综合(SDC)用于识别任务。特征向量被给出为分类器的输入,例如支持向量机(SVM),用于分类和识别Dravidian语言语音。实验与实时Dravidian语音信号进行,结果表明,该方法与文献中报告的现有方法竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号