首页> 外文会议>International Conference on Communications and Signal Processing >Speech to text conversion for multilingual languages
【24h】

Speech to text conversion for multilingual languages

机译:语音到文本的多语言转换

获取原文

摘要

The current work presents a multilingual speech-to-text conversion system. Conversion is based on information in speech signal. Speech is the natural and most important form of communication for human being. Speech-To-Text (STT) system takes a human speech utterance as an input and requires a string of words as output. The objective of this system is to extract, characterize and recognize the information about speech. The proposed system is implemented using Mel-Frequency Cepstral Coefficient (MFCC) feature extraction technique and Minimum Distance Classifier, Support Vector Machine (SVM) methods for speech classification. Speech utterances are pre-recorded and stored in a database. Database mainly divided into two parts testing and training. Samples from training database are passed through training phase and features are extracted. Combining features for each sample forms feature vector which is stored as reference. Sample to be tested from testing part is given to system and its features are extracted. Similarity between these features and reference feature vector is computed and words having maximum similarity are given as output. The system is developed in MATLAB (R2010a) environment.
机译:当前的工作提出了一种多语言语音到文本转换系统。转换基于语音信号中的信息。语音是人类交流的自然而又最重要的形式。语音转文本(STT)系统将人类语音发声作为输入,并且需要一串单词作为输出。该系统的目的是提取,表征和识别有关语音的信息。该系统采用梅尔频率倒谱系数(MFCC)特征提取技术和最小距离分类器,支持向量机(SVM)方法进行语音分类。语音被预先记录并存储在数据库中。数据库主要分为测试和培训两部分。来自训练数据库的样本将经过训练阶段并提取特征。每个样本的组合特征形成了特征向量,该特征向量被存储为参考。从测试部分将要测试的样本提供给系统,并提取其特征。计算这些特征与参考特征向量之间的相似度,并给出具有最大相似度的单词作为输出。该系统是在MATLAB(R2010a)环境中开发的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号