Speech to text conversion for multilingual languages

机译：语音到文本的多语言转换

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The current work presents a multilingual speech-to-text conversion system. Conversion is based on information in speech signal. Speech is the natural and most important form of communication for human being. Speech-To-Text (STT) system takes a human speech utterance as an input and requires a string of words as output. The objective of this system is to extract, characterize and recognize the information about speech. The proposed system is implemented using Mel-Frequency Cepstral Coefficient (MFCC) feature extraction technique and Minimum Distance Classifier, Support Vector Machine (SVM) methods for speech classification. Speech utterances are pre-recorded and stored in a database. Database mainly divided into two parts testing and training. Samples from training database are passed through training phase and features are extracted. Combining features for each sample forms feature vector which is stored as reference. Sample to be tested from testing part is given to system and its features are extracted. Similarity between these features and reference feature vector is computed and words having maximum similarity are given as output. The system is developed in MATLAB (R2010a) environment.

机译：当前的工作提出了一种多语言语音到文本转换系统。转换基于语音信号中的信息。语音是人类交流的自然而又最重要的形式。语音转文本（STT）系统将人类语音发声作为输入，并且需要一串单词作为输出。该系统的目的是提取，表征和识别有关语音的信息。该系统采用梅尔频率倒谱系数（MFCC）特征提取技术和最小距离分类器，支持向量机（SVM）方法进行语音分类。语音被预先记录并存储在数据库中。数据库主要分为测试和培训两部分。来自训练数据库的样本将经过训练阶段并提取特征。每个样本的组合特征形成了特征向量，该特征向量被存储为参考。从测试部分将要测试的样本提供给系统，并提取其特征。计算这些特征与参考特征向量之间的相似度，并给出具有最大相似度的单词作为输出。该系统是在MATLAB（R2010a）环境中开发的。

著录项

来源
《International Conference on Communications and Signal Processing》|2016年|236-240|共5页
会议地点
作者
Yogita H. Ghadage; Sushama D. Shelke;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech; Databases; Feature extraction; Speech recognition; Support vector machines; Mel frequency cepstral coefficient; Training;

机译：语音;数据库;特征提取;语音识别;支持向量机;梅尔倒谱系数;训练;

相似文献

外文文献
中文文献
专利

1. Multilingual Text-to-Speech Software Component for Dynamic Language Identification and Voice Switching [J] . Fogarassy-Neszly Paul, Pribeanu Costin Studies in Informatics and Control . 2016,第3期

机译：用于动态语言识别和语音切换的多语言文本到语音软件组件
2. AUTOMATIC TEXT SUMMARIZATION OF INDIAN LANGUAGES: A MULTILINGUAL PROBLEM A REVIEW OF MULTILINGUAL SUMMARIZATION TECHNIQUES [J] . JOVI DSILVA, Dr. UZZAL SHARMA Journal of Theoretical and Applied Information Technology . 2019,第11期

机译：印度语的自动文本摘要：多语言问题—多语言摘要技术的回顾
3. AUTOMATIC TEXT SUMMARIZATION OF INDIAN LANGUAGES: A MULTILINGUAL PROBLEM A REVIEW OF MULTILINGUAL SUMMARIZATION TECHNIQUES [J] . JOVI DSILVA, Dr. UZZAL SHARMA Journal of Theoretical and Applied Information Technology . 2019,第11期

机译：印度语的自动文本摘要：多语言问题—多语言摘要技术的回顾
4. Speech to text conversion for multilingual languages [C] . Yogita H. Ghadage, Sushama D. Shelke International Conference on Communications and Signal Processing . 2016

机译：用于多语言语言的文本转换的演讲
5. Multilingual named entity extraction and translation from text and speech. [D] . Huang, Fei. 2006

机译：多语言命名实体从文本和语音中提取和翻译。
6. Tutorial: Speech Assessment for Multilingual Children Who Do Not Speak the Same Language(s) as the Speech-Language Pathologist [O] . Sharynne McLeod, Sarah Verdon, Elise Baker, -1

机译：教程：针对与母语病理学家讲不同语言的多语言儿童的语音评估
7. Multilingual number transcription for text-to-speech conversion [O] . San Segundo Hernández Rubén, Montero Martínez Juan Manuel, Giurgiu M., 2013

机译：多语言数字转录，可实现文本到语音的转换

Speech to text conversion for multilingual languages

摘要

著录项

相似文献

相关主题

期刊订阅