...
首页> 外文期刊>International Journal of Applied Engineering Research >Speech to Text Synthesis from Video Automated Subtitling using Levinson Durbin Method of Linear Predictive Coding
【24h】

Speech to Text Synthesis from Video Automated Subtitling using Levinson Durbin Method of Linear Predictive Coding

机译:使用线性预测编码的Levinson Durbin方法从视频自动字幕转换语音到文本

获取原文
获取原文并翻译 | 示例

摘要

The objective of speech processing is to make sure that the information to be transferred is clear and accurate, although currently many algorithms have been developed; challenge still exists in extracting the features and reconstructing it to reproduce. Feature extraction is the first thing for speech processing so that a digital system will know what is the word exactly spoken by people. In this work, a Graphical User Interface (GUI) has been developed using matlab to extract the audio speech from the test video. Noise is removed by designing a Butterworth filter and frame by frame analysis is done to separate words in the speech using a threshold. Feature extraction is done using Levinson-Durbin method of Linear Predictive Coding and given as input to the neural network for training and recognition. Results are quite satisfied with overall recognition rate of 76%.
机译:语音处理的目的是确保要传输的信息清晰准确,尽管目前已经开发了许多算法。在提取特征并将其重建以进行再现方面仍然存在挑战。特征提取是语音处理的第一件事,因此数字系统将知道人们确切说出的单词是什么。在这项工作中,使用matlab开发了图形用户界面(GUI),以从测试视频中提取音频语音。通过设计巴特沃思滤波器消除噪声,并使用阈值进行逐帧分析以分离语音中的单词。特征提取使用线性预测编码的Levinson-Durbin方法完成,并作为神经网络的输入进行训练和识别。结果非常满意,总体识别率为76%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号