首页> 外文期刊>Sensors and materials >Speech Processing Based on Hidden Markov Model and Vector Quantization Techniques Applied to Internet of Vehicles
【24h】

Speech Processing Based on Hidden Markov Model and Vector Quantization Techniques Applied to Internet of Vehicles

机译:基于隐马尔可夫模型的语音处理和矢量量化技术在车联网中的应用

获取原文
获取原文并翻译 | 示例
       

摘要

In this study, we develop an intelligence device to apply speech processing function in an Internet of Vehicles (IoV). The voice-based interactions will improve drive safety and in-time awareness of the vehicle status. This interaction can be achieved through speech recognition and response generation between the driver and the smart vehicle. Thus, the driver can focus on the driving. The proposed speech processing can be divided into three portions: (1) voice signal preprocessing, (2) speech recognition, and (3) speaker recognition. Firstly, speech signal preprocessing consists of five steps: sampling, pre-emphasis, frame, window function, and mel-frequency cepstral coefficients (MFCC), so as to be able to extract the characteristic parameters in the speech signal. Secondly, the speech model is built via the hidden Markov model (HMM), and the Viterbi algorithm is used to search the best sequence in the model to achieve the function of speech recognition. Finally, we use the Linde-Buzo-Gray (LBG) algorithm in vector quantization (VQ) to train for the speaker model, and then use cosine similarity to achieve the function of speaker recognition. The proposed speech processing function has been validated experimentally, and the experimental results demonstrate its feasibility for drivers to easily control the IoV system via voice-based command. In addition, the system distinguishes different speakers and provides the corresponding usage privileges, which will improve drive safety and in-time awareness of the general vehicle status.
机译:在这项研究中,我们开发了一种智能设备,可将语音处理功能应用于车辆互联网(IoV)。基于语音的交互将提高驾驶安全性并及时了解车辆状态。这种交互可以通过驾驶员和智能车辆之间的语音识别和响应生成来实现。因此,驾驶员可以专注于驾驶。提出的语音处理可以分为三个部分:(1)语音信号预处理,(2)语音识别和(3)说话人识别。首先,语音信号预处理包括五个步骤:采样,预加重,帧,窗函数和梅尔频率倒谱系数(MFCC),以便能够提取语音信号中的特征参数。其次,通过隐马尔可夫模型(HMM)建立语音模型,并采用维特比算法搜索模型中的最佳序列,以实现语音识别功能。最后,我们在矢量量化(VQ)中使用Linde-Buzo-Gray(LBG)算法训练说话人模型,然后使用余弦相似度实现说话人识别功能。所提出的语音处理功能已通过实验验证,实验结果证明了其对于驾驶员通过基于语音的命令轻松控制IoV系统的可行性。此外,系统可以区分不同的说话者并提供相应的使用权限,这将提高驾驶安全性和对一般车辆状态的及时了解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号