首页> 外文期刊>Industrial Informatics, IEEE Transactions on >Enhancement of Speech Recognitions for Control Automation Using an Intelligent Particle Swarm Optimization
【24h】

Enhancement of Speech Recognitions for Control Automation Using an Intelligent Particle Swarm Optimization

机译:使用智能粒子群优化技术增强语音识别以实现控制自动化

获取原文
获取原文并翻译 | 示例
           

摘要

For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation, and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users and the automation system. However, users' commands are often contaminated by environmental noise which degrades the performance of speech recognition for controlling automation systems. This paper presents a multichannel signal enhancement methodology to improve the performance of commercial speech recognizers. The proposed methodology aims to optimize speech recognition accuracy of a commercial speech recognizer in a noisy environment based on a beamformer, which is developed by an intelligent particle swarm optimization. It overcomes the limitation of the existing signal enhancement approaches whereby the parameters inside commercial speech recognizers are required to be tuned, which is impossible in a real-world situation. Also, it overcomes the limitation of the existing optimization algorithm including gradient descent methods, genetic algorithms and classical particle swarm optimization that are unlikely to develop optimal beamformers for maximizing speech recognition accuracy. The performance of the proposed methodology was evaluated by developing beamformers for a commercial speech recognizer, which was implemented on warehouse automation. Results indicate a significant improvement regarding speech recognition accuracy.
机译:二十多年来,语音控制机制已广泛应用于制造系统中,例如工厂自动化,仓库自动化和工业机器人控制。为了实现语音控制,将商用语音识别器用作用户和自动化系统之间的接口。然而,用户的命令经常被环境噪声污染,这降低了用于控制自动化系统的语音识别的性能。本文提出了一种多通道信号增强方法,以改善商业语音识别器的性能。所提出的方法旨在基于智能化粒子群优化技术开发的基于波束形成器的噪声环境中的商用语音识别器,以优化语音识别精度。它克服了现有信号增强方法的局限性,在现有信号增强方法中,需要对商业语音识别器内部的参数进行调整,这在现实世界中是不可能的。而且,它克服了现有优化算法的局限性,包括梯度下降法,遗传算法和经典粒子群优化,这些均不太可能开发出用于最大化语音识别精度的最佳波束形成器。通过开发用于商业语音识别器的波束形成器,评估了所提出方法的性能,该波束形成器在仓库自动化上实现。结果表明有关语音识别准确性的重大改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号