Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions

Sabato Marco Siniscalchi; Valerio Mario Salerno

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions

【24h】

Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions

机译：使用具有可训练激活功能的人工神经网络适应新的麦克风

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Model adaptation is a key technique that enables a modern automatic speech recognition (ASR) system to adjust its parameters, using a small amount of enrolment data, to the nuances in the speech spectrum due to microphone mismatch in the training and test data. In this brief, we investigate four different adaptation schemes for connectionist (also known as hybrid) ASR systems that learn microphone-specific hidden unit contributions, given some adaptation material. This solution is made possible adopting one of the following schemes: 1) the use of Hermite activation functions; 2) the introduction of bias and slope parameters in the sigmoid activation functions; 3) the injection of an amplitude parameter specific for each sigmoid unit; or 4) the combination of 2) and 3). Such a simple yet effective solution allows the adapted model to be stored in a small-sized storage space, a highly desirable property of adaptation algorithms for deep neural networks that are suitable for large-scale online deployment. Experimental results indicate that the investigated approaches reduce word error rates on the standard Spoke 6 task of the Wall Street Journal corpus compared with unadapted ASR systems. Moreover, the proposed adaptation schemes all perform better than simple multicondition training and comparable favorably against conventional linear regression-based approaches while using up to 15 orders of magnitude fewer parameters. The proposed adaptation strategies are also effective when a single adaptation sentence is available.

机译：模型自适应是一项关键技术，该技术使现代自动语音识别（ASR）系统能够使用少量的注册数据，将其参数调整为由于训练和测试数据中的麦克风不匹配而导致的语音频谱差异。在本简介中，我们研究了针对连接器（也称为混合）ASR系统的四种不同的适应方案，这些方案可以学习麦克风特定的隐藏单元贡献，并提供一些适应材料。采用以下方案之一可以使该解决方案成为可能：1）使用Hermite激活功能； 2）在S形激活函数中引入偏置和斜率参数； 3）注入特定于每个S形单位的幅度参数；或4）2）和3）的组合。这种简单而有效的解决方案允许将适应的模型存储在小型存储空间中，这是适用于大规模在线部署的深度神经网络的适应算法的高度期望的特性。实验结果表明，与不适用的ASR系统相比，所研究的方法降低了《华尔街日报》语料库标准Spoke 6任务上的单词错误率。此外，所提出的自适应方案都比简单的多条件训练更好，并且与传统的基于线性回归的方法相比具有可比性，同时使用的参数最多减少了15个数量级。当单个适应语句可用时，所提出的适应策略也是有效的。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2017年第8期|1959-1965|共7页
作者
Sabato Marco Siniscalchi; Valerio Mario Salerno;
展开▼
作者单位

Computer Engineering Faculty, University of Enna Kore, Enna, Italy;

University of Enna Kore, Enna, Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Microphones; Training; Adaptation models; Data models; Speech recognition; Hidden Markov models; Speech;

机译：麦克风;训练;适应模型;数据模型;语音识别;隐马尔可夫模型;语音;

相似文献

外文文献
中文文献
专利

1. A novel type of activation function in artificial neural networks: Trained activation function [J] . Ertugrul Omer Faruk Neural Networks: The Official Journal of the International Neural Network Society . 2018,第期

机译：人工神经网络中的一种新型激活功能：训练有素的激活功能
2. A.1. Evolving Artificial Neural Networks-A. 1.1. Evolving Artificial Neural Networks trained by Particle Swarm Optimization [J] . International journal of healthcare information systems and informatics : . 2011,第4期

机译：A.1。进化的人工神经网络-A。 1.1。通过粒子群优化训练的进化人工神经网络
3. Ensemble of convolutional neural networks trained with different activation functions [J] . Maguolo Gianluca, Nanni Loris, Ghidoni Stefano Expert systems with applications . 2021,第Mara期

机译：卷积神经网络的集合，具有不同的激活功能
4. Fractional Adaptation of Activation Functions In Neural Networks [C] . Julio Zamora Esquivel, Jesus Adan Cruz Vargas, Paulo Lopez-Meyer International Conference on Pattern Recognition . 2021

机译：神经网络激活功能的分数适应
5. CMOS-Based Implementation of Hyperbolic Tangent Activation Function for Artificial Neural Network [D] . Mohammadi Shakiba, Fatemeh 2018

机译：人工神经网络基于CMOS的双曲正切激活函数的实现
6. Improving Post-Filtering of Artificial Speech Using Pre-Trained LSTM Neural Networks [O] . Marvin Coto-Jiménez 2019

机译：使用预训练的LSTM神经网络改善人工语音的后过滤
7. Ensemble of convolutional neural networks trained with different activation functions [O] . Gianluca Maguolo, Loris Nanni, Stefano Ghidoni 2021

机译：卷积神经网络的集合，具有不同的激活功能

Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions

摘要

著录项

相似文献

相关主题

期刊订阅