首页> 外文OA文献 >Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal

【2h】

Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal

机译：深度神经网络用于合成音频信号中的微光逼近

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Shimmer is a classical acoustic measure of the amplitude perturbation of a signal. This kind of variation in the human voice allow to characterize some properties, not only of the voice itself, but of the person who speaks. During the last years deep learning techniques have become the state of the art for recognition tasks on the voice. In this work the relationship between shimmer and deep neural networks is analyzed.A deep learning model is created. It is able to approximate shimmer value of a simple synthesized audio signal (stationary and without formants) taking the spectrogram as input feature. It is concluded firstly, that for this kind of synthesized signal, a neural network like the one we proposed can approximate shimmer, and secondly, that the convolution layers can be designed in order to preserve the information of shimmer and transmit it to the following layers.

机译：闪烁是信号幅度扰动的经典声学度量。人类语音的这种变化不仅可以表征语音本身的特征，还可以表征说话者的某些特征。在过去的几年中，深度学习技术已成为语音识别任务的最新技术。在这项工作中，分析了微光和深度神经网络之间的关系。创建了深度学习模型。它能够以频谱图为输入特征，近似简单合成音频信号（静态和无共振峰）的闪光值。结论是：首先，对于这种合成信号，像我们提出的那样的神经网络可以近似闪光，其次，可以设计卷积层以保留闪光信息并将其传输到后续层。

著录项

作者
García, Mario Alejandro; Destéfanis, Eduardo A.;
展开▼
作者单位

展开▼
年度 2017
总页数
原文格式 PDF
正文语种 en
中图分类

相似文献

外文文献
中文文献
专利

1. Better Approximations of High Dimensional Smooth Functions by Deep Neural Networks with Rectified Power Units [J] . Li Bo, Tang Shanshan, Yu Haijun Mathematical research letters: MRL . 2020,第2期

机译：深度神经网络与整流电量单元的更好近似尺寸
2. X-DNNs: Systematic Cross-Layer Approximations for Energy-Efficient Deep Neural Networks [J] . Hanif Muhammad Abdullah, Marchisio Alberto, Arif Tabasher, Journal of Low Power Electronics . 2018,第4期

机译：X-DNN：节能深神经网络的系统交叉层近似
3. Recognition of words from brain-generated signals of speech-impaired people: Application of autoencoders as a neural Turing machine controller in deep neural networks [J] . Boloukian Behzad, Safi-Esfahani Faramarz Neural Networks: The Official Journal of the International Neural Network Society . 2020,第期

机译：识别语音障碍的脑生成信号的单词：AutoEncoders在深神经网络中的神经图定型机控制器中的应用
4. Synthesizing Game Audio Using Deep Neural Networks [C] . Aoife McDonagh, Joseph Lemley, Ryan Cassidy, IEEE Games, Entertainment, Media Conference . 2018

机译：使用深度神经网络合成游戏音频
5. Going Deeper with Recurrent Convolutional Neural Networks for Classifying P300 BCI Signals [D] . Maddula, Ramesh Krishna. 2017

机译：利用递归卷积神经网络对P300 BCI信号进行分类
6. Fast Approximations of Activation Functions in Deep Neural Networks when using Posit Arithmetic [O] . Marco Cococcioni, Federico Rossi, Emanuele Ruffaldi, 2020

机译：使用正算时深层神经网络中激活函数的快速逼近
7. Synthesizing Game Audio Using Deep Neural Networks [O] . Aoife McDonagh, Joseph Lemley, Ryan Cassidy, 2018

机译：使用深神经网络综合游戏音频

Deep Neural Networks for Shimmer Approximation in Synthesized Audio Signal

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅