Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs

机译：测试通用婴儿语言假设-使用CNN的自动婴儿语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents an application of convolutional neural networks (CNN) for the recognition of the so-called “Dunstan baby language” that consists of five “words” or phonemes used by babies of age under 3 months to communicate their needs before they start crying. The model was derived from a CNN architecture which was successfully applied by the authors for voice-based emotion detection. The input of the neural network is the spectrogram obtained from the audio records of babies' voices and is processed as a two-dimensional image. The architecture was trained for a set of 250 small duration recordings and was tested for other 65 recordings with a recognition rate of 89%. The length of all audio files is less than 1 second; the recordings were extracted from certified Dunstan language recordings. The most important original contribution of the paper is the recognition of the actual “baby words” (and not the baby cry as was done before). This architecture offers an efficient tool for the verification of the “universal baby language” hypothesis, according to which the language of infants does not depend on culture, family, etc.

机译：本文介绍了卷积神经网络（CNN）在识别所谓的“ Dunstan婴儿语言”中的应用，该语言由三个月以下婴儿使用的五个“单词”或音素在开始哭闹之前传达他们的需求。该模型源自CNN架构，该架构已成功地被作者应用于基于语音的情感检测。神经网络的输入是从婴儿声音的音频记录中获得的频谱图，并被处理为二维图像。对该体系结构进行了250组小持续时间记录的培训，并针对其他65个记录进行了测试，识别率为89％。所有音频文件的长度小于1秒;这些录音摘自经过认证的Dunstan语言录音。本文最重要的原始贡献是对实际“婴儿单词”的识别（而不是像以前那样哭泣）。这种体系结构为验证“通用婴儿语言”假设提供了一种有效的工具，根据该假设，婴儿的语言不依赖于文化，家庭等。

著录项

来源
《International Conference on Telecommunications and Signal Processing》|2018年|1-4|共4页
会议地点
作者
Eduard Franti; Ioan Ispas; Monica Dascalu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Pediatrics; Computer architecture; Spectrogram; Training; Software; Mel frequency cepstral coefficient; Convolution;

机译：儿科;计算机体系结构;频谱图;培训;软件;梅尔频率倒谱系数;卷积;

相似文献

外文文献
中文文献
专利

1. A Review on Speech Corpus Development for Automatic Speech Recognition in Indian Languages [J] . Cini kurian International Journal of Advanced Networking and Applications . 2015,第7018期

机译：语音语料库在印度语言中自动语音识别的发展述评
2. Universal attribute characterization of spoken languages for automatic spoken language recognition [J] . Sabato Marco Siniscalchi, Jeremy Reed, Torbjorn Svendsen, Computer speech and language . 2013,第1期

机译：口语的通用属性表征，用于自动口语识别
3. A Review on Marathi Language Speech Database Development for Automatic Speech Recognition (ASR) System [J] . Mrs. Chhaya S. Patil, Prof.Dr.Vaishali B.Patil International Journal of Engineering Research and Applications . 2017,第3期

机译：用于自动语音识别（ASR）系统的Marathi语言语音数据库开发的回顾
4. Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs [C] . Eduard Franti, Ioan Ispas, Monica Dascalu International Conference on Telecommunications and Signal Processing . 2018

机译：测试通用婴儿语言假设 - 用CNNS自动婴儿语音识别
5. Automatic Speech Recognition for Low-Resource and Morphologically Complex Languages [D] . Morris, Ethan. 2021

机译：用于低资源和形态复杂语言的自动语音识别
6. Automatic Classification of the Korean Triage Acuity Scale in Simulated Emergency Rooms Using Speech Recognition and Natural Language Processing: a Proof of Concept Study [O] . Dongkyun Kim, Jaehoon Oh, Heeju Im, 2021

机译：使用语音识别和自然语言处理的模拟急诊室中韩国分流刻度的自动分类：概念研究证明
7. ROBUST SPEECH RECOGNITION BY INTEGRATING SPEECH SEPARATION AND HYPOTHESIS TESTING [O] . Soundararajan Srinivasan 2008

机译：通过集成语音分离和假设测试进行语音强健识别

Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs

摘要

著录项

相似文献

相关主题

期刊订阅