首页> 外文会议>International Conference on Telecommunications and Signal Processing >Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs
【24h】

Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs

机译:测试通用婴儿语言假设-使用CNN的自动婴儿语音识别

获取原文

摘要

This paper presents an application of convolutional neural networks (CNN) for the recognition of the so-called “Dunstan baby language” that consists of five “words” or phonemes used by babies of age under 3 months to communicate their needs before they start crying. The model was derived from a CNN architecture which was successfully applied by the authors for voice-based emotion detection. The input of the neural network is the spectrogram obtained from the audio records of babies' voices and is processed as a two-dimensional image. The architecture was trained for a set of 250 small duration recordings and was tested for other 65 recordings with a recognition rate of 89%. The length of all audio files is less than 1 second; the recordings were extracted from certified Dunstan language recordings. The most important original contribution of the paper is the recognition of the actual “baby words” (and not the baby cry as was done before). This architecture offers an efficient tool for the verification of the “universal baby language” hypothesis, according to which the language of infants does not depend on culture, family, etc.
机译:本文介绍了卷积神经网络(CNN)在识别所谓的“ Dunstan婴儿语言”中的应用,该语言由三个月以下婴儿使用的五个“单词”或音素在开始哭闹之前传达他们的需求。该模型源自CNN架构,该架构已成功地被作者应用于基于语音的情感检测。神经网络的输入是从婴儿声音的音频记录中获得的频谱图,并被处理为二维图像。对该体系结构进行了250组小持续时间记录的培训,并针对其他65个记录进行了测试,识别率为89%。所有音频文件的长度小于1秒;这些录音摘自经过认证的Dunstan语言录音。本文最重要的原始贡献是对实际“婴儿单词”的识别(而不是像以前那样哭泣)。这种体系结构为验证“通用婴儿语言”假设提供了一种有效的工具,根据该假设,婴儿的语言不依赖于文化,家庭等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号