首页> 外文OA文献 >The suitability of iPhone recordings for the acoustic measures of speech and voice quality
【2h】

The suitability of iPhone recordings for the acoustic measures of speech and voice quality

机译:iPhone录音适合语音和语音质量的声学测量

摘要

This study examined the quality of iPhone recordings for acoustic measurements of speech and voice quality. A selection of acoustic measures were extracted from voice samples recorded using the “voice memo” application in an iPhone and compared with those derived from signals directly digitized (DD) in a laptop via a 12-bit A/D converter. Participants were 11 healthy adults, including six females and five males, aged between 27 to 67 years (Mean = 41.8 years, SD = 16.7). The participant was asked to read the first six sentences of the “rainbow passage”. In addition, two participants were asked to produce sustained vowels (/i/, /a/, and /u/) and a sentence (“We saw two cars”) ten times. The simultaneously recorded iPhone and DD signals were analysed to derive 10 acoustic measures, including spectral tilt for the whole sentence and fundamental frequency (F0), percent jitter, percent shimmer, signal-to-noise ratio, amplitude of the first harmonic relative to that of the second harmonic, singing power ratio, and frequencies of the first and second formants (F1 and F2), and vowel space area for the vowel segment. A series of Pearson’s correlation procedures revealed that measures from iPhone and DD signals were highly correlated. Findings of the vowel effect on the experimental measures obtained from iPhone signals were consistent with those from DD signals. However, the mean normalized absolute differences between measures from iPhone and DD signals are optimal (i.e., lower than 20%) only for F0, F1, and F2. These findings suggest that iPhone recordings are as adequate as other types of high quality digital recordings for acoustic measurements of voice quality but most voice measures from different digital recording systems are not directly comparable.
机译:这项研究检查了iPhone录音的质量,以进行语音和语音质量的声学测量。从使用iPhone中的“语音备忘”应用程序记录的语音样本中提取了一些声学测量结果,并将这些声学测量结果与通过12位A / D转换器在笔记本电脑中直接数字化(DD)的信号得出的结果进行了比较。参加者为11位健康的成年人,包括6位女性和5位男性,年龄在27至67岁之间(平均= 41.8岁,SD = 16.7)。要求参与者阅读“彩虹段落”的前六个句子。此外,还要求两名参与者制作持续元音(/ i /,/ a /和/ u /)和一个句子(“我们看到两辆车”)十次。对同时记录的iPhone和DD信号进行分析,得出10种声学测量,包括整个句子的频谱倾斜度和基频(F0),抖动百分比,闪烁百分比,信噪比,相对于该谐波的一次谐波幅度二次谐波,歌唱功率比,第一和第二共振峰(F1和F2)的频率以及元音段的元音空间区域。皮尔逊(Pearson)的一系列相关程序显示,iPhone和DD信号的测量值高度相关。元音对从iPhone信号获得的实验测量结果的发现与从DD信号得到的结果一致。但是,仅对于F0,F1和F2,来自iPhone和DD信号的测量值之间的平均归一化绝对差是最佳的(即,低于20%)。这些发现表明,iPhone录音与其他类型的高质量数字录音一样,足以进行声音质量的声学测量,但是来自不同数字录音系统的大多数声音测量都无法直接比较。

著录项

  • 作者

    Lin E.; Hornibrook J.;

  • 作者单位
  • 年度 2011
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号