Tracking of Multiple Fundamental Frequencies in Diplophonic Voices

Philipp Aichinger; Martin Hagmüller; Berit Schneider-Stickler; Jean Schoentgen; Franz Pernkopf

Abstract

Diplophonia is a type of pathological voice in which two fundamental frequencies (f_o) are present simultaneously. Specialized audio analyzers that can handle up to two f_os in diplophonic voices are in their infancy. We propose tracking up to two f_os in diplophonic voices by audio waveform modeling (AWM), which involves obtaining candidates by repetitive execution of the Viterbi algorithm, followed by waveform Fourier synthesis and heuristic candidate selection with majority voting. Our approach is evaluated against reference f_o tracks obtained from laryngeal high-speed videos of 29 sustained phonations and compared to state-of-the-art tracking algorithms for multiple f_os. An accurate variant and a fast variant of our algorithm are tested. The median error rate of the accurate variant is 6.52%, whereas the most accurate benchmark achieves 11.11%. The fast variant is more than twice as fast as the fastest relevant benchmark, with a median error rate of 9.52%. Furthermore, illustrative results of connected speech analysis are reported. Our approach may help to improve the detection and analysis of diplophonia in clinical research and practice, as well as to advance the synthesis of disordered voices.
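
The candidate-generation step mentioned in the abstract (repeated execution of the Viterbi algorithm) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the cost functions or how the passes are repeated, so the octave-jump transition cost, the `jump_weight` parameter, the function names `viterbi_path` and `track_two`, and the rule of removing the first track's candidates before the second pass are all assumptions made for illustration. The Fourier synthesis and majority-voting steps are not shown.

```python
import math


def viterbi_path(candidates, local_costs, jump_weight=5.0):
    """Return, per frame, the index of the candidate on the cheapest path.

    candidates[t]  : list of f_o candidates (Hz) in frame t
    local_costs[t] : matching list of per-candidate costs (lower is better)
    The transition cost is the absolute frequency jump in octaves,
    scaled by jump_weight (an assumed, illustrative choice).
    """
    cum = list(local_costs[0])  # cumulative cost of best path ending at each candidate
    backptrs = []
    for t in range(1, len(candidates)):
        new_cum, ptrs = [], []
        for j, f_new in enumerate(candidates[t]):
            trans = [cum[i] + jump_weight * abs(math.log2(f_new / f_old))
                     for i, f_old in enumerate(candidates[t - 1])]
            i_best = min(range(len(trans)), key=trans.__getitem__)
            new_cum.append(trans[i_best] + local_costs[t][j])
            ptrs.append(i_best)
        cum = new_cum
        backptrs.append(ptrs)
    # Backtrack from the cheapest final candidate.
    j = min(range(len(cum)), key=cum.__getitem__)
    path = [j]
    for ptrs in reversed(backptrs):
        j = ptrs[j]
        path.append(j)
    return path[::-1]


def track_two(candidates, local_costs):
    """Run Viterbi twice: the second pass excludes the first track's picks.

    Assumes every frame offers at least two candidates.
    """
    first = viterbi_path(candidates, local_costs)
    track1 = [c[i] for c, i in zip(candidates, first)]
    rest_c = [[f for k, f in enumerate(c) if k != i]
              for c, i in zip(candidates, first)]
    rest_w = [[w for k, w in enumerate(ws) if k != i]
              for ws, i in zip(local_costs, first)]
    second = viterbi_path(rest_c, rest_w)
    track2 = [c[i] for c, i in zip(rest_c, second)]
    return track1, track2


# Toy data (made up): two slowly varying f_o tracks plus one spurious candidate.
candidates = [[100, 150], [101, 152], [99, 149, 200], [100, 151]]
local_costs = [[0.1, 0.3], [0.2, 0.3], [0.2, 0.3, 0.1], [0.1, 0.3]]
track1, track2 = track_two(candidates, local_costs)
print(track1, track2)
```

In this toy example the spurious 200 Hz candidate has the lowest local cost in its frame, yet neither pass selects it, because the jump penalty favors smooth tracks. That smoothness constraint is the reason for using dynamic programming here instead of picking the best candidate frame by frame.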

Bibliographic record

Source: Audio, Speech, and Language Processing, IEEE/ACM Transactions on, 2018, no. 2, pp. 330-341 (12 pages)
Original format: PDF
Language: English
Keywords: Videos; Hidden Markov models; Speech; Oscillators; Speech processing; Error analysis; Benchmark testing
