Exploiting loudness dynamics in stochastic models of turn-taking

机译：在转弯随机模型中利用响度动力学

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stochastic turn-taking models have traditionally been implemented as N-grams, which condition predictions on recent binary-valued speechon-speech contours. The current work re-implements this function using feed-forward neural networks, capable of accepting binary- as well as continuous-valued features; performance is shown to asymptotically approach that of the N-gram baseline as model complexity increases. The conditioning context is then extended to leverage loudness contours. Experiments indicate that the additional sensitivity to loudness considerably decreases average cross entropy rates on unseen data, by 0.03 bits per framing interval of 100 ms. This reduction is shown to make loudness-sensitive conversants capable of better predictions, with attention memory requirements at least 5 times smaller and responsiveness latency at least 10 times shorter than the loudness-insensitive baseline.

机译：传统上，随机转弯模型被实现为N-gram，它以最近的二进制值语音/非语音轮廓为条件进行预测。当前的工作是使用前馈神经网络重新实现此功能，该网络能够接受二进制值和连续值特征。随着模型复杂度的增加，性能逐渐显示出接近N元语法基线。然后扩展条件上下文以利用响度轮廓。实验表明，对响度的额外敏感性大大降低了看不见数据的平均交叉熵率，每100 ms的帧间隔降低了0.03位。这种减少表明可以使响度敏感的对话者能够更好地进行预测，与不响度敏感的基线相比，注意力记忆需求至少小5倍，响应潜伏期至少短10倍。

著录项

来源
《2012 IEEE Workshop on Spoken Language Technology.》|2012年|p.79-84|共6页
会议地点 Miami FL(US);Miami FL(US)
作者
Laskowski Kornel;
展开▼
作者单位

Carnegie Mellon University, Pittsburgh PA, USA Voci Technologies, Inc., Pittsburgh PA, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类语音信号处理;语音信号处理;
关键词
Interaction models; neural networks; prosody; spoken dialogue systems;

机译：交互模型;神经网络;韵律;口语对话系统;;

相似文献

外文文献
中文文献
专利

1. Incorporating stochasticity in the study of exploited fish population dynamics: Implications for the study of post-recruitment harvest strategies [J] . Councill Elizabeth L. Mathematical Biosciences: An International Journal . 2016,第Null期

机译：将随机性纳入剥削鱼类种群动态的研究中：对招聘后收获策略研究的启示
2. Bayesian Modeling of the Dynamics of Phase Modulations and their Application to Auditory Event Related Potentials at Different Loudness Scales [J] . Mortezapouraghdam, Zeinab Frontiers in Computational Neuroscience . 2016,第4期

机译：不同响度范围内相位调制动力学的贝叶斯建模及其在听觉事件相关电位中的应用
3. Bayesian Modeling of the Dynamics of Phase Modulations and their Application to Auditory Event Related Potentials at Different Loudness Scales [J] . Zeinab Mortezapouraghdam, Robert C. Wilson, Lars Schwabe, Frontiers in Computational Neuroscience . 2016,第4期

机译：不同响度范围内相位调制动力学的贝叶斯建模及其在听觉事件相关电位中的应用
4. Exploiting loudness dynamics in stochastic models of turn-taking [C] . Laskowski Kornel IEEE Workshop on Spoken Language Technology . 2012

机译：开采响度动态的转型型号
5. Stochastic modeling: Underlying stochastic processes and model dynamics. [D] . Ostrovsky, Dmitry V. 2004

机译：随机建模：随机过程和模型动力学的基础。
6. Stochasticity in staged models of epidemics: quantifying the dynamics of whooping cough [O] . Andrew J. Black, Alan J. McKane 2010

机译：流行病分期模型中的随机性：量化百日咳的动力学
7. EXPLOITING LOUDNESS DYNAMICS IN STOCHASTIC MODELS OF TURN-TAKING [O] . Kornel Laskowski 2013

机译：探索转弯随机模型中的响度动力学
8. Techniques for Modeling Stochastics Dynamical Systems. [R] . Brockett, R. W. 1983

机译：随机动力系统建模技术。

Exploiting loudness dynamics in stochastic models of turn-taking

摘要

著录项

相似文献

相关主题

期刊订阅