Exploiting loudness dynamics in stochastic models of turn-taking



Abstract

Stochastic turn-taking models have traditionally been implemented as N-grams, which condition predictions on recent binary-valued speech/non-speech contours. The current work re-implements this function using feed-forward neural networks, capable of accepting binary- as well as continuous-valued features; performance is shown to asymptotically approach that of the N-gram baseline as model complexity increases. The conditioning context is then extended to leverage loudness contours. Experiments indicate that the additional sensitivity to loudness considerably decreases average cross entropy rates on unseen data, by 0.03 bits per framing interval of 100 ms. This reduction is shown to make loudness-sensitive conversants capable of better predictions, with attention memory requirements at least 5 times smaller and responsiveness latency at least 10 times shorter than the loudness-insensitive baseline.
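The N-gram baseline and the cross entropy rate quoted in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a single binary speech/non-speech contour sampled once per 100 ms frame, and it uses add-alpha smoothing, which the abstract does not mention. The names `train_ngram`, `prob`, and `cross_entropy_rate` are hypothetical.

```python
import math
from collections import defaultdict


def train_ngram(frames, order):
    """Count next-frame outcomes for each context of (order - 1) past frames.

    `frames` is a list of 0/1 values (0 = non-speech, 1 = speech),
    one value per framing interval (assumed here to be 100 ms).
    """
    counts = defaultdict(lambda: [0, 0])
    ctx_len = order - 1
    for t in range(ctx_len, len(frames)):
        context = tuple(frames[t - ctx_len:t])
        counts[context][frames[t]] += 1
    return counts


def prob(counts, context, value, alpha=1.0):
    """Add-alpha smoothed estimate of P(value | context).

    The smoothing scheme is an assumption made for this sketch.
    """
    c = counts.get(context, [0, 0])
    return (c[value] + alpha) / (c[0] + c[1] + 2 * alpha)


def cross_entropy_rate(counts, frames, order, alpha=1.0):
    """Average negative log2-probability per frame, in bits per frame."""
    ctx_len = order - 1
    total_bits = 0.0
    n_frames = 0
    for t in range(ctx_len, len(frames)):
        context = tuple(frames[t - ctx_len:t])
        total_bits -= math.log2(prob(counts, context, frames[t], alpha))
        n_frames += 1
    return total_bits / n_frames


if __name__ == "__main__":
    # Toy contour: three frames of silence, then three frames of speech, repeated.
    contour = [0, 0, 0, 1, 1, 1] * 50
    for order in (1, 3):
        model = train_ngram(contour, order)
        rate = cross_entropy_rate(model, contour, order)
        print(f"order={order}: {rate:.3f} bits per frame")
```

A lower cross entropy rate means the model assigns higher probability to the speech activity that actually occurs. The 0.03 bits per 100 ms frame reported in the abstract is a reduction in this kind of quantity, measured on unseen data.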


Bibliographic details

Source
IEEE Workshop on Spoken Language Technology, 2012, 6 pages
Author
Kornel Laskowski
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
Interaction models; neural networks; prosody; spoken dialogue systems


Similar literature


1. Incorporating stochasticity in the study of exploited fish population dynamics: Implications for the study of post-recruitment harvest strategies [J]. Councill, Elizabeth L. Mathematical Biosciences: An International Journal. 2016
2. Bayesian Modeling of the Dynamics of Phase Modulations and their Application to Auditory Event Related Potentials at Different Loudness Scales [J]. Mortezapouraghdam, Zeinab. Frontiers in Computational Neuroscience. 2016, issue 4
3. Bayesian Modeling of the Dynamics of Phase Modulations and their Application to Auditory Event Related Potentials at Different Loudness Scales [J]. Zeinab Mortezapouraghdam, Robert C. Wilson, Lars Schwabe. Frontiers in Computational Neuroscience. 2016, issue 4
4. Exploiting loudness dynamics in stochastic models of turn-taking [C]. Laskowski, Kornel. 2012 IEEE Workshop on Spoken Language Technology. 2012
5. Stochastic modeling: Underlying stochastic processes and model dynamics [D]. Ostrovsky, Dmitry V. 2004
6. Stochasticity in staged models of epidemics: quantifying the dynamics of whooping cough [O]. Andrew J. Black, Alan J. McKane. 2010
7. Exploiting loudness dynamics in stochastic models of turn-taking [O]. Kornel Laskowski. 2013
8. Techniques for Modeling Stochastics Dynamical Systems [R]. Brockett, R. W. 1983
