Probabilistic nod generation model based on speech and estimated utterance categories

Liu Chaoran; Ishi Carlos; Ishiguro Hiroshi

Abstract

We proposed and evaluated a probabilistic model that generates nod motions based on utterance categories estimated from the speech input. The model comprises two main blocks. In the first block, dialog-act-related categories are estimated from the input speech. Considering the correlations between dialog acts and head motions, the utterances are classified into three categories having distinct nod distributions. Linguistic information extracted from the input speech is fed to a cluster of classifiers, whose outputs are combined to estimate the utterance category. In the second block, nod motion parameters are generated based on the categories estimated by the classifiers. The nod motion parameters are represented as probability distribution functions (PDFs) inferred from human motion data. Using speech energy features, the parameters are sampled from the PDFs belonging to the estimated categories. The effectiveness of the proposed model was evaluated through subjective experiments using an android robot. Experimental results indicated that the motions generated by the proposed approach are considered more natural than those of a previous model that used fixed nod shapes and hand-labeled utterance categories.
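The two-block structure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. The category names, the majority-vote combination rule, the Gaussian form of the PDFs, all numeric values, and the way speech energy scales the amplitude are assumptions made purely for illustration; the abstract only states that there are three categories, that several classifiers are combined, and that parameters are sampled from category-specific PDFs using speech energy features.

```python
import random
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class NodPdf:
    """Hypothetical per-category nod distribution (all fields are assumptions)."""
    nod_prob: float   # assumed probability that a nod is generated at all
    amp_mean: float   # assumed mean nod amplitude, in degrees
    amp_std: float
    dur_mean: float   # assumed mean nod duration, in seconds
    dur_std: float


# Three categories with distinct nod distributions. The names and numbers
# are placeholders, not values reported in the paper.
NOD_PDFS = {
    "frequent": NodPdf(0.8, 8.0, 2.0, 0.40, 0.08),
    "occasional": NodPdf(0.4, 5.0, 1.5, 0.35, 0.08),
    "rare": NodPdf(0.1, 3.0, 1.0, 0.30, 0.05),
}


def estimate_category(votes):
    """Block 1 stand-in: combine per-classifier labels by majority vote."""
    return Counter(votes).most_common(1)[0][0]


def generate_nod(category, energy, rng):
    """Block 2 stand-in: sample nod parameters from the category's PDFs.

    `energy` is a normalized speech energy in [0, 1]. Scaling the amplitude
    mean by energy is an assumption, not the paper's stated rule.
    Returns None when no nod is generated.
    """
    pdf = NOD_PDFS[category]
    if rng.random() >= pdf.nod_prob:
        return None
    scale = 0.5 + energy  # assumed: louder speech gives larger nods
    amplitude = max(0.5, rng.gauss(pdf.amp_mean * scale, pdf.amp_std))
    duration = max(0.1, rng.gauss(pdf.dur_mean, pdf.dur_std))
    return {"amplitude_deg": amplitude, "duration_s": duration}


if __name__ == "__main__":
    rng = random.Random(0)
    category = estimate_category(["frequent", "rare", "frequent"])
    print(category, generate_nod(category, energy=0.7, rng=rng))
```

Passing an explicit `random.Random` instance keeps the sampling reproducible, which is convenient when comparing generated motions across runs.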


Bibliographic details

Source: Advanced Robotics: The International Journal of the Robotics Society of Japan, 2019, No. 16, 11 pages
Authors: Liu Chaoran; Ishi Carlos; Ishiguro Hiroshi
Affiliation: ATR Hiroshi Ishiguro Lab, Kyoto, Japan (all authors)
Original format: PDF
Language: English
Chinese Library Classification: Robotics
Keywords: Nod; motion generation; SVM; humanoid robot


