首页> 外国专利> INTERMEDIATE FEATURE QUANTITY CALCULATION DEVICE, ACOUSTIC MODEL LEARNING DEVICE, SPEECH RECOGNITION DEVICE, INTERMEDIATE FEATURE QUANTITY CALCULATION METHOD, ACOUSTIC MODEL LEARNING METHOD, SPEECH RECOGNITION METHOD, AND PROGRAM

INTERMEDIATE FEATURE QUANTITY CALCULATION DEVICE, ACOUSTIC MODEL LEARNING DEVICE, SPEECH RECOGNITION DEVICE, INTERMEDIATE FEATURE QUANTITY CALCULATION METHOD, ACOUSTIC MODEL LEARNING METHOD, SPEECH RECOGNITION METHOD, AND PROGRAM

机译：中间特征量计算设备，声学模型学习设备，语音识别设备，中间特征量计算方法，声学模型学习方法，语音识别方法和程序

页面导航

摘要
著录项
相似文献

摘要

PROBLEM TO BE SOLVED: To provide a technique for calculating an intermediate feature quantity to be used to learn an acoustic model for actualizing high-precision speech recognition without reference to the kind of speech data for learning while suppressing a calculation time needed for recognition processing.SOLUTION: An intermediate feature quantity calculation device includes: a kind identification part 110 which determines, from a speech feature quantity, a kind number j' (where j' is an integer satisfying 1≤j'≤J) corresponding to the kind of speech data from which the speech feature quantity is extracted; and a phoneme intermediate feature quantity calculation part 120 which calculates, as a phoneme intermediate feature quantity from the speech feature quantity and the kind number j', a j'-th kind intermediate feature quantity being a feature quantity to be used to calculate phoneme probability distributions p=(p, ..., p) being a distribution of a probability pin which a phoneme to which the speech feature quantity corresponds is a phoneme with a phoneme number m. The phoneme intermediate feature quantity calculation part includes a j-th kind intermediate feature quantity calculation part which uses a neural network for each integer j satisfying 1≤j≤J to calculate a j-th kind intermediate feature quantity from the speech feature quantity extracted from speech data with a kind number j.SELECTED DRAWING: Figure 2

机译：解决的问题：提供一种计算中间特征量的技术，该中间特征量用于学习用于实现高精度语音识别的声学模型，而无需参考用于学习的语音数据的种类，同时抑制识别处理所需的计算时间。解决方案：中间特征量计算装置包括：类型识别部分110，该类型识别部分110从语音特征量中确定与语音类型相对应的类型编号j'（其中j'是满足1≤j'≤J的整数）从中提取语音特征量的数据;音素中间特征量计算部120，根据语音特征量和种类编号j'，算出第j'种中间特征量作为用于计算音素概率的特征量，作为音素中间特征量。分布p ＝（p，...，p）是语音特征量所对应的音素是音素数m的音素的概率别针的分布。音素中间特征量计算部分包括第j种中间特征量计算部分，该第j种中间特征量计算部分对满足1≤j≤J的每个整数j使用神经网络，以从从中提取的语音特征量计算第j种中间特征量。带有种类编号j的语音数据。选定的图：图2

著录项

公开/公告号JP2018128574A

专利类型
公开/公告日2018-08-16

原文格式PDF
申请/专利权人 NIPPON TELEGR & TELEPH CORP NTT;
展开▼

申请/专利号JP20170021565
发明设计人 MORIYA TAKASHI;ASAMI TAICHI;
展开▼

申请日2017-02-08
分类号G10L15/02;G10L15/16;G10L15/10;G10L15/18;G06N3/08;G10L15/06;
国家 JP
入库时间 2022-08-21 13:13:54

相似文献

专利
外文文献
中文文献