
Methods for composite pitch extraction of frequency domain and time domain for voice signal, distributed speech recognition systems, and computer readable media



Abstract

A system, computer readable medium, and method for sampling a speech signal; dividing the sampled speech signal into overlapped frames; extracting first pitch information from a frame using frequency domain analysis; providing at least one pitch candidate, each being associated with a spectral score, from the first pitch information, each of the at least one pitch candidate representing a possible pitch estimate for the frame; extracting second pitch information from the frame using a time domain analysis; providing a correlation score for the at least one pitch candidate from the second pitch information; and selecting one of the at least one pitch candidate to represent the pitch estimate of the frame. The system, computer readable medium, and method are suitable for speech coding and for distributed speech recognition.
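
The steps listed in the abstract can be illustrated with a small sketch. This is not the patented method: the abstract does not say how the spectral score or the correlation score is computed, nor how the final candidate is selected. The sketch below assumes a harmonic-sum spectral score, a normalized autocorrelation at the candidate's pitch lag, and selection by the product of the two. All function names, the 60 to 400 Hz search range, the 5 Hz grid, and the number of harmonics are hypothetical choices made only for illustration.

```python
import math

def split_frames(signal, frame_len, hop):
    """Divide the sampled signal into overlapped frames (overlap when hop < frame_len)."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def magnitude_at(frame, freq_hz, fs):
    """Magnitude of the Hann-windowed frame's spectrum at one arbitrary frequency."""
    n = len(frame)
    re = im = 0.0
    for k, x in enumerate(frame):
        w = 0.5 - 0.5 * math.cos(2 * math.pi * k / n)  # periodic Hann window
        ang = 2 * math.pi * freq_hz * k / fs
        re += w * x * math.cos(ang)
        im -= w * x * math.sin(ang)
    return math.hypot(re, im)

def spectral_candidates(frame, fs, f_min=60.0, f_max=400.0, step=5.0,
                        n_harm=4, top=3):
    """Frequency-domain step (assumed scoring): sum the spectral magnitude at the
    first n_harm harmonics of each trial pitch and keep the best few candidates."""
    scored = []
    for i in range(int((f_max - f_min) / step) + 1):
        f0 = f_min + i * step
        score = sum(magnitude_at(frame, h * f0, fs) for h in range(1, n_harm + 1))
        scored.append((score, f0))
    scored.sort(reverse=True)
    return [(f0, score) for score, f0 in scored[:top]]

def correlation_score(frame, f0, fs):
    """Time-domain step (assumed scoring): normalized correlation of the frame
    with itself shifted by one candidate pitch period."""
    lag = int(round(fs / f0))
    if lag <= 0 or lag >= len(frame):
        return 0.0
    a, b = frame[:-lag], frame[lag:]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0

def estimate_pitch(frame, fs):
    """Select one candidate (assumed rule): highest product of the normalized
    spectral score and the non-negative correlation score."""
    candidates = spectral_candidates(frame, fs)
    best_spec = max(score for _, score in candidates) or 1.0

    def combined(candidate):
        f0, spec = candidate
        return (spec / best_spec) * max(0.0, correlation_score(frame, f0, fs))

    return max(candidates, key=combined)[0]
```

Calling `estimate_pitch` on each frame returned by `split_frames` yields one pitch estimate per frame. The sketch always returns a candidate, so it makes no voiced/unvoiced decision; a practical system would need one.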


Bibliographic details

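Publication number: JP4755585B6
Publication date: 2011-12-28
Original format: PDF
Applicant/patentee: インターナショナル・ビジネス・マシーンズ・コーポレーション (International Business Machines Corporation)
Application number: JP2006509610
Inventors: ソリン、アレクサンダー (Sorin, Alexander); ラマバドラン、テンカシ、ヴィー (Ramabadran, Tenkasi V.)
Filing date: 2004-03-31
Classification: G10L11/04; G10L15/02; G10L15/28
Country: JP
Indexed: 2022-08-21 17:37:05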
