Unsupervised speaker adaptation based on hierarchical spectral clustering

Furui S.

首页> 外文期刊>IEEE Transactions on Acoustics, Speech, and Signal Processing >Unsupervised speaker adaptation based on hierarchical spectral clustering

【24h】

Unsupervised speaker adaptation based on hierarchical spectral clustering

机译：基于分层频谱聚类的无监督说话人自适应

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The author proposes an automatic speaker adaptation algorithm for speech recognition, in which a small amount of training material of unspecified text can be used. The algorithm is easily applied to vector-quantization- (VQ) speech recognition systems consisting of a VQ codebook and a word dictionary in which each word is represented as a sequence of codebook entries. In the adaptation algorithm, the VQ codebook is modified for each new speaker, whereas the word dictionary is universally used for all speakers. The important feature of this algorithm is that a set of spectra in training frames and the codebook entries are clustered hierarchically. Based on the vectors representing deviation between centroids of the training frame clusters and the corresponding codebook clusters, adaptation is performed hierarchically from small to large numbers of clusters. The spectral resolution of the adaptation process is improved accordingly. Results of recognition experiments using utterances of 100 Japanese city names show that adaptation reduces the mean word recognition error rate from 4.9 to 2.9%. Since the error rate for speaker-dependent recognition is 2.2%, the adaptation method is highly effective.

机译：作者提出了一种用于语音识别的自动说话人自适应算法，其中可以使用少量的未指定文本的训练材料。该算法可轻松应用于由VQ码本和单词词典组成的矢量量化（VQ）语音识别系统，其中每个单词都表示为一系列码本条目。在自适应算法中，为每个新说话者修改了VQ码本，而单词词典普遍用于所有说话者。该算法的重要特征是训练帧中的一组频谱和码本条目是按层次结构聚类的。基于表示训练帧簇的质心和相应的码本簇之间的偏差的向量，从小簇到大簇进行分层自适应。适应过程的光谱分辨率相应提高。使用100个日语城市名称的语音进行识别实验的结果表明，自适应将平均单词识别错误率从4.9％降低到2.9％。由于用于说话人相关识别的错误率是2.2％，因此自适应方法非常有效。

著录项

来源
《IEEE Transactions on Acoustics, Speech, and Signal Processing》 |1989年第12期|P.1923-1930|共8页
作者
Furui S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Unsupervised hierarchical adaptation using reliable selection of cluster-dependent parameters [J] . Jen-Tzung Chien, Jean-Claude Junqua 20f Speech Communication . 2000,第4期

机译：使用可靠选择簇相关参数的无监督分层适应
2. Multistage data selection-based unsupervised speaker adaptation for personalized speech emotion recognition [J] . Jae-Bok Kim, Jeong-Sik Park Engineering Applications of Artificial Intelligence . 2016,第Juna期

机译：基于多阶段数据选择的无监督说话者自适应，用于个性化语音情感识别
3. Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction [J] . Dong-Jin Choi, Jeong-Sik Park, Yung-Hwan Oh Engineering Applications of Artificial Intelligence . 2015,第apra期

机译：基于选择性特征语音合并的无监督快速说话人适应，用于特定于用户的语音交互
4. Unsupervised speaker adaptation method based on hierarchical spectral clustering [C] . Furui, S. . 1989

机译：基于分层谱聚类的无监督说话人自适应方法
5. An unsupervised hierarchical clustering image segmentation and an adaptive image reconstruction system for remote sensing. [D] . Lee, Sanghoon. 1990

机译：用于遥感的无监督分层聚类图像分割和自适应图像重建系统。
6. A Hierarchical Unsupervised Spectral Clustering Scheme for Detection of Prostate Cancer from Magnetic Resonance Spectroscopy (MRS) [O] . Pallavi Tiwari, Anant Madabhushi, Mark Rosen -1

机译：从磁共振波谱（MRS）检测前列腺癌的分层无监督谱聚类方案
7. Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR [O] . Leda Sari, Niko Moritz, Takaaki Hori, 2020

机译：无监督的扬声器适应使用基于注意的扬声器内存，用于端到端ASR

Unsupervised speaker adaptation based on hierarchical spectral clustering

摘要

著录项

相似文献

相关主题

期刊订阅