AUTOMATIC SEGMENTATION AND LABELING OF CONTINUOUS SPEECH WITHOUT BOOTSTRAPPING

Abstract

This paper proposes a novel approach for automatically segmenting and transcribing continuous speech signals without the use of manually segmented and labeled speech corpora. The continuous speech signal is first segmented into syllable-like units by treating its short-term energy as the magnitude spectrum of an arbitrary signal. Similar syllable segments are then grouped together using an unsupervised, incremental clustering technique, and a separate model is generated for each cluster of syllable segments. At this stage, a label is assigned manually to each group of syllable segments. The syllable models of these clusters are then used to transcribe/recognize the continuous speech of closed-set as well as open-set speakers. As a syllable recognizer, our initial results on Indian television news bulletins in Tamil and Telugu show a performance of 43.3% and 32.9%, respectively.
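The first stage described in the abstract, splitting speech into syllable-like units from its short-term energy, can be illustrated with a minimal sketch. This is a deliberately simplified stand-in: it marks boundaries at low-energy local minima of a smoothed energy contour and does not reproduce the paper's treatment of the energy contour as a magnitude spectrum. The function names, the frame length, the hop size, the smoothing width and the `threshold_ratio` parameter are all assumptions made for illustration.

```python
def short_term_energy(signal, frame_len, hop):
    """Sum of squared samples in each frame of `frame_len` samples."""
    return [
        sum(x * x for x in signal[start:start + frame_len])
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


def smooth(values, width):
    """Centered moving average; the window is truncated at the edges."""
    half = width // 2
    out = []
    for i in range(len(values)):
        window = values[max(0, i - half):i + half + 1]
        out.append(sum(window) / len(window))
    return out


def boundaries_from_energy(energy, threshold_ratio=0.1):
    """Frame indices of local minima that lie below a fraction of the peak.

    Hypothetical rule for this sketch: a frame is a boundary if its energy
    is below `threshold_ratio * max(energy)`, not above its left neighbour,
    and strictly below its right neighbour. On a flat minimum this picks
    the last frame of the plateau.
    """
    limit = threshold_ratio * max(energy)
    return [
        i
        for i in range(1, len(energy) - 1)
        if energy[i] < limit
        and energy[i] <= energy[i - 1]
        and energy[i] < energy[i + 1]
    ]
```

For example, with two bursts of signal separated by silence, `boundaries_from_energy(smooth(short_term_energy(signal, 160, 80), 5))` returns the frame index at the end of the silent stretch, which can then be converted to a sample position as `index * hop`. Real speech would need a more robust boundary detector than this threshold rule.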

Bibliographic details

Source
European Signal Processing Conference (EUSIPCO 2004), vol. 1; 6-10 September 2004; Vienna (AT) | 2004 | pp. 561-564 | 4 pages
Conference location: Vienna (AT)
Authors
T. Nagarajan; Hema A. Murthy; N. Hemalatha
Author affiliation

Dept. of Computer Science and Engineering, Indian Institute of Technology, Madras

Original format: PDF
Text language: English
Chinese Library Classification: Information processing