Frame-based phonotactic Language Identification

机译：基于帧的音律语言识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper describes a frame-based phonotactic Language Identification (LID) system, which was used for the LID evaluation of the Robust Automatic Transcription of Speech (RATS) program by the Defense Advanced Research Projects Agency (DARPA). The proposed approach utilizes features derived from frame-level phone log-likelihoods from a phone recognizer. It is an attempt to capture not only phone sequence information but also short-term timing information for phone N-gram events, which is lacking in conventional phonotactic LID systems that simply count phone N-gram events. Based on this new method, we achieved 26% relative improvement in terms of Cavg for the RATS LID evaluation data compared to phone N-gram counts modeling. We also observed that it had a significant impact on score combination with our best acoustic system based on Mel-Frequency Cepstral Coefficients (MFCCs).

机译：本文介绍了一种基于帧的音符语言识别（LID）系统，该系统用于国防高级研究计划局（DARPA）对语音的稳健自动转录（RATS）程序的LID评估。所提出的方法利用了来自电话识别器的帧级电话对数可能性的特征。试图不仅捕获电话序列信息而且捕获针对电话N-gram事件的短期定时信息，这是传统的仅对电话N-gram事件进行计数的音变LID系统所缺少的。基于这种新方法，与电话N-gram计数模型相比，RATS LID评估数据的C avg 相对提高了26％。我们还观察到，它与基于梅尔频率倒谱系数（MFCC）的最佳声学系统对得分组合产生了重大影响。

著录项

来源
《2012 IEEE Workshop on Spoken Language Technology.》|2012年|p.303-306|共4页
会议地点 Miami FL(US);Miami FL(US)
作者
Han Kyu J.; Pelecanos Jason;
展开▼
作者单位

IBM T. J. Watson Research Center Yorktown Heights, NY 10598, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类语音信号处理;语音信号处理;
关键词
DARPA RATS; language identification; phone event modeling with timing information; phonotactic;

机译：DARPA RATS；语言识别；带有定时信息的电话事件建模；语音方法；;

相似文献

外文文献
中文文献
专利

1. Spoken Language Identification with Phonotactics Methods on Minangkabau, Sundanese, and Javanese Languages [J] . Nur Endah Safitri, Amalia Zahra, Mirna Adriani Procedia Computer Science . 2016,第1期

机译：南部语言，Sun语和爪哇语言上的语音方法识别口语
2. Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish [J] . Victor G. Guijarrubia, M. Ines Torres Pattern recognition letters . 2010,第6期

机译：基于文本和语音的音位学模型用于巴斯克语和西班牙语的口语识别
3. Phonotactic Constraints Are Activated across Languages in Bilinguals [J] . Max R. Freeman, Henrike K. Blumenfeld, Viorica Marian Frontiers in Psychology . 2016,第4期

机译：语音约束在双语者中跨语言激活
4. Improved phonotactic language identification using random forest language models [C] . XiaoRui Wang, ShiJin Wang, JiaEn Liang, Personal, Indoor and Mobile Radio Communications,2005 IEEE 16th International Symposium on . 2008

机译：使用随机森林语言模型改进的音符语言识别
5. The Influence of Native Language Phonotactics on Second Language Lexical Representation in Japanese Learners of English [D] . ?Rothgerber, John Robert 2020

机译：母语致辞对英语学习者第二语言词汇表演的影响
6. Phonotactic Constraints Are Activated across Languages in Bilinguals [O] . Max R. Freeman, Henrike K. Blumenfeld, Viorica Marian -1

机译：语音约束在双语者中跨语言激活
7. Spoken Language Identification with Phonotactics Methods on Minangkabau, Sundanese, and Javanese Languages [O] . Safitri Nur Endah, Zahra Amalia, Adriani Mirna 2016

机译：语音策略方法在Minangkabau，Sundanese和Javanese语言上的口语识别
8. Advanced Language Recognition using Cepstra and Phonotactics: MITLL System Performance on the NIST 2005 Language Recognition Evaluation. [R] . Campbell, W. M., Gleason, T., Navratil, J., 2016

机译：使用Cepstra和phonotactics进行高级语言识别：NIsT 2005语言识别评估中的mITLL系统性能。

Frame-based phonotactic Language Identification

摘要

著录项

相似文献

相关主题

期刊订阅