首页> 外文会议>Signal and Image Processing >THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING

【24h】

THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING

机译：嵌入式语音拨号的动态可调整直方图修剪方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Memory and speed are two key factors that must be faced when applying voice dialer to Pocket PCs. To provide a solution, a novel decoding method integrated with the score differences of token paths is proposed, named as "Dynamically-Adjustable Histogram Pruning". Additionally, the computation of likelihood score is accelerated by means of dynamic score lookup table. Furthermore, a new acoustic modeling method based on Extended Initial/Final (XIF) with less dimensioned acoustic feature is proven suitable for embedded speech command recognition. By adopting the methods developed above, we implement a speaker-independent, user definable voice dialing speech recognition system with good performance on a real PDA device. For a 200-Chinese-word vocabulary, its recognition accuracy reaches 97.80%. Meanwhile, it obtains better recognition speed by 80 times and saves decoding space by 30% in comparison to the baseline system using standard Viterbi decoding method.

机译：内存和速度是将语音拨号器应用于Pocket PC时必须面对的两个关键因素。为了提供一种解决方案，提出了一种与令牌路径的分数差异相集成的新颖解码方法，称为“动态可调直方图修剪”。另外，借助动态得分查找表来加速似然得分的计算。此外，事实证明，基于扩展初始/最终（XIF）的具有较小尺寸声学特征的新声学建模方法适用于嵌入式语音命令识别。通过采用以上开发的方法，我们在真实的PDA设备上实现了性能良好的独立于扬声器的，用户可定义的语音拨号语音识别系统。对于200个中文单词的词汇，其识别准确率达到97.80％。同时，与使用标准Viterbi解码方法的基准系统相比，它获得了80倍的更好识别速度，并节省了30％的解码空间。

著录项

来源
《Signal and Image Processing》|2005年|P.46-51|共6页
会议地点 HonoluluHI(US)
作者
Defeng CHEN; Fang ZHENG; Jian LIU; Jing DENG; Wenhu WU; Zhanjiang SONG; Xunyi ZHOU;
展开▼
作者单位

Center for Speech Technology, State Key Laboratory of Intelligent Technology and System, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;图像信号处理;
关键词
speech recognition; voice dialing; user definable vocabulary; dynamically-adjustable histogram pruning;

机译：语音识别;语音拨号;用户定义的词汇;动态可调的直方图修剪;

相似文献

外文文献
中文文献
专利

1. A quick search method for audio and video signals based on histogram pruning [J] . Kashino K., Kurozumi T., Murase H. IEEE transactions on multimedia . 2003,第3期

机译：基于直方图修剪的音视频信号快速搜索方法
2. Effective Voice Frame Pruning Method to Increase VoIP Call Capacity [J] . Progress in Artificial Intelligence . 2020,第1期

机译：有效的语音框架修剪方法，以提高VoIP呼叫容量
3. A Steganography Embedding Method Based on Hamming Coding and Histogram-Preserving [J] . Li Ya-Feng, Yang Ren-Er, Cheng Jie, International Journal of Pattern Recognition and Artificial Intelligence . 2019,第14期

机译：基于汉明编码和直方图保存的隐写方法
4. THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING [C] . Defeng CHEN, Fang ZHENG, Jian LIU, IASTED International Conference on Signal and Image Processing . 2005

机译：嵌入式语音拨号的动态可调直方图修剪方法
5. Voice-enabled click and dial system. [D] . Ju, Yanyan. 2005

机译：启用语音的点击和拨号系统。
6. Person-centred data collection methods to embed the authentic voice of people who experience health challenges [O] . Sarah Jane Prior, Carey Mather, Karen Ford, 2020

机译：以人为本的数据收集方法以嵌入体验健康挑战的真实声音
7. A quick search method for audio and video signals based on histogram pruning [O] . Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase, 2003

机译：一种基于直方图修剪的音视频信号快速搜索方法

THE DYNAMICALLY-ADJUSTABLE HISTOGRAM PRUNING METHOD FOR EMBEDDED VOICE DIALING

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅