Relationship of analysis frame interval and image resolution in automatic lip-reading recognition performance

Hidekazu Kato; Akinobu Lee; Hiroshi Saruwatari; Kiyohiro Shikano

Abstract

Automatic lip-reading, which uses video sequences of a speaker's mouth, has attracted significant interest as a way to increase the robustness of automatic speech recognition in noisy environments. However, it has not yet achieved a sufficient recognition rate. In this paper, we investigate how the analysis frame interval and the image resolution affect lip-reading performance. Experiments at various analysis frame intervals, using video sequences recorded by a high-speed camera, make clear that a faster frame rate is effective for achieving high recognition performance. Further experiments at various image resolutions show that recognition performance does not depend on the image resolution. These results suggest that, for the visual feature vector extracted by our image-based approach, the resolution can be reduced to 20×15 pixels.
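
The two experimental variables named in the abstract, analysis frame interval and image resolution, can be illustrated with a minimal sketch. This is not the authors' pipeline: the clip dimensions, the function names `subsample_frames` and `reduce_resolution`, and the use of block averaging for downsampling are all assumptions made for illustration only.

```python
import numpy as np


def subsample_frames(clip, step):
    """Keep every `step`-th frame, i.e. lengthen the analysis frame interval."""
    if step < 1:
        raise ValueError("step must be >= 1")
    return clip[::step]


def reduce_resolution(clip, out_h, out_w):
    """Block-average each frame down to (out_h, out_w).

    Block averaging is an assumed downsampling method, not one taken
    from the paper.
    """
    t, h, w = clip.shape
    if h % out_h or w % out_w:
        raise ValueError("frame size must be a multiple of the target size")
    fh, fw = h // out_h, w // out_w
    # Split each frame into (out_h x out_w) blocks of size (fh x fw), then average.
    return clip.reshape(t, out_h, fh, out_w, fw).mean(axis=(2, 4))


# Hypothetical clip: 120 grayscale frames of 60x80 pixels (height x width).
rng = np.random.default_rng(0)
clip = rng.random((120, 60, 80))

# Keep every 4th frame, then reduce each frame to 20x15 pixels (width x height).
coarse = reduce_resolution(subsample_frames(clip, 4), out_h=15, out_w=20)
print(coarse.shape)  # (30, 15, 20)
```

Sweeping `step` and the target size independently would give the kind of grid over frame interval and resolution that the abstract describes, with a recognizer evaluated at each setting.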

Bibliographic record

Source
電子情報通信学会技術研究報告. 音声. Speech | 2002, No. 529 | pp. 129-134 | 6 pages
Authors
Hidekazu Kato; Akinobu Lee; Hiroshi Saruwatari; Kiyohiro Shikano
Original format: PDF
Text language: Japanese
Chinese Library Classification: Telegraphy, facsimile
Keywords
Automatic lip-reading; Analysis frame interval; Image resolution; Image based method
