首页> 外文会议>International Image Processing, Applications and Systems Conference >The speaker-independent lipreading play-off; a survey of lipreading machines
【24h】

The speaker-independent lipreading play-off; a survey of lipreading machines

机译:与说话者无关的唇读季后赛;唇读机调查

获取原文

摘要

Lipreading is a difficult gesture classification task. One problem in computer lipreading is speaker-independence. Speaker-independence means to achieve the same accuracy on test speakers not included in the training set as speakers within the training set. Current literature is limited on speaker-independent lipreading, the few independent test speaker accuracy scores are usually aggregated within dependent test speaker accuracies for an averaged performance. This leads to unclear independent results. Here we undertake a systematic survey of experiments with the TCD-TIMIT dataset using both conventional approaches and deep learning methods to provide a series of wholly speaker-independent benchmarks and show that the best speaker-independent machine scores 69.58% accuracy with CNN features and an SVM classifier. This is less than state-of-the-art speaker-dependent lipreading machines, but greater than previously reported in independence experiments.
机译:唇读是一项困难的手势分类任务。口语表达的一个问题是说话人的独立性。说话者独立性是指在训练集中不包含的测试说话者与训练集中的说话者达到相同的准确性。当前文献仅限于与说话者无关的唇读,通常将少数独立的测试说话者准确性分数汇总在相关的测试说话者准确性内,以获得平均性能。这导致不清楚的独立结果。在这里,我们使用传统方法和深度学习方法对TCD-TIMIT数据集进行的实验进行了系统的调查,以提供一系列完全独立于说话者的基准,并显示出最佳的独立于说话者的机器在CNN功能和SVM分类器。这比最先进的扬声器相关的唇读机要少,但比以前在独立性实验中报告的要大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号