首页> 外文会议>IEEE International Conference on Image Processing >Semicca: A new semi-supervised probabilistic CCA model for keyword spotting
【24h】

Semicca: A new semi-supervised probabilistic CCA model for keyword spotting

机译:Semarcca:一种新的半监督概率CCA模型,用于关键字斑点

获取原文

摘要

In this paper we present a semi-supervised, attribute-based model suitable for keyword spotting (KWS) in document images. Our model can take advantage of available non-annotated segmented word images, as well as string annotations without a matching word image. We build our model by extending on the probabilistic interpretation of Canonical Correlation Analysis (CCA), solved using Expectation-Maximization (EM). On test-time, we back-project the query and database images to the embedded space by calculating the embedding space posterior density given the observations. Keyword spotting is then efficiently performed by computing query nearest neighbours in the embedded Euclidean space. We validate that our model offers superior performance given the presence of partially-labelled data, with keyword spotting trials on the Bentham and George Washington datasets.
机译:在本文中,我们在文档图像中介绍了一个适用于关键字点的基于半监督的属性的模型。我们的模型可以利用可用的非注释分段字图像,以及没有匹配字图像的字符串注释。我们通过延长规范相关分析(CCA)的概率解释来构建我们的模型,使用期望最大化(EM)解决。在测试时,我们通过计算观察的嵌入空间后密度来将查询和数据库图像重新投影到嵌入式空间。然后通过计算嵌入式欧几里德空间中的查询最近邻居来有效地执行关键字发现。我们验证了我们的模型提供了卓越的性能,因为存在部分标记的数据,在Bentham和George Washington数据集上有关键字发现试验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号