System and method for multi-modal fusion based fault-tolerant video content recognition

Abstract

A system and a method for multi-modal fusion based fault-tolerant video content recognition are disclosed. The method performs multi-modal recognition on an input video to extract multiple components and their respective appearance times in the video. The components are then categorized and recognized separately via different algorithms. When the recognition confidence of any component is insufficient, it is cross-validated against the other components to increase the recognition confidence and improve fault tolerance. In addition, when the recognition confidence of an individual component is insufficient, recognition continues and the component is tracked, spatially and temporally where applicable, until frames with high recognition confidence are reached within the continuous time period. Finally, multi-modal fusion summarizes the results, resolves any recognition discrepancies between the components, and generates indices for every time frame to ease future text-based queries.
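
The cross-validation and indexing steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Detection` record, the modality names, the 0.7 threshold, the fixed 0.2 confidence boost, and the one-second time frames are all assumptions chosen for illustration. A low-confidence detection gains confidence only when another modality reports the same label in an overlapping time span, and the index keeps only labels that end up above the threshold.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Detection:
    """One recognized component (hypothetical record layout)."""
    modality: str      # e.g. "face", "audio", "object" (illustrative names)
    label: str         # recognized label
    confidence: float  # recognition confidence in [0, 1]
    start: int         # first second in which the component appears
    end: int           # last second in which it appears (inclusive)


def overlaps(a, b):
    """Return True when the two detections share at least one second."""
    return a.start <= b.end and b.start <= a.end


def cross_validate(detections, threshold=0.7, boost=0.2):
    """Boost low-confidence detections that another modality agrees with.

    The fixed additive boost is an assumption; the source does not say
    how confidence is increased.
    """
    result = []
    for d in detections:
        conf = d.confidence
        if conf < threshold:
            supported = any(
                o.modality != d.modality
                and o.label == d.label
                and overlaps(d, o)
                for o in detections
            )
            if supported:
                conf = min(1.0, conf + boost)
        result.append(Detection(d.modality, d.label, conf, d.start, d.end))
    return result


def build_index(detections, threshold=0.7):
    """Map each second to the sorted labels whose confidence passes."""
    index = defaultdict(set)
    for d in detections:
        if d.confidence >= threshold:
            for t in range(d.start, d.end + 1):
                index[t].add(d.label)
    return {t: sorted(labels) for t, labels in sorted(index.items())}


if __name__ == "__main__":
    detections = [
        Detection("face", "alice", 0.55, 0, 2),   # weak, but audio agrees
        Detection("audio", "alice", 0.90, 1, 3),  # strong on its own
        Detection("object", "car", 0.40, 5, 6),   # weak, no support
    ]
    index = build_index(cross_validate(detections))
    for second, labels in index.items():
        print(second, labels)
```

In this example the weak face detection is kept because the audio modality confirms the same label, while the unsupported object detection never enters the index. A text query for a label then reduces to a lookup over the per-second entries.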

Bibliographic data

Publication number: US10013487B2
Publication date: 2018-07-03
Original format: PDF
Applicant/assignee: VISCOVERY PTE. LTD.
Application number: US201615007872
Inventors: YEN-CHENG CHEN; CHUN-CHIEH HUANG; KUO-DON HSI
Filing date: 2016-01-27
Classification: G06K9/00; G06F17/30; G06K9/62
Country: US
Database entry time: 2022-08-21 13:04:41
