首页> 外文会议>Conference on Neural Information Processing Systems >LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition
【24h】

LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition

机译:Liteeval:资源高效视频识别的粗略框架

获取原文

摘要

This paper presents LiteEval, a simple yet effective coarse-to-fine framework for resource efficient video recognition, suitable for both online and offline scenarios. Exploiting decent yet computationally efficient features derived at a coarse scale with a lightweight CNN model, LiteEval dynamically decides on-the-fly whether to compute more powerful features for incoming video frames at a finer scale to obtain more details. This is achieved by a coarse LSTM and a fine LSTM operating cooperatively, as well as a conditional gating module to learn when to allocate more computation. Extensive experiments are conducted on two large-scale video benchmarks, FCVID and ActivityNet, and the results demonstrate LiteEval requires substantially less computation while offering excellent classification accuracy for both online and offline predictions.
机译:本文提出了LiteeVal,一个简单但有效的粗略对资源视频识别的粗略框架,适用于在线和离线方案。 利用体面尚未使用轻量级CNN模型的粗略级别导出的计算上的尚效功能,LiteeVal是动态地决定是否在更精细的比例下计算更强大的功能,以获得更多细节。 这是通过粗略LSTM和精细的LSTM来实现的,以及有条件的门控模块,以学习何时分配更多计算。 广泛的实验是在两个大规模视频基准,FCVID和ActivityNet上进行的,并且结果证明了LiteeVal需要大量计算,同时为在线和离线预测提供出色的分类准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号