首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents
【24h】

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

机译:需求:具有交互代理的动态场景中的遥远未来预测

获取原文

摘要

We introduce a Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, future may vary), 2) foreseeing the potential future outcomes and make a strategic prediction based on that, and 3) reasoning not only from the past motion history, but also from the scene context as well as the interactions among the agents. DESIRE achieves these in a single end-to-end trainable neural network model, while being computationally efficient. The model first obtains a diverse set of hypothetical future prediction samples employing a conditional variational auto-encoder, which are ranked and refined by the following RNN scoring-regression module. Samples are scored by accounting for accumulated future rewards, which enables better long-term strategic decisions similar to IOC frameworks. An RNN scene context fusion module jointly captures past motion histories, the semantic scene context and interactions among multiple agents. A feedback mechanism iterates over the ranking and refinement to further boost the prediction accuracy. We evaluate our model on two publicly available datasets: KITTI and Stanford Drone Dataset. Our experiments show that the proposed model significantly improves the prediction accuracy compared to other baseline methods.
机译:我们引入了深度随机IOC RNN编码器-解码器框架DESIRE,用于未来在动态场景中预测多个交互代理的任务。 DESIRE通过以下方式有效地预测对象在多个场景中的未来位置:1)考虑到未来预测的多模态性质(即,在相同的上下文中,未来可能会有所不同),2)预见潜在的未来结果并基于策略进行战略预测3)不仅要根据过去的运动历史进行推理,而且还要根据场景上下文以及主体之间的交互进行推理。 DESIRE在单个端到端可训练神经网络模型中实现了这些目标,同时计算效率很高。该模型首先使用条件变分自动编码器获得一组假设的未来预测样本,这些样本将通过以下RNN评分-回归模块进行排名和完善。通过对累积的未来奖励进行核算来对样本进行评分,这可以实现类似于IOC框架的更好的长期战略决策。 RNN场景上下文融合模块共同捕获过去的运动历史,语义场景上下文以及多个代理之间的交互。反馈机制会对排名和细化进行迭代,以进一步提高预测准确性。我们在两个公开可用的数据集上评估我们的模型:KITTI和Stanford Drone Dataset。我们的实验表明,与其他基线方法相比,所提出的模型显着提高了预测准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号