首页> 外文期刊>Pattern recognition letters >Deep video-to-video transformations for accessibility with an application to photosensitivity
【24h】

Deep video-to-video transformations for accessibility with an application to photosensitivity

机译:深度视频到视频转换,可访问应用到光敏性

获取原文
获取原文并翻译 | 示例
       

摘要

We demonstrate how to construct a new class of visual assistive technologies that, rather than extract symbolic information, learn to transform the visual environment to make it more accessible. We do so without engineering which transformations are useful allowing for arbitrary modifications of the visual input. As an instantiation of this idea we tackle a problem that affects and hurts millions worldwide: photosensitivity. Any time an affected person opens a website, video, or some other medium that contains an adverse visual stimulus, either intended or unintended, they might experience a seizure with potentially significant consequences. We show how a deep network can learn a video-to-video transformation rendering such stimuli harmless while otherwise preserving the video. This approach uses a specification of the adverse phenomena, the forward transformation, to learn the inverse transformation. We show how such a network generalizes to real-world videos that have triggered numerous seizures, both by mistake and in politically-motivated attacks. A number of complimentary approaches are demonstrated including using a hand-crafted generator and a GAN using a differentiable perceptual metric. Such technology can be deployed offline to protect videos before they are shown or online with assistive glasses or real-time post processing. Other applications of this general technique include helping those with limited vision, attention deficit hyperactivity disorder, and autism. (C) 2019 Published by Elsevier B.V.
机译:我们展示了如何构建新类的视觉辅助技术,而不是提取符号信息,学会转换视觉环境以使其更可访问。我们这样做,没有工程,转换是有用的,允许任意修改视觉输入。作为这种想法的实例化,我们解决了一个影响和伤害全球数百万的问题:光敏性。任何受影响人员打开一个网站,视频或其他包含不良视觉刺激的其他媒体的时间,无论是意外还是意外,他们都可能会癫痫发作具有潜在的重大后果。我们展示了深度网络如何学习视频到视频转换,呈现此类刺激无害的,而在另外保留视频时。这种方法使用了不利现象,前向转换的规范来学习逆变换。我们展示了这种网络如何推广到现实世界的视频,这些视频既通过错误和政治动机攻击触发了众多癫痫发作。展示了许多互补方法,包括使用手工制作的发电机和GaN使用可微分的感知度量。此类技术可以在脱机中部署以保护视频在显示或在线显示辅助眼镜或实时后处理。这种通用技术的其他应用包括帮助视力有限,注意力缺陷多动障碍和自闭症。 (c)2019年由elestvier b.v发布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号