Deep video-to-video transformations for accessibility with an application to photosensitivity

Barbu Andrei; Banda Dalitso; Katz Boris

首页> 外文期刊>Pattern recognition letters >Deep video-to-video transformations for accessibility with an application to photosensitivity

【24h】

Deep video-to-video transformations for accessibility with an application to photosensitivity

机译：深度视频到视频转换，可访问应用到光敏性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We demonstrate how to construct a new class of visual assistive technologies that, rather than extract symbolic information, learn to transform the visual environment to make it more accessible. We do so without engineering which transformations are useful allowing for arbitrary modifications of the visual input. As an instantiation of this idea we tackle a problem that affects and hurts millions worldwide: photosensitivity. Any time an affected person opens a website, video, or some other medium that contains an adverse visual stimulus, either intended or unintended, they might experience a seizure with potentially significant consequences. We show how a deep network can learn a video-to-video transformation rendering such stimuli harmless while otherwise preserving the video. This approach uses a specification of the adverse phenomena, the forward transformation, to learn the inverse transformation. We show how such a network generalizes to real-world videos that have triggered numerous seizures, both by mistake and in politically-motivated attacks. A number of complimentary approaches are demonstrated including using a hand-crafted generator and a GAN using a differentiable perceptual metric. Such technology can be deployed offline to protect videos before they are shown or online with assistive glasses or real-time post processing. Other applications of this general technique include helping those with limited vision, attention deficit hyperactivity disorder, and autism. (C) 2019 Published by Elsevier B.V.

机译：我们展示了如何构建新类的视觉辅助技术，而不是提取符号信息，学会转换视觉环境以使其更可访问。我们这样做，没有工程，转换是有用的，允许任意修改视觉输入。作为这种想法的实例化，我们解决了一个影响和伤害全球数百万的问题：光敏性。任何受影响人员打开一个网站，视频或其他包含不良视觉刺激的其他媒体的时间，无论是意外还是意外，他们都可能会癫痫发作具有潜在的重大后果。我们展示了深度网络如何学习视频到视频转换，呈现此类刺激无害的，而在另外保留视频时。这种方法使用了不利现象，前向转换的规范来学习逆变换。我们展示了这种网络如何推广到现实世界的视频，这些视频既通过错误和政治动机攻击触发了众多癫痫发作。展示了许多互补方法，包括使用手工制作的发电机和GaN使用可微分的感知度量。此类技术可以在脱机中部署以保护视频在显示或在线显示辅助眼镜或实时后处理。这种通用技术的其他应用包括帮助视力有限，注意力缺陷多动障碍和自闭症。（c）2019年由elestvier b.v发布。

著录项

来源
《Pattern recognition letters》 |2020年第9期|99-107|共9页
作者
Barbu Andrei; Banda Dalitso; Katz Boris;
展开▼
作者单位

MIT Ctr Brains Minds & Machines 32 Vassar St Cambridge MA 02139 USA;

MIT Ctr Brains Minds & Machines 32 Vassar St Cambridge MA 02139 USA;

MIT Ctr Brains Minds & Machines 32 Vassar St Cambridge MA 02139 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Photosensitivity; Accessibility; Computer vision; Video-to-video transformation;

机译：光敏性;可访问性;计算机视觉;视频到视频转换;
入库时间 2022-08-18 21:28:45

相似文献

外文文献
中文文献
专利

1. Blockchain-enabled deep semantic video-to-video summarization for IoT devices [J] . Computers and Electrical Engineering . 2020,第期

机译：支持区块链的深度语义视频 - 视频到视频到视频摘要
2. Exploring job accessibility in the transformation context: an institutionalist approach and its application in Beijing [J] . Pengjun Zhao, Bin Lu Journal of Transport Geography . 2010,第3期

机译：探索转型背景下的工作可及性：一种制度主义的方法及其在北京的应用
3. Web 2.0 - End of Accessibility? Analysis of Most Common Problems with Web 2.0 Based Applications Regarding Web Accessibility [J] . Walter Kern International Journal of Public Information Systems . 2008,第2期

机译：Web 2.0-可访问性终止？基于Web 2.0的应用程序在Web可访问性方面最常见的问题分析
4. Continuous tone gray-scale photomasks based on photosensitive spin-on-glass technology for deep UV lithography applications [C] . E. A. Mendoza, F. A. Sigoli, H. Paulus, 23rd Annual BACUS Symposium on Photomask Technology . 2003

机译：基于光敏玻璃旋涂技术的连续色调灰度级光掩模，适用于深紫外光刻应用
5. Use of photosensitive metal-organic precursors to deposit metal-oxides for thin-film capacitor applications. [D] . Barstow, Sean J. 2003

机译：使用光敏性金属有机前体沉积金属氧化物以用于薄膜电容器应用。
6. 3D Printing of Amino Resin-based Photosensitive Materials on Multi-parameter Optimization Design for Vascular Engineering Applications [O] . Yung-Cheng Chiu, Yu-Fang Shen, Alvin Kai-Xing Lee, 2019

机译：基于氨基树脂的感光材料在血管工程应用的多参数优化设计中的3D打印
7. How to get the most from a business intelligence application during the post implementation phase? Deep structure transformation at a U.K. retail bank [O] . Alena Audzeyeva, Robert Hudson 2016

机译：如何在开发阶段期间从商业智能应用中获得最大的应用程序？ U.K.零售银行深度结构转型
8. Groupings of Organic Waste Chemicals Based on Sorption, Biotransformation, and Hydrolysis at Standard Conditions for Application to the Deep Subsurface Environment. [R] . Phillips, S. L., Hale, F. V., Tsang, C. F. 1988

机译：基于吸附，生物转化和水解的有机废物化学品分组在标准条件下应用于深地下环境。

Deep video-to-video transformations for accessibility with an application to photosensitivity

摘要

著录项

相似文献

相关主题

期刊订阅