Editing like Humans: A Contextual, Multimodal Framework for Automated Video Editing

机译：像人类一样编辑：一个用于自动视频编辑的语境，多模式框架

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose an automated video editing model, which we term contextual and multimodal video editing (CMVE). The model leverages visual and textual metadata describing videos, integrating essential information from both modalities, and uses a learned editing style from a single example video to coherently combine clips. The editing model is useful for tasks such as generating news clip montages and highlight reels given a text query that describes the video storyline. The model exploits the perceptual similarity between video frames, objects in videos and text descriptions to emulate coherent video editing. Amazon Mechanical Turk participants made judgements comparing CMVE to expert human editing. Experimental results showed no significant difference in the CMVE vs human edited video in terms of matching the text query and the level of interest each generates, suggesting CMVE is able to effectively integrate semantic information across visual and textual modalities and create perceptually coherent quality videos typical of human video editors. We publicly release an online demonstration of our method.

机译：我们提出了一种自动视频编辑模型，我们术语上下文和多模式编辑（CMVE）。该模型利用了视觉和文本元数据描述视频，从两个模态集成基本信息，并使用从单个示例视频中获取的学习编辑样式来连贯地组合剪辑。编辑模型对任务非常有用，例如生成新闻剪辑蒙太奇，并且突出显示卷轴给定描述视频故事情节的文本查询。该模型利用视频帧之间的感知相似性，视频中的对象和文本描述以模拟相干视频编辑。亚马逊机械土耳其参与者使CMVE与专家编辑进行比较。实验结果表明，在匹配文本查询和每个生成的利益水平方面，CMVE VS人类编辑视频没有显着差异，暗示CMVE能够通过视觉和文本方式有效地集成语义信息，并创造典型的感知相干的质量视频人类视频编辑器。我们公开发布我们的方法的在线演示。

著录项

来源
《IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops》|2021年|1701-1709|共9页
会议地点
作者
Sharath Koorathota; Patrick Adelman; Kelly Cotton; Paul Sajda;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Training; Visualization; Computer vision; Conferences; Semantics; Metadata; Pattern recognition;

机译：培训;可视化;计算机愿景;会议;语义;元数据;模式识别;

相似文献

外文文献
中文文献
专利

1. De/Contextualizing Information: The Digitization of Video Editing Practices at the BBC [J] . Marton Attila, Mariategui Jose-Carlos The Information Society . 2015,第2期

机译：去信息化/信息化：BBC的视频编辑实践数字化
2. Videoshop: A new framework for spatio-temporal video editing in gradient domain [J] . Wang HC, Xu N, Raskar R, Graphical models . 2007,第1期

机译：Videoshop：渐变域中时空视频编辑的新框架
3. Optimization-based automated home video editing system [J] . Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang IEEE Transactions on Circuits and Systems for Video Technology . 2004,第5期

机译：基于优化的自动家庭视频编辑系统
4. Automated Image and Video Quality Assessment for Computational Video Editing [C] . Konstantin Lomotin, Ilya Makarov Internatinal Conference on Analysis of Images, Social Networks and Texts . 2020

机译：计算视频编辑的自动图像和视频质量评估
5. Opening Chromatin and Improving CRISPR/Cas9 Editing How is CRISPR Mediated Editing Influenced by Artificially-opened Chromatin in Human Cells? [D] . Hamna, Syeda Fatima 2019

机译：打开染色质并改善CRISPR / Cas9编辑CRISPR介导的编辑如何受到人细胞中人工打开的染色质的影响？
6. A Scaled Framework for CRISPR Editing of Human Pluripotent Stem Cells to Study Psychiatric Disease [O] . Dane Z. Hazelbaker, Amanda Beccard, Anne M. Bara, 2017

机译：用于人类多能干细胞CRISPR编辑以研究精神疾病的规模化框架
7. Future of global regulation of human genome editing: a South African perspective on the WHO Draft Governance Framework on Human Genome Editing [O] . Bonginkosi Shozi, Tamanda Kamwendo, Julian Kinderlerer, 2021

机译：全球对人类基因组编辑的未来：南非对人类基因组编辑治理框架的南非观点

Editing like Humans: A Contextual, Multimodal Framework for Automated Video Editing

摘要

著录项

相似文献

相关主题

期刊订阅