首页> 外文学位 >A Distributed Information Model for Robust and Flexible Perceptual Interfaces.
【24h】

A Distributed Information Model for Robust and Flexible Perceptual Interfaces.

机译:健壮而灵活的感知接口的分布式信息模型。

获取原文
获取原文并翻译 | 示例

摘要

Modern artists turn to technology to express their ideas effectively and find in it a medium that goes beyond the limits of traditional pencil and paper. They seek tools that are natural to use and intuitive, but also need solutions that are accurate and controllable. Their demands on technology are often broad and conflicting. As researchers in computer graphics, we are expected to supply solutions supporting creative ideas from the early and crude stages of storyboarding to the demanding and precise requirements of final production. While the industry has already produced several mature applications that are routinely employed in complex projects, these tools are tedious to use and mostly inadequate to yield the true potential of individual artists. In fact, the current limits of technology are only compensated by large budgets and extended work forces. In recent years, the research community has shifted its focus to promising technologies based on machine perception, which are commonly known as perceptual interfaces.;In this thesis, I propose a new paradigm for perceptual interfaces as well as practical algorithms and theoretical concepts that are designed to tackle these key challenges. I begin by presenting several practical algorithms and perceptual technologies that aid the creative process at various stages of development in a fixed workflow. Subsequently, I extend these ideas by presenting a framework that enables a perceptual workflow to be configured dynamically around the needs of artists. To this end, I introduce a new distributed representation of complex information that organizes distinct levels of abstraction in a unified model and a reasoning system that can query, modify, and adapt this representation dynamically during an interactive session. This representation coupled with the reasoning system can configure new interaction modes on-the-fly, and lets users control the artistic process at the most appropriate level of abstraction, without sacrificing the degree of control that is often lost in traditional perceptual interfaces. As a distributed representation, this formalism is robust and fault tolerant, but most importantly can be used to pinpoint when perceptual algorithms fail to produce consistent results. When errors do occur, the reasoning system will not proceed by blind guesses, but will make an informed choice and pick the simplest interaction mode that can enable the artist to correct the error predictably and with precision. This approach is seeded by a small knowledge base that describes the assets, algorithms, and external tools that are available to the system. The knowledge base informs the reasoning system which in turn tracks the flow of information during an interactive session and specializes the knowledge base as needed. The system thus learns about the user's intent and the problem domain dynamically, while it is being used. Lastly, I present VisualDive, a full-fledged application that implements these ideas demonstrating how this new paradigm can be used in practice and in harmony with the rich ecosystem of other more traditional and well-respected software tools in computer graphics. Specifically, I show how VisualDive can enhance a workflow based on free-hand sketching to pose facial expressions. I conclude by outlining how these concepts lay the algorithmic foundations for tackling more traditional problems in computer vision that depend on robustness and require a higher degree of automation than in the context of computer graphics.;VisualDive is a large cross-platform application built on a flexible plug-in architecture and modern principles of software design. The main view of VisualDive revolves around a node-based interface similar to what found in other major graphics packages, but with profoundly different capabilities and purpose. Unlike traditional applications, the workflow in VisualDive begins with the intent of the user, and progresses through an interactive dialog that is tailored around the user's needs. The reasoning system in VisualDive orchestrates and configures both algorithms that are directly exposed to VisualDive through the plug-in architecture as well as external applications that can cooperate with VisualDive without modification. As a result, the user of VisualDive can easily access a large body of content creation tools blurring the lines between perceptual interfaces and traditional tools. Nonetheless, VisualDive successfully hides such considerable complexity by presenting stark innovation within the comfort and familiar conventions of graphical user interfaces. (Abstract shortened by UMI.)
机译:现代艺术家转向技术来有效地表达自己的想法,并在其中找到了一种超越传统铅笔和纸质极限的媒介。他们寻求使用自然且直观的工具,但还需要准确且可控的解决方案。他们对技术的需求通常是广泛且相互冲突的。作为计算机图形学的研究人员,我们期望提供解决方案,支持从情节提要的早期和原始阶段到最终产品的苛刻和精确要求的创意。尽管该行业已经产生了一些成熟的应用程序,这些应用程序通常用于复杂的项目,但是这些工具使用起来很繁琐,而且大多不足以产生单个艺术家的真正潜力。实际上,当前的技术限制只能通过庞大的预算和庞大的劳动力来弥补。近年来,研究界已将重点转移到基于机器感知的有前途的技术上,这些技术通常被称为感知接口。在本文中,我为感知接口以及实用的算法和理论概念提出了一种新的范例。旨在解决这些关键挑战。首先,我将介绍几种实用的算法和感知技术,这些算法和感知技术可以在固定工作流程的各个开发阶段帮助进行创意过程。随后,我通过提出一个框架来扩展这些想法,该框架使感知工作流能够围绕艺术家的需求进行动态配置。为此,我介绍了一种复杂信息的新的分布式表示形式,该表示形式在统一模型中组织了不同的抽象级别,以及一个推理系统,该系统可以在交互式会话期间动态地查询,修改和调整该表示形式。这种表示方法与推理系统相结合,可以动态配置新的交互模式,并允许用户以最合适的抽象级别控制艺术过程,而不会牺牲传统感知界面中经常丢失的控制程度。作为一种分布式表示形式,这种形式主义具有鲁棒性和容错性,但是最重要的是,可以用于确定何时感知算法无法产生一致的结果。当确实发生错误时,推理系统将不会盲目猜测,而是会做出明智的选择,并选择最简单的交互模式,使艺术家能够以可预测的方式准确纠正错误。这种方法是由一个小的知识库提供的,该知识库描述了系统可用的资产,算法和外部工具。知识库通知推理系统,该推理系统依次在交互式会话期间跟踪信息流,并根据需要专门化知识库。因此,系统在使用时动态地了解用户的意图和问题域。最后,我介绍了VisualDive,这是一个全面的应用程序,可实现这些思想,演示如何在实践中使用此新范例,并与计算机图形学中其他更传统且受人尊敬的软件工具的丰富生态系统协调使用。具体来说,我展示了VisualDive如何基于徒手绘制草图来构成面部表情的工作流。最后,我概述了这些概念如何为解决计算机视觉中较传统的问题奠定算法基础,这些传统问题依赖于鲁棒性并且需要比计算机图形学更高的自动化程度。VisualDive是一个大型跨平台应用程序,构建于灵活的插件架构和现代的软件设计原理。 VisualDive的主要视图围绕一个基于节点的界面,该界面与其他主要图形包中的界面相似,但功能和目的却大不相同。与传统的应用程序不同,VisualDive中的工作流程始于用户的意图,然后逐步进行根据用户需求量身定制的交互式对话框。 VisualDive中的推理系统协调并配置通过插件体系结构直接暴露给VisualDive的算法,以及可以与VisualDive进行协作而无需修改的外部应用程序。因此,VisualDive的用户可以轻松访问大量的内容创建工具,从而模糊了感知界面和传统工具之间的界限。尽管如此,VisualDive通过在图形用户界面的舒适性和熟悉的惯例内提出了鲜明的创新,成功地掩盖了相当大的复杂性。 (摘要由UMI缩短。)

著录项

  • 作者

    Nataneli, Gabriele.;

  • 作者单位

    University of California, Los Angeles.;

  • 授予单位 University of California, Los Angeles.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 189 p.
  • 总页数 189
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号