首页> 外国专利> Method and apparatus for inducing classifiers for multimedia based on unified representation of features reflecting disparate modalities

Method and apparatus for inducing classifiers for multimedia based on unified representation of features reflecting disparate modalities

机译：基于反映不同模态的特征的统一表示来归纳多媒体分类器的方法和设备

页面导航

摘要
著录项
相似文献

摘要

This invention is a system and method to perform categorization (classification) of multimedia items. These items are comprised of a multitude of disparate information sources, in particular, visual information and textual information. Classifiers are induced based on combining textual and visual feature vectors. Textual features are the traditional ones, such as, word count vectors. Visual features include, but are not limited to, color properties of key intervals and motion properties of key intervals. The visual feature vectors are determined in such a fashion that the vectors are sparse. The vector components are features such as the absence or presence of the color green in spatial regions and the absence or the amount of visual flow in spatial regions of the media items. The text and the visual representation vectors are combined in a systematic and coherent fashion. This vector representation of a media item lends itself to well-established learning techniques. The resulting system, subject of this invention, categorizes (or classifies) media items based both on textual features and visual features.

机译：本发明是一种执行多媒体项目的分类（分类）的系统和方法。这些项目由大量不同的信息源组成，尤其是视觉信息和文本信息。基于组合文本和视觉特征向量来归纳分类器。文本特征是传统特征，例如字数向量。视觉特征包括但不限于键间隔的颜色属性和键间隔的运动属性。视觉特征向量以稀疏的方式确定。矢量分量具有诸如在空间区域中不存在绿色或在媒体项目的空间区域中不存在视觉流或视觉流的数量这样的特征。文本和视觉表示向量以系统且连贯的方式组合在一起。媒体项目的这种矢量表示使其适用于成熟的学习技术。作为本发明的主题的所得系统基于媒体特征和视觉特征对媒体项目进行分类（或分类）。

著录项

公开/公告号US6892193B2

专利类型
公开/公告日2005-05-10

原文格式PDF
申请/专利权人 RUDOLF M. BOLLE;NORMAN HAAS;FRANK J. OLES;TONG ZHANG;
展开▼

申请/专利号US20010853191
发明设计人 NORMAN HAAS;FRANK J. OLES;TONG ZHANG;RUDOLF M. BOLLE;
展开▼

申请日2001-05-10
分类号G06F7/00;G06F15/00;
国家 US
入库时间 2022-08-21 22:19:03

相似文献

专利
外文文献
中文文献