A common representation for multimedia documents.

Abstract

Multimedia documents combine multiple file formats, such as image and text, image and sound, or image, text, and sound. The type of multimedia document determines the form of analysis for knowledge architecture design and retrieval methods. Over the last few decades, theories of text analysis have been proposed and applied effectively. In recent years, theories of image and sound analysis have been proposed to work with text retrieval systems and have progressed quickly, due in part to rapid progress in computer processing speed. Retrieval of multimedia documents was formerly divided into the categories of image and text, and image and sound. While the standard retrieval process begins from text only, methods are being developed that allow retrieval to be accomplished using text and image simultaneously.

Although image processing for feature extraction and text processing for term extraction are well understood, there are no prior methods that can combine these two kinds of features into a single data structure. This dissertation introduces a common representation format for multimedia documents (CRFMD) composed of both images and text.

For image and text analysis, two techniques are used: the Lorenz Information Measurement and the Word Code. A new process named Jeong's Transform is demonstrated for extraction of text and image features, combining the two previous measurements to form a single data structure. Finally, this single data structure is analyzed by using multi-dimensional scaling. This allows multimedia objects to be represented on a two-dimensional graph as vectors. The distance between vectors represents the magnitude of the difference between multimedia documents.

This study shows that image classification on a given test set is dramatically improved when text features are encoded together with image features. This effect appears to hold true even when the available text is diffused and is not uniform with the image features. The retrieval system works by representing a multimedia document as a single data structure. CRFMD is applicable to other areas of multimedia document retrieval and processing, such as medical image retrieval, World Wide Web searching, and museum collection retrieval.
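The multi-dimensional scaling step described in the abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation: the abstract does not define Jeong's Transform, the Lorenz Information Measurement, or the Word Code, so the example simply concatenates made-up image and text feature vectors as a stand-in for the single data structure, and it uses classical (Torgerson) MDS as one common variant of multi-dimensional scaling. The function name `classical_mds` and all feature values are hypothetical.

```python
import numpy as np

def classical_mds(features, n_components=2):
    """Embed row vectors in n_components dimensions with classical (Torgerson) MDS."""
    X = np.asarray(features, dtype=float)
    # Pairwise squared Euclidean distances between documents.
    diff = X[:, None, :] - X[None, :, :]
    d2 = (diff ** 2).sum(axis=-1)
    n = d2.shape[0]
    # Double-centre the squared distances to obtain a Gram matrix.
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ d2 @ J
    # Keep the eigenvectors belonging to the largest eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(B)
    order = np.argsort(eigvals)[::-1][:n_components]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale

# Hypothetical documents: each row pairs made-up image features with
# made-up text features. Concatenation is an assumption standing in for
# the combined data structure; it is NOT Jeong's Transform.
image_features = np.array([[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.7]])
text_features = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
combined = np.hstack([image_features, text_features])

# One 2-D point per document.
coords = classical_mds(combined)
# Distances between the 2-D points approximate document dissimilarity.
dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
```

In this toy data, documents 0 and 1 share similar features and land close together on the two-dimensional plot, while documents 0 and 2 land far apart, which mirrors the abstract's statement that distance between vectors reflects the magnitude of the difference between documents.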

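Bibliographic record

Author: Jeong, Ki Tai
Author affiliation: University of North Texas
Degree-granting institution: University of North Texas
Subject: Information Science
Degree: Ph.D.
Year: 2002
Pages: 113 p.
Total pages: 113
Original format: PDF
Language of text: English
Chinese Library Classification: Information and knowledge dissemination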
