首页> 外文学位 >A common representation for multimedia documents.
【24h】

A common representation for multimedia documents.

机译:多媒体文档的通用表示形式。

获取原文
获取原文并翻译 | 示例

摘要

Multimedia documents are composed of multiple file format combinations, such as image and text, image and sound, or image, text and sound. The type of multimedia document determines the form of analysis for knowledge architecture design and retrieval methods. Over the last few decades, theories of text analysis have been proposed and applied effectively. In recent years, theories of image and sound analysis have been proposed to work with text retrieval systems and progressed quickly due in part to rapid progress in computer processing speed. Retrieval of multimedia documents formerly was divided into the categories of image and text, and image and sound. While standard retrieval process begins from text only, methods are developing that allow the retrieval process to be accomplished simultaneously using text and image.; Although image processing for feature extraction and text processing for term extractions are well understood, there are no prior methods that can combine these two features into a single data structure. This dissertation will introduce a common representation format for multimedia documents (CRFMD) composed of both images and text.; For image and text analysis, two techniques are used: the Lorenz Information Measurement and the Word Code. A new process named Jeong's Transform is demonstrated for extraction of text and image features, combining the two previous measurements to form a single data structure. Finally, this single data structure is analyzed by using multi-dimensional scaling. This allows multimedia objects to be represented on a two-dimensional graph as vectors. The distance between vectors represents the magnitude of the difference between multimedia documents.; This study shows that image classification on a given test set is dramatically improved when text features are encoded together with image features. This effect appears to hold true even when the available text is diffused and is not uniform with the image features. This retrieval system works by representing a multimedia document as a single data structure. CRFMD is applicable to other areas of multimedia document retrieval and processing, such as medical image retrieval, World Wide Web searching, and museum collection retrieval.
机译:多媒体文档由多种文件格式组合组成,例如图像和文本,图像和声音或图像,文本和声音。多媒体文档的类型决定了知识体系结构设计和检索方法的分析形式。在过去的几十年中,文本分析理论已被提出并得到有效应用。近年来,已经提出了图像和声音分析的理论以与文本检索系统一起使用,并且由于计算机处理速度的快速进步而迅速发展。以前,多媒体文档的检索分为图像和文本以及图像和声音的类别。虽然标准检索过程仅从文本开始,但是正在开发一些方法,允许使用文本和图像同时完成检索过程。尽管已经很好地理解了用于特征提取的图像处理和用于术语提取的文本处理,但是没有现有的方法可以将这两个特征组合为单个数据结构。本文将介绍由图像和文本组成的多媒体文档(CRFMD)通用表示格式。对于图像和文本分析,使用了两种技术:Lorenz信息度量和单词代码。演示了一种名为Jeong's Transform的新过程,该过程用于提取文本和图像特征,将之前的两次测量结合起来形成一个数据结构。最后,通过使用多维缩放来分析此单个数据结构。这允许多媒体对象在二维图形上表示为矢量。向量之间的距离代表多媒体文档之间差异的大小。这项研究表明,当文本特征与图像特征一起编码时,给定测试集上的图像分类将得到显着改善。即使可用文本分散并且与图像特征不一致,该效果似乎仍然适用。该检索系统通过将多媒体文档表示为单个数据结构来工作。 CRFMD适用于多媒体文档检索和处理的其他领域,例如医学图像检索,万维网搜索和博物馆馆藏检索。

著录项

  • 作者

    Jeong, Ki Tai.;

  • 作者单位

    University of North Texas.;

  • 授予单位 University of North Texas.;
  • 学科 Information Science.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号