Source: Image and Signal Processing (foreign-language conference proceedings)

Classification of Multi-structured Documents: A Comparison Based on Media Image



Abstract

This paper addresses the structural comparison of multimedia documents. Most systems that process multimedia documents exploit only their textual part, although text is no longer the only means of carrying information. The main challenge is to extend these systems to other modalities, notably the image, which is one of the basic components of multimedia documents. Because multimedia documents are multi-structured in essence, their complexity calls for a structural representation in the form of graphs rather than trees alone. Graphs are well suited to describing these documents: for example, they can describe the components of an image scene, the relations between these components, their positions (spatial relations), and so on. We propose a new graph similarity measure based on a univocal matching between the graphs being compared. Our approach takes into account both structural information and the specificities of multimedia information. We evaluate the measure on a set of multi-structured documents drawn from the INEX 2007 corpus.
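
To make the idea of comparing scene graphs through a univocal (one-to-one) matching more concrete, here is a minimal Python sketch. It is not the measure proposed in the paper, which the abstract does not specify. The graph encoding (a dict of node labels plus a set of `(source, relation, target)` triples), the function name `graph_similarity`, the scoring rule, and the example scenes are all assumptions made for illustration. The brute-force search is only practical for very small graphs.

```python
from itertools import permutations


def graph_similarity(g1, g2):
    """Illustrative similarity in [0, 1] between two small labeled graphs.

    Each graph is a pair (nodes, edges), where nodes maps a node id to a
    label and edges is a set of (source_id, relation, target_id) triples.
    This scoring rule is an assumption, not the paper's measure.
    """
    nodes1, edges1 = g1
    nodes2, edges2 = g2
    # Put the smaller graph first so a one-to-one mapping always exists.
    if len(nodes1) > len(nodes2):
        nodes1, edges1, nodes2, edges2 = nodes2, edges2, nodes1, edges1

    # Normalize by the size (nodes + edges) of the larger graph.
    total = max(len(nodes1) + len(edges1), len(nodes2) + len(edges2))
    if total == 0:
        return 1.0  # two empty graphs are treated as identical

    ids1 = list(nodes1)
    best = 0
    # Try every univocal matching: each node of the smaller graph is
    # mapped to a distinct node of the larger graph.
    for image in permutations(nodes2, len(ids1)):
        mapping = dict(zip(ids1, image))
        # Matched nodes that carry the same label.
        node_hits = sum(nodes1[a] == nodes2[b] for a, b in mapping.items())
        # Edges preserved (same relation) under the mapping.
        edge_hits = sum(
            (mapping[s], rel, mapping[t]) in edges2 for s, rel, t in edges1
        )
        best = max(best, node_hits + edge_hits)
    return best / total


# Hypothetical image scenes: components plus spatial relations.
scene_a = (
    {1: "person", 2: "car", 3: "tree"},
    {(1, "left_of", 2), (2, "in_front_of", 3)},
)
scene_b = (
    {"a": "person", "b": "car"},
    {("a", "left_of", "b")},
)

score = graph_similarity(scene_a, scene_b)
```

In this toy example the best matching pairs the two "person" nodes and the two "car" nodes and preserves the single `left_of` relation, so 3 of the 5 elements of the larger scene are matched. A practical measure would replace the exhaustive search with a polynomial-time assignment or a heuristic, and would likely weight structural and multimedia features differently.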
