Representation, matching, and classification of multi-sequence shapes.

机译：多序列形状的表示，匹配和分类。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In computer vision research, shape is usually referred to the geometric properties of objects and it is very important in detecting and recognizing objects. Shape representation, the data structure to describe the shape, is the basis for shape analysis. Graph, due to its power in representing various relationship among shape parts, is frequently used in describing part-based 2D shapes. The flexibility of graphs, however, results in difficulties in matching and recognition of graphic shapes, especially when the number of edges is large because the matching of the edges usually involves the matching of the two nodes it connects. In many graphical shape representation methods, however, the number of edges will grow quadratically with respect to the number of nodes. That is partly the reason why recent research on shape analysis often deal with graphs with a limited number of nodes, or with edges abandoned.;In order to keep more shape details to improve the performance of shape recognition, graphs with larger number of nodes should be applied. This motivates us to design a shape representation method that can keep the connectivity linear to the number of nodes while most of the shape details are kept. For images where edges can be reasonably detected, we are arguing that images' intensity edges are the right connectivity information that should be kept. A new shape representation, multi-sequence shape (or MSS), is proposed. The matching, classification and indexing algorithms of MSS are also proposed and experiments are conducted to demonstrate the advantage of our shape representation method. In our representation method, the number of edges of graphs is linear to the number of nodes. The time complexity of our matching algorithms are O(N2 ) where N is the number of nodes. We will also show that our representation method is suitable for indexing so that retrieving a shape from a 60,000 shape database only takes a few milliseconds using a pre-computed index.;Our method could match and retrieve shapes very fast and accurately, based on three contributions of this dissertation. We proposed an efficient partial matching algorithm to find the most similar parts of two sequential shapes, which is the base of our proposed shape representation. Our algorithm does not need to exhaustively search all possible pairs of subsequences. Instead, we use a dynamic programming algorithm (due to Smith Waterman) to find the most similar parts efficiently The complexity of our method to find similar parts of two contours of length m and n, is only O(m · n). We compared our matching algorithm with other methods using contour shape models. Empirical analysis shows that our approach is about 10 to 20 times faster in rotation-invariant recognition because our method does not need to search all possible rotations to find the optimal orientation. Because our matching algorithm only extract similar parts of two contours, shape variance, such as occlusion, can be better tolerated and hence our algorithm is more robust to shape variance. In contrast to arbitrary distance functions that are used by previous methods, we use a probabilistic similarity measurement, p-value, to evaluate the similarity of two shapes. We conducted experiments on several public shape databases and the result indicates that our method outperforms state-of-the-art global and partial shape matching algorithms.;Based on the partial sequence shape matching algorithm, we propose a new shape representation, the multi-sequence shape, to represent shapes in both binary images and realistic images. We propose a matching algorithm for multi-sequence shapes and an algorithm that could recognize multi-sequence shapes from cluttered background as well. We are arguing that the proposed shape representation method, compared to traditional graphical shape models, such as contour model, sparse graph model, and dense graph model, is straightforward in nature and less complicated in representation and matching whereas contains sufficient shape details for common shape recognition tasks.;A structured learning algorithm is also proposed to improve the shape classification. Traditional methods for shape classification involve the establishment of point correspondences between shapes to produce matching scores, which are in turn used as similarity measures for classification. Learning techniques have been applied only in the second stage of this process, after the matching scores have been obtained. We take a different approach by learning point-to-point matching measures to produce similarity scores that minimize the classification loss. Instead of simply taking for granted the scores obtained by matching and then learning a classifier, we embed both matching and classification together within single machine learning scheme that optimizes the shape classification accuracy. The solution is based on a max-margin formulation in the structured prediction setting. Experiments in several shape databases reveal that such integrated learning algorithm substantially improves the classification accuracy of existing methods.

机译：在计算机视觉研究中，形状通常是指对象的几何特性，在检测和识别对象中非常重要。形状表示（描述形状的数据结构）是形状分析的基础。由于图形具有表示形状零件之间各种关系的功能，因此经常用于描述基于零件的2D形状。然而，图的灵活性导致难以匹配和识别图形形状，特别是当边缘的数量很大时，因为边缘的匹配通常涉及其连接的两个节点的匹配。但是，在许多图形形状表示方法中，边的数量相对于节点的数量将平方增加。这部分是为什么最近对形状分析的研究经常处理节点数量有限或边缘被遗弃的图的原因;为了保留更多形状细节以提高形状识别的性能，应使用节点数量更多的图被应用。这激励我们设计一种形状表示方法，该方法可以在保持大多数形状细节的同时，使连通性与节点数呈线性关系。对于可以合理检测边缘的图像，我们认为图像的强度边缘是应保留的正确连接性信息。提出了一种新的形状表示形式，即多序列形状（MSS）。提出了MSS的匹配，分类和索引算法，并通过实验证明了形状表示方法的优点。在我们的表示方法中，图的边数与节点数成线性关系。我们的匹配算法的时间复杂度为O（N2），其中N是节点数。我们还将展示我们的表示方法适用于索引编制，因此使用预先计算的索引从60,000个形状数据库中检索形状仅需要几毫秒的时间;基于三种方法，我们的方法可以非常快速，准确地匹配和检索形状论文的贡献。我们提出了一种有效的局部匹配算法，以找到两个连续形状的最相似部分，这是我们提出的形状表示的基础。我们的算法不需要穷举搜索所有可能的子序列对。取而代之的是，我们使用动态编程算法（由于Smith Waterman）有效地找到了最相似的部分。找到长度为m和n的两个轮廓的相似部分的方法的复杂度仅为O（m·n）。我们将匹配算法与使用轮廓形状模型的其他方法进行了比较。经验分析表明，由于我们的方法不需要搜索所有可能的旋转来找到最佳方向，因此在旋转不变识别中，该方法的速度要快10到20倍。由于我们的匹配算法仅提取两个轮廓的相似部分，因此可以更好地容忍形状方差（例如遮挡），因此我们的算法对形状方差更健壮。与以前的方法所使用的任意距离函数相比，我们使用概率相似性度量（p值）来评估两个形状的相似性。我们在多个公共形状数据库上进行了实验，结果表明我们的方法优于最新的全局和部分形状匹配算法。；在部分序列形状匹配算法的基础上，我们提出了一种新的形状表示方法，即序列形状，以表示二进制图像和真实图像中的形状。我们提出了一种针对多序列形状的匹配算法，以及一种可以从杂乱的背景中识别多序列形状的算法。我们认为与传统的图形形状模型（例如轮廓模型，稀疏图形模型和密集图形模型）相比，所提出的形状表示方法本质上简单明了，表示和匹配不那么复杂，但包含足够的形状细节以用于常见形状识别任务。;还提出了一种结构化学习算法，以改善形状分类。传统的形状分类方法包括在形状之间建立点对应关系以产生匹配分数，然后将其用作分类的相似性度量。在获得匹配分数之后，仅在该过程的第二阶段应用了学习技术。我们通过学习点对点匹配方法来采用不同的方法，以产生将分类损失最小化的相似性评分。而不是简单地认为通过匹配然后学习分类器获得的分数，我们将匹配和分类同时嵌入单个机器学习方案中，以优化形状分类的准确性。该解决方案基于结构化预测设置中的最大利润率公式。在几个形状数据库中的实验表明，这种集成学习算法大大提高了现有方法的分类精度。

著录项

作者
Chen, Longbin.;
展开▼
作者单位

University of California, Santa Barbara.;

展开▼
授予单位 University of California, Santa Barbara.;
学科 Computer Science.
学位 Ph.D.
年度 2010
页码 155 p.
总页数 155
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Functional classification of protein structures by local structure matching in graph representation [J] . Mills Caitlyn L., Garg Rohan, Lee Joslynn S., Protein Science: A Publication of the Protein Society . 2018,第6期

机译：局部结构匹配在图形表示中的蛋白质结构的功能分类
2. Learning visual and textual representations for multimodal matching and classification [J] . Liu Yu, Liu Li, Guo Yanming, Pattern Recognition: The Journal of the Pattern Recognition Society . 2018,第期

机译：学习多模式匹配和分类的视觉和文本表示
3. Efficient image classification via sparse coding spatial pyramid matching representation of SIFT-WCS-LTP feature [J] . Huang Mingming, Mu Zhichun, Zeng Hui Image Processing, IET . 2016,第1期

机译：通过稀疏编码空间金字塔匹配表示法对SIFT-WCS-LTP特征进行有效的图像分类
4. Brain Tumor Detection and Classification from Multi-sequence MRI: Study Using ConvNets [C] . Subhashis Banerjee, Sushmita Mitra, Francesco Masulli, International Workshop on Brainlesion . 2019

机译：多序MRI的脑肿瘤检测和分类：使用COMMNETS研究
5. Matching dissimilar shapes. [D] . van Kaick, Oliver. 2011

机译：匹配不同的形状。
6. Functional classification of protein structures by local structure matching in graph representation [O] . Caitlyn L. Mills, Rohan Garg, Joslynn S. Lee, 2018

机译：通过图形表示中的局部结构匹配对蛋白质结构进行功能分类
7. Weighted Spatial Pyramid Matching Collaborative Representation for Remote-Sensing-Image Scene Classification [O] . Bao-Di Liu, Jie Meng, Wen-Yang Xie, 2019

机译：遥感图像场景分类的加权空间金字塔匹配协作表示

Representation, matching, and classification of multi-sequence shapes.

摘要

著录项

相似文献

相关主题

期刊订阅