Stereo vision is a method of depth perception, in which depth information is inferred from two (or more) images of a scene, taken from different perspectives. Practical applications for stereo vision include aerial photogrammetry, autonomous vehicle guidance, robotics and industrial automation. The initial motivation behind this work was to produce a stereo vision sensor for mining automation applications. For such applications, the input stereo images would consist of close range scenes of rocks.ududA fundamental problem faced by matching algorithms is the matching or correspondence problem. This problem involves locating corresponding points or features in two images. For this application, speed, reliability, and the ability to produce a dense depth map are of foremost importance. This work implemented a number of areabased matching algorithms to assess their suitability for this application. Area-based techniques were investigated because of their potential to yield dense depth maps, their amenability to fast hardware implementation, and their suitability to textured scenes such as rocks. In addition, two non-parametric transforms, the rank and census, were also compared. Both the rank and the census transforms were found to result in improved reliability of matching in the presence of radiometric distortion - significant since radiometric distortion is a problem which commonly arises in practice.ududIn addition, they have low computational complexity, making them amenable to fast hardware implementation. Therefore, it was decided that matching algorithms using these transforms would be the subject of the remainder of the thesis.ududAn analytic expression for the process of matching using the rank transform was derived from first principles. This work resulted in a number of important contributions.ududFirstly, the derivation process resulted in one constraint which must be satisfied for a correct match. This was termed the rank constraint. The theoretical derivation of this constraint is in contrast to the existing matching constraints which have little theoretical basis. Experimental work with actual and contrived stereo pairs has shown that the new constraint is capable of resolving ambiguous matches, thereby improving match reliability. Secondly, a novel matching algorithm incorporating the rank constraint has been proposed. This algorithm was tested using a number of stereo pairs.ududIn all cases, the modified algorithm consistently resulted in an increased proportion of correct matches. Finally, the rank constraint was used to devise a new method for identifying regions of an image where the rank transform, and hence matching, are more susceptible to noise.ududThe rank constraint was also incorporated into a new hybrid matching algorithm, where it was combined a number of other ideas. These included the use of an image pyramid for match prediction, and a method of edge localisation to improve match accuracy in the vicinity of edges. Experimental results obtained from the new algorithm showed that the algorithm is able to remove a large proportion of invalid matches, and improve match accuracy.
展开▼
机译:立体视觉是一种深度感知的方法,其中深度信息是从一个场景的两个(或多个)图像中以不同的角度得出的。立体视觉的实际应用包括航空摄影测量,自动驾驶制导,机器人技术和工业自动化。这项工作的最初动机是生产用于采矿自动化应用的立体视觉传感器。对于此类应用,输入的立体图像将由岩石的近距离场景组成。 ud ud匹配算法面临的基本问题是匹配或对应问题。此问题涉及在两个图像中定位相应的点或特征。对于此应用程序,速度,可靠性以及生成密集深度图的能力至关重要。这项工作实现了许多基于区域的匹配算法,以评估它们对于该应用程序的适用性。对基于区域的技术进行了研究,因为它们具有生成密集深度图的潜力,对快速硬件实施的适应性以及对诸如岩石等纹理场景的适用性。此外,还比较了两个非参数转换,即秩和普查。发现在存在辐射计量失真的情况下,秩变换和普查变换均可提高匹配的可靠性-这很重要,因为在实际中通常会出现辐射计量失真这一问题。 ud ud此外,它们的计算复杂度较低,因此适用于快速的硬件实施。因此,决定使用这些变换的匹配算法将是本文的其余部分。 ud ud从秩和变换的解析过程的解析表达式是从第一原理推导而来的。这项工作产生了许多重要的贡献。 ud ud首先,推导过程产生了一个约束,对于正确匹配必须满足该约束。这被称为等级约束。该约束的理论推导与现有的匹配约束相反,后者现有的理论基础很少。使用实际的和人为的立体声对进行的实验工作表明,新约束能够解决歧义匹配,从而提高了匹配可靠性。其次,提出了一种新的结合秩约束的匹配算法。使用许多立体声对测试了该算法。 ud ud在所有情况下,经过改进的算法始终会导致正确匹配比例的增加。最后,使用秩约束来设计一种新方法,用于识别图像的区域,在这些区域中,秩变换和匹配因此更容易受到噪声的影响。 ud ud也将秩约束并入了新的混合匹配算法中,其中它结合了许多其他想法。这些措施包括将图像金字塔用于匹配预测,以及边缘定位方法来提高边缘附近的匹配精度。从新算法获得的实验结果表明,该算法能够去除大部分无效匹配,并提高了匹配精度。
展开▼