Medial measures for recognition, mapping and categorizationVisual shape analysis plays a fundamental role in perception by man and by computer, allowing for inferences about properties of objects and scenes in the physical world. Mathematical approaches to describing visual form can benefit from the use of representations that simultaneously capture properties of an object's outline as well as its interior. Motivated by the success of medial models, this doctoral thesis revisits a quantity related to medial axis computations, the average outward flux of the gradient of the Euclidean distance function from a boundary, and then addresses three distinct problems using this measure. First, I consider the problem of view sphere partitioning for view-based object recognition from sparse views. View-based 3D object recognition requires a selection of model object views against which to match a query view. Ideally, for this to be computationally efficient, such a selection should be sparse. To address this problem, I introduce a novel hierarchical partitioning of the view sphere into regions within which the silhouette of a model object is qualitatively unchanged. To achieve this, I propose a part-based abstraction of a skeleton, as a graph, dubbed the Flux Graph, which allows for views to be grouped. Next, I consider the problem of mapping an initially-unknown 2D environment from possibly noisy sensed samples via an on-line procedure which robustly computes a retraction of its boundaries to obtain a topological representation. Here I motto an algorithm that allows for online map construction with loop closure. I demonstrate that the proposed method allows the robot to localize itself on a partially constructed map to calculate a path to unexplored parts of the environment (frontiers), to compute a robust terminating condition when the robot has fully explored the environment, and finally to achieve loop closure detection. I also show that the resulting map is stable under disturbances to the sensed boundary, and to variations in starting locations for exploration. Finally, I consider the problem of scene categorization from complex line drawings. In the context of human vision, we show that local ribbon symmetry between neighboring pairs of contours facilitates the categorization of complex real-world environments by human observers. In the context of computer vision, I demonstrate a high level of performance in the problem of convolutional neural network-based recognition of natural scenes from line drawings, even in the absence of color, texture and shading information. |
Common terms and phrases
3D object algorithm analysis appear applied approach Artist aspect average boundary branch Chapter cluster complex Computer Vision condition Conference connected consider context contour corresponding curve database demonstrate detection developed direction discussion distance distribution edge environment environment mapping example experiments exploration Figure finding flux graph function geometry given graph matching grouping human IEEE illustrated inner input intact International Journal line drawings machine matching McGill University medial axis method model views natural nodes object observers outward path Pattern perception performance photographs pixels present problem proposed provides radius recognition regions representation represents ribbon symmetry robot salience measures sampling scene categorization scene contours segments selected separation shape shock graph shown Siddiqi side silhouettes similarity skeletal points skeleton strategy structure symmetry score Taper task topological view sphere partitions visual weighted