2-D object recognition by indexing through a modified ART-2 neural network

J. Humberto Sossa, P. Rayón Villela, J. Figueroa Nazuno

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

Resumen

A technique for the recognition of possibly occluded planar objects from a single image in the presence of projective deformations when the camera-plane is not perpendicular to the object-plane is presented. The technique is divided into two phases. During the first phase (database construction), an object is first decomposed into a set primitives called metasegments. These are groups of consecutive segments obtained from the corners of the object's contour coded by three geometric invariants: the type, and the two four and the five point-dependent affine/projective invariants. Each resulting code is then used to build the corresponding database of models. It is composed of a standard ART-2 NN connected to a Memory Map (MM), a set of logical AND gates, an evidence-register (an adder) and a set of comparators. The ART-2 NN has as input the code of a metasegment and a number of outputs equal to the number of different metasegments trained to the NN. The Memory Map has as many rows as outputs provided by the ART-2 NN and as many columns as objects used to train the NN. Each of the MM's locations contains a value. This value represents the number of metasegments present in each trained object. The indexing phase is divided in two stages: candidate selection and candidate reduction. During candidate selection, an image containing one or more possibly occluded objects is first preprocessed to obtain the corresponding contours. Each contour is then decomposed into its metasegments and coded as explained above. Each code is next used by the trained ART-2 NN to retrieve from the previously constructed Memory Map the list of objects that had possibly produced the corresponding metasegment. At the end of this process, we will have in an evidence-register the number of times an object was voted for during candidate selection. A selection-threshold is finally used, during the candidate reduction stage, to select those objects most possibly present in the test image. The system's performance is tested with a set of polygonal objects.

Idioma originalInglés
Páginas (desde-hasta)199-210
Número de páginas12
PublicaciónExpert Systems with Applications
Volumen14
N.º1-2
DOI
EstadoPublicada - 1998

Huella

Profundice en los temas de investigación de '2-D object recognition by indexing through a modified ART-2 neural network'. En conjunto forman una huella única.

Citar esto