Reconocimiento robusto de lugares mediante redes neuronales convolucionales

Omar E. Lugo Sánchez; Humberto Sossa; Erik Zamora

doi:10.13053/CYS-24-4-3340

Translated title of the contribution: Robust place recognition using convolutional neural networks

Centro de Investigación en Computación (CIC)

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

In this work, we propose using a convolutional neural network for visual place recognition. The work focuses on the identification and automatic extraction of interest regions from a query image. These regions are used to build an image encoding through a vector of locally aggregated descriptors, which is in turn used for image retrieval. Unlike other methods, where the entire image is used to create the encoding, our approach uses only the most important interest regions of the image. This provides better invariance to extreme changes in viewpoint, lighting, and occlusion. Another contribution of the work is the integration of a fully convolutional spatial transformer into the convolutional neural network architecture. This transformer normalizes the interest regions, which makes the encoding more robust. We also propose a loss function for training the network to identify these regions automatically. To evaluate the proposed model, a variety of experiments were carried out on challenging data sets. The reported results show that the proposed method outperforms other state-of-the-art methods.
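
As a rough illustration of the encoding step the abstract mentions, the following is a minimal sketch of a classical vector of locally aggregated descriptors (VLAD) with hard assignment. It is not the authors' implementation: the abstract does not specify how region descriptors are extracted or how the codebook is obtained, so the function names `vlad_encode` and `cosine_similarity`, the toy descriptors, and the two-center codebook are all assumptions made for illustration.

```python
import math


def vlad_encode(descriptors, centers):
    """Aggregate local descriptors into one L2-normalized VLAD vector.

    descriptors: list of D-dimensional lists (hypothetical region descriptors).
    centers: list of K D-dimensional lists (hypothetical codebook, e.g. from k-means).
    """
    k, d = len(centers), len(centers[0])
    residual_sums = [[0.0] * d for _ in range(k)]
    for desc in descriptors:
        # Hard-assign the descriptor to its nearest center (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda i: sum((x - c) ** 2 for x, c in zip(desc, centers[i])),
        )
        # Accumulate the residual between the descriptor and its assigned center.
        for j in range(d):
            residual_sums[nearest][j] += desc[j] - centers[nearest][j]
    # Flatten the K x D residual sums into one vector and L2-normalize it.
    flat = [v for row in residual_sums for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


def cosine_similarity(a, b):
    # Inputs are unit-length (or all-zero), so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))


# Toy example: two 2-D centers and two descriptors (assumed values).
centers = [[0.0, 0.0], [10.0, 10.0]]
query = vlad_encode([[1.0, 0.0], [10.0, 12.0]], centers)
```

For retrieval, a query encoding would be compared against stored encodings of database images, for example by ranking them with `cosine_similarity`.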

Translated title of the contribution: Robust place recognition using convolutional neural networks
Original language: Spanish
Pages (from-to): 1589-1605
Number of pages: 17
Journal: Computacion y Sistemas
Volume: 24
Issue number: 4
DOI: https://doi.org/10.13053/CYS-24-4-3340
State: Published - 2020

Cite this

@article{58e2e35e33014f0294d12142b881685a,

title = "Reconocimiento robusto de lugares mediante redes neuronales convolucionales",

abstract = "In this work, we propose using a convolutional neural network for visual place recognition. The work focuses on the identification and automatic extraction of interest regions from a query image. These regions are used to build an image encoding through a vector of locally aggregated descriptors, which is in turn used for image retrieval. Unlike other methods, where the entire image is used to create the encoding, our approach uses only the most important interest regions of the image. This provides better invariance to extreme changes in viewpoint, lighting, and occlusion. Another contribution of the work is the integration of a fully convolutional spatial transformer into the convolutional neural network architecture. This transformer normalizes the interest regions, which makes the encoding more robust. We also propose a loss function for training the network to identify these regions automatically. To evaluate the proposed model, a variety of experiments were carried out on challenging data sets. The reported results show that the proposed method outperforms other state-of-the-art methods.",

keywords = "Convolutional neural network, Vector of locally aggregated descriptors, Visual place recognition",

author = "{Lugo S{\'a}nchez}, {Omar E.} and Humberto Sossa and Erik Zamora",

year = "2020",

doi = "10.13053/CYS-24-4-3340",

language = "Espa{\~n}ol",

volume = "24",

pages = "1589--1605",

journal = "Computacion y Sistemas",

issn = "1405-5546",

number = "4",

}

TY - JOUR

T1 - Reconocimiento robusto de lugares mediante redes neuronales convolucionales

AU - Lugo Sánchez, Omar E.

AU - Sossa, Humberto

AU - Zamora, Erik

PY - 2020

Y1 - 2020

N2 - In this work, we propose using a convolutional neural network for visual place recognition. The work focuses on the identification and automatic extraction of interest regions from a query image. These regions are used to build an image encoding through a vector of locally aggregated descriptors, which is in turn used for image retrieval. Unlike other methods, where the entire image is used to create the encoding, our approach uses only the most important interest regions of the image. This provides better invariance to extreme changes in viewpoint, lighting, and occlusion. Another contribution of the work is the integration of a fully convolutional spatial transformer into the convolutional neural network architecture. This transformer normalizes the interest regions, which makes the encoding more robust. We also propose a loss function for training the network to identify these regions automatically. To evaluate the proposed model, a variety of experiments were carried out on challenging data sets. The reported results show that the proposed method outperforms other state-of-the-art methods.

AB - In this work, we propose using a convolutional neural network for visual place recognition. The work focuses on the identification and automatic extraction of interest regions from a query image. These regions are used to build an image encoding through a vector of locally aggregated descriptors, which is in turn used for image retrieval. Unlike other methods, where the entire image is used to create the encoding, our approach uses only the most important interest regions of the image. This provides better invariance to extreme changes in viewpoint, lighting, and occlusion. Another contribution of the work is the integration of a fully convolutional spatial transformer into the convolutional neural network architecture. This transformer normalizes the interest regions, which makes the encoding more robust. We also propose a loss function for training the network to identify these regions automatically. To evaluate the proposed model, a variety of experiments were carried out on challenging data sets. The reported results show that the proposed method outperforms other state-of-the-art methods.

KW - Convolutional neural network

KW - Vector of locally aggregated descriptors

KW - Visual place recognition

UR - http://www.scopus.com/inward/record.url?scp=85098759910&partnerID=8YFLogxK

U2 - 10.13053/CYS-24-4-3340

DO - 10.13053/CYS-24-4-3340

M3 - Artículo

AN - SCOPUS:85098759910

SN - 1405-5546

VL - 24

SP - 1589

EP - 1605

JO - Computacion y Sistemas

JF - Computacion y Sistemas

IS - 4

ER -
