TY - JOUR
T1 - Reconocimiento robusto de lugares mediante redes neuronales convolucionales
AU - Lugo Sánchez, Omar E.
AU - Sossa, Humberto
AU - Zamora, Erik
N1 - Publisher Copyright: © 2020 Instituto Politecnico Nacional. All rights reserved.
PY - 2020
Y1 - 2020
AB - In this work, we propose using a convolutional neural network for visual place recognition. The work focuses on the automatic identification and extraction of regions of interest from a query image. These regions are used to build an image encoding through a vector of locally aggregated descriptors, which is in turn used for image retrieval. Unlike other methods, where the entire image is used to create the encoding, our approach uses only the most important regions of interest in the image. This provides better invariance to extreme changes in viewpoint, lighting, and occlusions. Another contribution of the work is the integration of a fully convolutional spatial transformer into the convolutional neural network architecture. This transformer is used to normalize the regions of interest, which makes the encoding more robust. A loss function is also proposed to train the neural network to identify regions automatically. To measure the effectiveness of the proposed model, a variety of experiments were carried out on challenging data sets. The reported results show that the proposed method outperforms other state-of-the-art methods.
KW - Convolutional neural network
KW - Vector of locally aggregated descriptors
KW - Visual place recognition
UR - http://www.scopus.com/inward/record.url?scp=85098759910&partnerID=8YFLogxK
U2 - 10.13053/CYS-24-4-3340
DO - 10.13053/CYS-24-4-3340
M3 - Article
AN - SCOPUS:85098759910
SN - 1405-5546
VL - 24
SP - 1589
EP - 1605
JO - Computacion y Sistemas
JF - Computacion y Sistemas
IS - 4
ER -