TY - GEN
T1 - Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs
AU - Obeso, Abraham Montoya
AU - Benois-Pineau, Jenny
AU - Guissous, Kamel
AU - Gouet-Brunet, Valerie
AU - Garcia Vazquez, Mireya S.
AU - Ramirez Acosta, Alejandro A.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2019/1/10
Y1 - 2019/1/10
N2 - Incorporating Human Visual System (HVS) models into the building of classifiers has become an intensively researched field in visual content mining. Among the variety of HVS models, we are interested in so-called visual saliency maps. Contrary to scan-paths, they model instantaneous attention, assigning a degree of interestingness/saliency for humans to each pixel in the image plane. In various tasks of visual content understanding, these maps have proved efficient in stressing the contribution of areas of interest in the image plane to classifier models. In previous works, saliency layers have been introduced into Deep CNNs, showing that they reduce training time while yielding similar accuracy and loss values in optimal models. In the case of large image collections, efficient building of saliency maps relies on predictive models of visual attention. These models are generally bottom-up and are not adapted to specific visual tasks, unless they are built for specific content, such as the "urban images"-targeted saliency maps we also compare in this paper. In the present research, we propose a "bootstrap" strategy for building visual saliency maps for particular tasks of visual data mining. A small collection of images relevant to the visual understanding problem is annotated with gaze fixations. The saliency is then propagated to a large training dataset and compared with the classical GBVS model and a recent saliency method for urban image content. The classification results within the Deep CNN framework are promising compared to purely automatic visual saliency prediction.
AB - Incorporating Human Visual System (HVS) models into the building of classifiers has become an intensively researched field in visual content mining. Among the variety of HVS models, we are interested in so-called visual saliency maps. Contrary to scan-paths, they model instantaneous attention, assigning a degree of interestingness/saliency for humans to each pixel in the image plane. In various tasks of visual content understanding, these maps have proved efficient in stressing the contribution of areas of interest in the image plane to classifier models. In previous works, saliency layers have been introduced into Deep CNNs, showing that they reduce training time while yielding similar accuracy and loss values in optimal models. In the case of large image collections, efficient building of saliency maps relies on predictive models of visual attention. These models are generally bottom-up and are not adapted to specific visual tasks, unless they are built for specific content, such as the "urban images"-targeted saliency maps we also compare in this paper. In the present research, we propose a "bootstrap" strategy for building visual saliency maps for particular tasks of visual data mining. A small collection of images relevant to the visual understanding problem is annotated with gaze fixations. The saliency is then propagated to a large training dataset and compared with the classical GBVS model and a recent saliency method for urban image content. The classification results within the Deep CNN framework are promising compared to purely automatic visual saliency prediction.
KW - Deep Learning
KW - Mexican Culture
KW - Saliency Maps
UR - http://www.scopus.com/inward/record.url?scp=85061920656&partnerID=8YFLogxK
U2 - 10.1109/IPTA.2018.8608125
DO - 10.1109/IPTA.2018.8608125
M3 - Conference contribution
AN - SCOPUS:85061920656
T3 - 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings
BT - 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018
Y2 - 7 November 2018 through 10 November 2018
ER -