Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs

Abraham Montoya Obeso; Jenny Benois-Pineau; Kamel Guissous; Valerie Gouet-Brunet; Mireya S. Garcia Vazquez; Alejandro A. Ramirez Acosta

doi:10.1109/IPTA.2018.8608125

Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

Incorporating Human Visual System (HVS) models into the building of classifiers has become an intensively researched field in visual content mining. Among the variety of HVS models, we are interested in so-called visual saliency maps. In contrast to scan-paths, they model instantaneous attention, assigning a degree of interestingness (saliency) for humans to each pixel in the image plane. In various visual content understanding tasks, these maps have proved effective at stressing the contribution of areas of interest in the image plane to classifier models. In previous work, saliency layers were introduced into Deep CNNs, showing that they reduce training time while reaching similar accuracy and loss values in the optimal models. For large image collections, efficient building of saliency maps relies on predictive models of visual attention. These models are generally bottom-up and are not adapted to specific visual tasks, unless they are built for specific content, such as the "urban images"-targeted saliency maps that we also compare in this paper. In the present research, we propose a "bootstrap" strategy for building visual saliency maps for particular visual data mining tasks. A small collection of images relevant to the visual understanding problem is annotated with gaze fixations. These annotations are then propagated to a large training dataset, and the resulting maps are compared with the classical GBVS model and a recent saliency method for urban image content. The classification results within the Deep CNN framework are promising compared to purely automatic visual saliency prediction.

Original language	English
Title of host publication	2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781538664278
DOIs	https://doi.org/10.1109/IPTA.2018.8608125
State	Published - 10 Jan 2019
Event	8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Xi'an, China Duration: 7 Nov 2018 → 10 Nov 2018

Publication series

Name	2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings

Conference

Conference	8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018
Country/Territory	China
City	Xi'an
Period	7/11/18 → 10/11/18

Keywords

Deep Learning
Mexican Culture
Saliency Maps

Access to Document

10.1109/IPTA.2018.8608125

Cite this

Obeso, A. M., Benois-Pineau, J., Guissous, K., Gouet-Brunet, V., Garcia Vazquez, M. S., & Ramirez Acosta, A. A. (2019). Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs. In 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings, Article 8608125 (2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IPTA.2018.8608125

Obeso, Abraham Montoya ; Benois-Pineau, Jenny ; Guissous, Kamel et al. / Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs. 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. (2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings).

@inproceedings{51feb0fa48a041dfa76aa918b2e69d43,

title = "Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs",

abstract = "Incorporating Human Visual System (HVS) models into the building of classifiers has become an intensively researched field in visual content mining. Among the variety of HVS models, we are interested in so-called visual saliency maps. In contrast to scan-paths, they model instantaneous attention, assigning a degree of interestingness (saliency) for humans to each pixel in the image plane. In various visual content understanding tasks, these maps have proved effective at stressing the contribution of areas of interest in the image plane to classifier models. In previous work, saliency layers were introduced into Deep CNNs, showing that they reduce training time while reaching similar accuracy and loss values in the optimal models. For large image collections, efficient building of saliency maps relies on predictive models of visual attention. These models are generally bottom-up and are not adapted to specific visual tasks, unless they are built for specific content, such as the ``urban images''-targeted saliency maps that we also compare in this paper. In the present research, we propose a ``bootstrap'' strategy for building visual saliency maps for particular visual data mining tasks. A small collection of images relevant to the visual understanding problem is annotated with gaze fixations. These annotations are then propagated to a large training dataset, and the resulting maps are compared with the classical GBVS model and a recent saliency method for urban image content. The classification results within the Deep CNN framework are promising compared to purely automatic visual saliency prediction.",

keywords = "Deep Learning, Mexican Culture, Saliency Maps",

author = "Obeso, {Abraham Montoya} and Jenny Benois-Pineau and Kamel Guissous and Valerie Gouet-Brunet and {Garcia Vazquez}, {Mireya S.} and {Ramirez Acosta}, {Alejandro A.}",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 ; Conference date: 07-11-2018 Through 10-11-2018",

year = "2019",

month = jan,

day = "10",

doi = "10.1109/IPTA.2018.8608125",

language = "English",

series = "2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings",

address = "United States",

}

Obeso, AM, Benois-Pineau, J, Guissous, K, Gouet-Brunet, V, Garcia Vazquez, MS & Ramirez Acosta, AA 2019, Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs. in 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings., 8608125, 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings, Institute of Electrical and Electronics Engineers Inc., 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018, Xi'an, China, 7/11/18. https://doi.org/10.1109/IPTA.2018.8608125

Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs. / Obeso, Abraham Montoya; Benois-Pineau, Jenny; Guissous, Kamel et al.
2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. 8608125 (2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings).

TY - GEN

T1 - Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs

AU - Obeso, Abraham Montoya

AU - Benois-Pineau, Jenny

AU - Guissous, Kamel

AU - Gouet-Brunet, Valerie

AU - Garcia Vazquez, Mireya S.

AU - Ramirez Acosta, Alejandro A.

PY - 2019/1/10

Y1 - 2019/1/10

AB - Incorporating Human Visual System (HVS) models into the building of classifiers has become an intensively researched field in visual content mining. Among the variety of HVS models, we are interested in so-called visual saliency maps. In contrast to scan-paths, they model instantaneous attention, assigning a degree of interestingness (saliency) for humans to each pixel in the image plane. In various visual content understanding tasks, these maps have proved effective at stressing the contribution of areas of interest in the image plane to classifier models. In previous work, saliency layers were introduced into Deep CNNs, showing that they reduce training time while reaching similar accuracy and loss values in the optimal models. For large image collections, efficient building of saliency maps relies on predictive models of visual attention. These models are generally bottom-up and are not adapted to specific visual tasks, unless they are built for specific content, such as the "urban images"-targeted saliency maps that we also compare in this paper. In the present research, we propose a "bootstrap" strategy for building visual saliency maps for particular visual data mining tasks. A small collection of images relevant to the visual understanding problem is annotated with gaze fixations. These annotations are then propagated to a large training dataset, and the resulting maps are compared with the classical GBVS model and a recent saliency method for urban image content. The classification results within the Deep CNN framework are promising compared to purely automatic visual saliency prediction.

KW - Deep Learning

KW - Mexican Culture

KW - Saliency Maps

UR - http://www.scopus.com/inward/record.url?scp=85061920656&partnerID=8YFLogxK

U2 - 10.1109/IPTA.2018.8608125

DO - 10.1109/IPTA.2018.8608125

M3 - Conference contribution

AN - SCOPUS:85061920656

T3 - 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings

BT - 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018

Y2 - 7 November 2018 through 10 November 2018

ER -

Obeso AM, Benois-Pineau J, Guissous K, Gouet-Brunet V, Garcia Vazquez MS, Ramirez Acosta AA. Comparative study of visual saliency maps in the problem of classification of architectural images with Deep CNNs. In 2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. 8608125. (2018 8th International Conference on Image Processing Theory, Tools and Applications, IPTA 2018 - Proceedings). doi: 10.1109/IPTA.2018.8608125
