Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification

Abraham Montoya Obeso, Jenny Benois-Pineau, Mireya Sarai Garcia Vazquez, Alejandro A. Ramirez Acosta

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.

Original languageEnglish
Title of host publication16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
PublisherIEEE Computer Society
ISBN (Electronic)9781538670217
DOIs
StatePublished - 30 Oct 2018
Event16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 - La Rochelle, France
Duration: 4 Sep 20186 Sep 2018

Publication series

NameProceedings - International Workshop on Content-Based Multimedia Indexing
Volume2018-September
ISSN (Print)1949-3991

Conference

Conference16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
Country/TerritoryFrance
CityLa Rochelle
Period4/09/186/09/18

Keywords

  • CNNs
  • Pooling
  • Saliency Maps

Fingerprint

Dive into the research topics of 'Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification'. Together they form a unique fingerprint.

Cite this