Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification

Abraham Montoya Obeso; Jenny Benois-Pineau; Mireya Sarai Garcia Vazquez; Alejandro A. Ramirez Acosta

doi:10.1109/CBMI.2018.8516465

Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification

Abraham Montoya Obeso, Jenny Benois-Pineau, Mireya Sarai Garcia Vazquez, Alejandro A. Ramirez Acosta

Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

7 Scopus citations

Abstract

Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.

Original language	English
Title of host publication	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
Publisher	IEEE Computer Society
ISBN (Electronic)	9781538670217
DOIs	https://doi.org/10.1109/CBMI.2018.8516465
State	Published - 30 Oct 2018
Event	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 - La Rochelle, France Duration: 4 Sep 2018 → 6 Sep 2018

Publication series

Name	Proceedings - International Workshop on Content-Based Multimedia Indexing
Volume	2018-September
ISSN (Print)	1949-3991

Conference

Conference	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
Country/Territory	France
City	La Rochelle
Period	4/09/18 → 6/09/18

Keywords

CNNs
Pooling
Saliency Maps

Access to Document

10.1109/CBMI.2018.8516465

Cite this

Obeso, A. M., Benois-Pineau, J., Garcia Vazquez, M. S., & Ramirez Acosta, A. A. (2018). Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification. In 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 Article 8516465 (Proceedings - International Workshop on Content-Based Multimedia Indexing; Vol. 2018-September). IEEE Computer Society. https://doi.org/10.1109/CBMI.2018.8516465

Obeso, Abraham Montoya ; Benois-Pineau, Jenny ; Garcia Vazquez, Mireya Sarai et al. / Introduction of explicit visual saliency in training of deep CNNs : Application to architectural styles classification. 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018. IEEE Computer Society, 2018. (Proceedings - International Workshop on Content-Based Multimedia Indexing).

@inproceedings{5f733501994a48009c86e6c0a5761c65,

title = "Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification",

abstract = "Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.",

keywords = "CNNs, Pooling, Saliency Maps",

author = "Obeso, {Abraham Montoya} and Jenny Benois-Pineau and {Garcia Vazquez}, {Mireya Sarai} and {Ramirez Acosta}, {Alejandro A.}",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 ; Conference date: 04-09-2018 Through 06-09-2018",

year = "2018",

month = oct,

day = "30",

doi = "10.1109/CBMI.2018.8516465",

language = "Ingl{\'e}s",

series = "Proceedings - International Workshop on Content-Based Multimedia Indexing",

publisher = "IEEE Computer Society",

booktitle = "16th International Conference on Content-Based Multimedia Indexing, CBMI 2018",

address = "Estados Unidos",

}

Obeso, AM, Benois-Pineau, J, Garcia Vazquez, MS & Ramirez Acosta, AA 2018, Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification. in 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018., 8516465, Proceedings - International Workshop on Content-Based Multimedia Indexing, vol. 2018-September, IEEE Computer Society, 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018, La Rochelle, France, 4/09/18. https://doi.org/10.1109/CBMI.2018.8516465

Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification. / Obeso, Abraham Montoya; Benois-Pineau, Jenny; Garcia Vazquez, Mireya Sarai et al.
16th International Conference on Content-Based Multimedia Indexing, CBMI 2018. IEEE Computer Society, 2018. 8516465 (Proceedings - International Workshop on Content-Based Multimedia Indexing; Vol. 2018-September).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Introduction of explicit visual saliency in training of deep CNNs

T2 - 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018

AU - Obeso, Abraham Montoya

AU - Benois-Pineau, Jenny

AU - Garcia Vazquez, Mireya Sarai

AU - Ramirez Acosta, Alejandro A.

PY - 2018/10/30

Y1 - 2018/10/30

N2 - Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.

AB - Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.

KW - CNNs

KW - Pooling

KW - Saliency Maps

UR - http://www.scopus.com/inward/record.url?scp=85057031191&partnerID=8YFLogxK

U2 - 10.1109/CBMI.2018.8516465

DO - 10.1109/CBMI.2018.8516465

M3 - Contribución a la conferencia

AN - SCOPUS:85057031191

T3 - Proceedings - International Workshop on Content-Based Multimedia Indexing

BT - 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018

PB - IEEE Computer Society

Y2 - 4 September 2018 through 6 September 2018

ER -

Obeso AM, Benois-Pineau J, Garcia Vazquez MS, Ramirez Acosta AA. Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification. In 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018. IEEE Computer Society. 2018. 8516465. (Proceedings - International Workshop on Content-Based Multimedia Indexing). doi: 10.1109/CBMI.2018.8516465

Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this