Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification

Abraham Montoya Obeso; Jenny Benois-Pineau; Mireya Sarai Garcia Vazquez; Alejandro A. Ramirez Acosta

doi:10.1109/CBMI.2018.8516465

Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI)

Research output: Chapter in book/report/conference proceeding › Conference contribution › peer-reviewed

7 Citations (Scopus)

Abstract

Introducing visual saliency, or interestingness, into content selection for image classification is an intensively researched topic. So far it has been applied notably to feature selection in feature-based methods. In today's leading visual-content classifiers, such as deep convolutional neural networks (CNNs), visual saliency maps have not been introduced explicitly. Feature pooling in CNNs is known to be a good strategy for reducing data dimensionality and computational complexity and for summarizing representative features for subsequent layers. In this paper we introduce visual saliency into the network's pooling layers to spatially filter the features that are relevant for deeper layers. Our experiments address a specific task: identifying Mexican architectural styles. The results are promising: the proposed approach reduces model loss and training time while keeping the same accuracy as the baseline CNN.
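The abstract does not say how the saliency map enters the pooling layer, so the following is only a minimal sketch under stated assumptions: the saliency map has the same spatial size as the feature map, its values lie in [0, 1], and it is applied as an elementwise weight before ordinary non-overlapping max pooling. The function name `saliency_max_pool` and the window size `k` are hypothetical and are not taken from the paper.

```python
def saliency_max_pool(features, saliency, k=2):
    """Max-pool a 2-D feature map after weighting it by a saliency map.

    Assumption (not from the paper): saliency acts as an elementwise
    weight in [0, 1], so low-saliency positions are suppressed before
    each k x k window is reduced to its maximum.
    """
    h, w = len(features), len(features[0])
    if len(saliency) != h or any(len(row) != w for row in saliency):
        raise ValueError("features and saliency must have the same shape")
    if h % k or w % k:
        raise ValueError("height and width must be divisible by k")

    pooled = []
    for i in range(0, h, k):          # top row of each window
        row = []
        for j in range(0, w, k):      # left column of each window
            row.append(max(
                features[y][x] * saliency[y][x]
                for y in range(i, i + k)
                for x in range(j, j + k)
            ))
        pooled.append(row)
    return pooled


features = [[0, 1, 2, 3],
            [4, 5, 6, 7],
            [8, 9, 10, 11],
            [12, 13, 14, 15]]
uniform = [[1, 1, 1, 1] for _ in range(4)]    # saliency of 1 everywhere
left_only = [[1, 1, 0, 0] for _ in range(4)]  # right half marked non-salient

print(saliency_max_pool(features, uniform))    # [[5, 7], [13, 15]]
print(saliency_max_pool(features, left_only))  # [[5, 0], [13, 0]]
```

With uniform saliency the function reduces to plain max pooling; with the right half masked out, the windows in that region contribute nothing to deeper layers, which is one simple reading of "spatially filtering" features.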

Original language	English
Title of host publication	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
Publisher	IEEE Computer Society
ISBN (electronic)	9781538670217
DOI	https://doi.org/10.1109/CBMI.2018.8516465
Status	Published - 30 Oct 2018
Event	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 - La Rochelle, France. Duration: 4 Sep 2018 → 6 Sep 2018

Publication series

Name	Proceedings - International Workshop on Content-Based Multimedia Indexing
Volume	2018-September
ISSN (print)	1949-3991

Conference

Conference	16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
Country/Territory	France
City	La Rochelle
Period	4/09/18 → 6/09/18

Access to document

10.1109/CBMI.2018.8516465

Cite this

Obeso, A. M., Benois-Pineau, J., Garcia Vazquez, M. S., & Ramirez Acosta, A. A. (2018). Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification. In 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018, Article 8516465 (Proceedings - International Workshop on Content-Based Multimedia Indexing; Vol. 2018-September). IEEE Computer Society. https://doi.org/10.1109/CBMI.2018.8516465

@inproceedings{5f733501994a48009c86e6c0a5761c65,
title = "Introduction of explicit visual saliency in training of deep CNNs: Application to architectural styles classification",
abstract = "Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.",
keywords = "CNNs, Pooling, Saliency Maps",
author = "Obeso, {Abraham Montoya} and Jenny Benois-Pineau and {Garcia Vazquez}, {Mireya Sarai} and {Ramirez Acosta}, {Alejandro A.}",
note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018 ; Conference date: 04-09-2018 Through 06-09-2018",
year = "2018",
month = oct,
day = "30",
doi = "10.1109/CBMI.2018.8516465",
language = "English",
series = "Proceedings - International Workshop on Content-Based Multimedia Indexing",
publisher = "IEEE Computer Society",
booktitle = "16th International Conference on Content-Based Multimedia Indexing, CBMI 2018",
address = "United States",
}


TY - GEN
T1 - Introduction of explicit visual saliency in training of deep CNNs
T2 - 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
AU - Obeso, Abraham Montoya
AU - Benois-Pineau, Jenny
AU - Garcia Vazquez, Mireya Sarai
AU - Ramirez Acosta, Alejandro A.
PY - 2018/10/30
Y1 - 2018/10/30
AB - Introduction of visual saliency or interestingness in the content selection for image classification tasks is an intensively researched topic. It has been namely fulfilled for feature selection in feature-based methods. Nowadays, in the winner classifiers of visual content such as Deep Convolutional Neural Networks, visual saliency maps have not been introduced explicitly. Pooling features in CNNs is known as a good strategy to reduce data dimensionality, computational complexity and summarize representative features for subsequent layers. In this paper we introduce visual saliency in network pooling layers to spatially filter relevant features for deeper layers. Our experiments are conducted in a specific task to identify Mexican architectural styles. The results are promising: proposed approach reduces model loss and training time keeping the same accuracy as the base-line CNN.
KW - CNNs
KW - Pooling
KW - Saliency Maps
UR - http://www.scopus.com/inward/record.url?scp=85057031191&partnerID=8YFLogxK
U2 - 10.1109/CBMI.2018.8516465
DO - 10.1109/CBMI.2018.8516465
M3 - Conference contribution
AN - SCOPUS:85057031191
T3 - Proceedings - International Workshop on Content-Based Multimedia Indexing
BT - 16th International Conference on Content-Based Multimedia Indexing, CBMI 2018
PB - IEEE Computer Society
Y2 - 4 September 2018 through 6 September 2018
ER -
