Connoisseur: Classification of styles of Mexican architectural heritage with deep learning and visual attention prediction

Abraham Montoya Obeso, Mireya S.García Vázquez, Alejandro A.Ramirez Acosta, Jenny Benois-Pineau

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

24 Scopus citations

Abstract

The automatic description of multimedia content was mainly developed for classification tasks, retrieval systems and massive ordering of data. Preservation of cultural heritage is a field of high importance for application to this method. Our problem is classification of architectural styles of buildings in digital photographs of Mexican cultural heritage. The selection of relevant content in the scene for training classification models allows them to be more precise in the classification task. Here we use a saliency-driven approach to predict visual attention in images and use it to train a Convolutional Neural Network to identify the architectural style of Mexican buildings. Also, we present an analysis of the behavior of the models trained under the traditional cropped image and the prominence maps. In this sense, we show that the performance of the saliency-based CNNs is better than the traditional training reaching a classification rate of 97% in validation dataset. It is considered that style identification with this technique can make a wide contribution in video description tasks, specifically in the automatic documentation of Mexican cultural heritage.

Original languageEnglish
Title of host publicationProceedings of the 15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450353335
DOIs
StatePublished - 19 Jun 2017
Event15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017 - Firenze, Italy
Duration: 19 Jun 201721 Jun 2017

Publication series

NameACM International Conference Proceeding Series
VolumePart F130150

Conference

Conference15th International Workshop on Content-Based Multimedia Indexing, CBMI 2017
Country/TerritoryItaly
CityFirenze
Period19/06/1721/06/17

Keywords

  • CNN
  • Cultural heritage
  • Deep learning
  • Image classification
  • Visual attention prediction

Fingerprint

Dive into the research topics of 'Connoisseur: Classification of styles of Mexican architectural heritage with deep learning and visual attention prediction'. Together they form a unique fingerprint.

Cite this