BDVC (Bimodal Database of Violent Content): A database of violent audio and video

Jose Luis Rivera Martínez, Mario Humberto Mijes Cruz, Manuel Antonio Rodríguez Vázqu, Luis Rodríguez Espejo, Abraham Montoya Obeso, Mireya Saraí García Vázquez, Alejandro Álvaro Ramírez Acosta

Research output: Contribution to conferencePaper

Abstract

© 2017 SPIE. Nowadays there is a trend towards the use of unimodal databases for multimedia content description, organization and retrieval applications of a single type of content like text, voice and images, instead bimodal databases allow to associate semantically two different types of content like audio-video, image-text, among others. The generation of a bimodal database of audio-video implies the creation of a connection between the multimedia content through the semantic relation that associates the actions of both types of information. This paper describes in detail the used characteristics and methodology for the creation of the bimodal database of violent content; the semantic relationship is stablished by the proposed concepts that describe the audiovisual information. The use of bimodal databases in applications related to the audiovisual content processing allows an increase in the semantic performance only and only if these applications process both type of content. This bimodal database counts with 580 audiovisual annotated segments, with a duration of 28 minutes, divided in 41 classes. Bimodal databases are a tool in the generation of applications for the semantic web.
Original languageAmerican English
DOIs
StatePublished - 1 Jan 2017
EventProceedings of SPIE - The International Society for Optical Engineering -
Duration: 1 Jan 2017 → …

Conference

ConferenceProceedings of SPIE - The International Society for Optical Engineering
Period1/01/17 → …

    Fingerprint

Cite this

Rivera Martínez, J. L., Mijes Cruz, M. H., Rodríguez Vázqu, M. A., Rodríguez Espejo, L., Montoya Obeso, A., García Vázquez, M. S., & Ramírez Acosta, A. Á. (2017). BDVC (Bimodal Database of Violent Content): A database of violent audio and video. Paper presented at Proceedings of SPIE - The International Society for Optical Engineering, . https://doi.org/10.1117/12.2274735