© 2017 SPIE. Multimedia content description, organization, and retrieval applications currently tend to rely on unimodal databases, which cover a single type of content such as text, voice, or images. Bimodal databases, in contrast, semantically associate two different types of content, such as audio and video or image and text, among others. Generating a bimodal audio-video database requires connecting the two kinds of multimedia content through a semantic relation that associates the actions present in both types of information. This paper describes in detail the characteristics and methodology used to create a bimodal database of violent content; the semantic relationship is established through the proposed concepts that describe the audiovisual information. Using bimodal databases in audiovisual content processing applications increases semantic performance if and only if those applications process both types of content. The database contains 580 annotated audiovisual segments with a duration of 28 minutes, divided into 41 classes. Bimodal databases are a tool for building applications for the semantic web.
|Original language||American English|
|State||Published - 1 Jan 2017|
|Event||Proceedings of SPIE - The International Society for Optical Engineering - |
Duration: 1 Jan 2017 → …
|Conference||Proceedings of SPIE - The International Society for Optical Engineering|
|Period||1/01/17 → …|
Rivera Martínez, J. L., Mijes Cruz, M. H., Rodríguez Vázqu, M. A., Rodríguez Espejo, L., Montoya Obeso, A., García Vázquez, M. S., & Ramírez Acosta, A. Á. (2017). BDVC (Bimodal Database of Violent Content): A database of violent audio and video. Paper presented at Proceedings of SPIE - The International Society for Optical Engineering. https://doi.org/10.1117/12.2274735