IPN Hand: A video dataset and benchmark for real-time continuous hand gesture recognition

Gibran Benitez-Garcia, Jesus Olivares-Mercado, Gabriel Sanchez-Perez, Keiji Yanai

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

51 Citas (Scopus)

Resumen

Continuous hand gesture recognition (HGR) is an essential part of human-computer interaction with a wide range of applications in the automotive sector, consumer electronics, home automation, and others. In recent years, accurate and efficient deep learning models have been proposed for HGR. However, in the research community, the current publicly available datasets lack real-world elements needed to build responsive and efficient HGR systems. In this paper, we introduce a new benchmark dataset named IPN Hand with sufficient size, variety, and real-world elements able to train and evaluate deep neural networks. This dataset contains more than 4,000 gesture samples and 800,000 RGB frames from 50 distinct subjects. We design 13 different static and dynamic gestures focused on interaction with touchless screens. We especially consider the scenario when continuous gestures are performed without transition states, and when subjects perform natural movements with their hands as non-gesture actions. Gestures were collected from about 30 diverse scenes, with real-world variation in background and illumination. With our dataset, the performance of three 3D-CNN models is evaluated on the tasks of isolated and continuous real-time HGR. Furthermore, we analyze the possibility of increasing the recognition accuracy by adding multiple modalities derived from RGB frames, i.e., optical flow and semantic segmentation, while keeping the real-time performance of the 3D-CNN model. Our empirical study also provides a comparison with the publicly available nvGesture (NVIDIA) dataset. The experimental results show that the state-of-the-art ResNext-101 model decreases about 30% accuracy when using our real-world dataset, demonstrating that the IPN Hand dataset can be used as a benchmark, and may help the community to step forward in the continuous HGR. Our dataset and pre-trained models used in the evaluation are publicly available at github.com/GibranBenitez/IPN-hand.

Idioma originalInglés
Título de la publicación alojadaProceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
EditorialInstitute of Electrical and Electronics Engineers Inc.
Páginas4340-4347
Número de páginas8
ISBN (versión digital)9781728188089
DOI
EstadoPublicada - 2020
Evento25th International Conference on Pattern Recognition, ICPR 2020 - Virtual, Milan, Italia
Duración: 10 ene. 202115 ene. 2021

Serie de la publicación

NombreProceedings - International Conference on Pattern Recognition
ISSN (versión impresa)1051-4651

Conferencia

Conferencia25th International Conference on Pattern Recognition, ICPR 2020
País/TerritorioItalia
CiudadVirtual, Milan
Período10/01/2115/01/21

Huella

Profundice en los temas de investigación de 'IPN Hand: A video dataset and benchmark for real-time continuous hand gesture recognition'. En conjunto forman una huella única.

Citar esto