Pedestrian detection model based on Tiny-Yolov3 architecture for wearable devices to visually impaired assistance

Sergio Uriel Maya-Martínez; Amadeo José Argüelles-Cruz; Zobeida Jezabel Guzmán-Zavaleta; Miguel de Jesús Ramírez-Cadena

doi:10.3389/frobt.2023.1052509

Pedestrian detection model based on Tiny-Yolov3 architecture for wearable devices to visually impaired assistance

Sergio Uriel Maya-Martínez, Amadeo José Argüelles-Cruz, Zobeida Jezabel Guzmán-Zavaleta, Miguel de Jesús Ramírez-Cadena

Centro de Investigación en Computación (CIC)

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Introduction: Wearable assistive devices for the visually impaired whose technology is based on video camera devices represent a challenge in rapid evolution, where one of the main problems is to find computer vision algorithms that can be implemented in low-cost embedded devices. Objectives and Methods: This work presents a Tiny You Only Look Once architecture for pedestrian detection, which can be implemented in low-cost wearable devices as an alternative for the development of assistive technologies for the visually impaired. Results: The recall results of the proposed refined model represent an improvement of 71% working with four anchor boxes and 66% with six anchor boxes compared to the original model. The accuracy achieved on the same data set shows an increase of 14% and 25%, respectively. The F1 calculation shows a refinement of 57% and 55%. The average accuracy of the models achieved an improvement of 87% and 99%. The number of correctly detected objects was 3098 and 2892 for four and six anchor boxes, respectively, whose performance is better by 77% and 65% compared to the original, which correctly detected 1743 objects. Discussion: Finally, the model was optimized for the Jetson Nano embedded system, a case study for low-power embedded devices, and in a desktop computer. In both cases, the graphics processing unit (GPU) and central processing unit were tested, and a documented comparison of solutions aimed at serving visually impaired people was performed. Conclusion: We performed the desktop tests with a RTX 2070S graphics card, and the image processing took about 2.8 ms. The Jetson Nano board could process an image in about 110 ms, offering the opportunity to generate alert notification procedures in support of visually impaired mobility.

Original language	English
Article number	1052509
Journal	Frontiers in Robotics and AI
Volume	10
DOIs	https://doi.org/10.3389/frobt.2023.1052509
State	Published - 2023

Keywords

Tiny YOLOv3
deep learning
graphic processing unit
image processing
pedestrian detection
visual impaired

Access to Document

10.3389/frobt.2023.1052509

Cite this

@article{7d10322cc1fb4cd681d6f4803b48810e,

title = "Pedestrian detection model based on Tiny-Yolov3 architecture for wearable devices to visually impaired assistance",

abstract = "Introduction: Wearable assistive devices for the visually impaired whose technology is based on video camera devices represent a challenge in rapid evolution, where one of the main problems is to find computer vision algorithms that can be implemented in low-cost embedded devices. Objectives and Methods: This work presents a Tiny You Only Look Once architecture for pedestrian detection, which can be implemented in low-cost wearable devices as an alternative for the development of assistive technologies for the visually impaired. Results: The recall results of the proposed refined model represent an improvement of 71% working with four anchor boxes and 66% with six anchor boxes compared to the original model. The accuracy achieved on the same data set shows an increase of 14% and 25%, respectively. The F1 calculation shows a refinement of 57% and 55%. The average accuracy of the models achieved an improvement of 87% and 99%. The number of correctly detected objects was 3098 and 2892 for four and six anchor boxes, respectively, whose performance is better by 77% and 65% compared to the original, which correctly detected 1743 objects. Discussion: Finally, the model was optimized for the Jetson Nano embedded system, a case study for low-power embedded devices, and in a desktop computer. In both cases, the graphics processing unit (GPU) and central processing unit were tested, and a documented comparison of solutions aimed at serving visually impaired people was performed. Conclusion: We performed the desktop tests with a RTX 2070S graphics card, and the image processing took about 2.8 ms. The Jetson Nano board could process an image in about 110 ms, offering the opportunity to generate alert notification procedures in support of visually impaired mobility.",

keywords = "Tiny YOLOv3, deep learning, graphic processing unit, image processing, pedestrian detection, visual impaired",

author = "Maya-Mart{\'i}nez, {Sergio Uriel} and Arg{\"u}elles-Cruz, {Amadeo Jos{\'e}} and Guzm{\'a}n-Zavaleta, {Zobeida Jezabel} and Ram{\'i}rez-Cadena, {Miguel de Jes{\'u}s}",

note = "Publisher Copyright: Copyright {\textcopyright} 2023 Maya-Mart{\'i}nez, Arg{\"u}elles-Cruz, Guzm{\'a}n-Zavaleta and Ram{\'i}rez-Cadena.",

year = "2023",

doi = "10.3389/frobt.2023.1052509",

language = "Ingl{\'e}s",

volume = "10",

journal = "Frontiers in Robotics and AI",

issn = "2296-9144",

publisher = "Frontiers Media S.A.",

}

TY - JOUR

T1 - Pedestrian detection model based on Tiny-Yolov3 architecture for wearable devices to visually impaired assistance

AU - Maya-Martínez, Sergio Uriel

AU - Argüelles-Cruz, Amadeo José

AU - Guzmán-Zavaleta, Zobeida Jezabel

AU - Ramírez-Cadena, Miguel de Jesús

PY - 2023

Y1 - 2023

N2 - Introduction: Wearable assistive devices for the visually impaired whose technology is based on video camera devices represent a challenge in rapid evolution, where one of the main problems is to find computer vision algorithms that can be implemented in low-cost embedded devices. Objectives and Methods: This work presents a Tiny You Only Look Once architecture for pedestrian detection, which can be implemented in low-cost wearable devices as an alternative for the development of assistive technologies for the visually impaired. Results: The recall results of the proposed refined model represent an improvement of 71% working with four anchor boxes and 66% with six anchor boxes compared to the original model. The accuracy achieved on the same data set shows an increase of 14% and 25%, respectively. The F1 calculation shows a refinement of 57% and 55%. The average accuracy of the models achieved an improvement of 87% and 99%. The number of correctly detected objects was 3098 and 2892 for four and six anchor boxes, respectively, whose performance is better by 77% and 65% compared to the original, which correctly detected 1743 objects. Discussion: Finally, the model was optimized for the Jetson Nano embedded system, a case study for low-power embedded devices, and in a desktop computer. In both cases, the graphics processing unit (GPU) and central processing unit were tested, and a documented comparison of solutions aimed at serving visually impaired people was performed. Conclusion: We performed the desktop tests with a RTX 2070S graphics card, and the image processing took about 2.8 ms. The Jetson Nano board could process an image in about 110 ms, offering the opportunity to generate alert notification procedures in support of visually impaired mobility.

AB - Introduction: Wearable assistive devices for the visually impaired whose technology is based on video camera devices represent a challenge in rapid evolution, where one of the main problems is to find computer vision algorithms that can be implemented in low-cost embedded devices. Objectives and Methods: This work presents a Tiny You Only Look Once architecture for pedestrian detection, which can be implemented in low-cost wearable devices as an alternative for the development of assistive technologies for the visually impaired. Results: The recall results of the proposed refined model represent an improvement of 71% working with four anchor boxes and 66% with six anchor boxes compared to the original model. The accuracy achieved on the same data set shows an increase of 14% and 25%, respectively. The F1 calculation shows a refinement of 57% and 55%. The average accuracy of the models achieved an improvement of 87% and 99%. The number of correctly detected objects was 3098 and 2892 for four and six anchor boxes, respectively, whose performance is better by 77% and 65% compared to the original, which correctly detected 1743 objects. Discussion: Finally, the model was optimized for the Jetson Nano embedded system, a case study for low-power embedded devices, and in a desktop computer. In both cases, the graphics processing unit (GPU) and central processing unit were tested, and a documented comparison of solutions aimed at serving visually impaired people was performed. Conclusion: We performed the desktop tests with a RTX 2070S graphics card, and the image processing took about 2.8 ms. The Jetson Nano board could process an image in about 110 ms, offering the opportunity to generate alert notification procedures in support of visually impaired mobility.

KW - Tiny YOLOv3

KW - deep learning

KW - graphic processing unit

KW - image processing

KW - pedestrian detection

KW - visual impaired

UR - http://www.scopus.com/inward/record.url?scp=85151963467&partnerID=8YFLogxK

U2 - 10.3389/frobt.2023.1052509

DO - 10.3389/frobt.2023.1052509

M3 - Artículo

C2 - 37008985

AN - SCOPUS:85151963467

SN - 2296-9144

VL - 10

JO - Frontiers in Robotics and AI

JF - Frontiers in Robotics and AI

M1 - 1052509

ER -

Pedestrian detection model based on Tiny-Yolov3 architecture for wearable devices to visually impaired assistance

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this