FASSD-Net: Fast and Accurate Real-Time Semantic Segmentation for Embedded Systems

Leonel Rosas-Arias, Gibran Benitez-Garcia, Jose Portillo-Portillo, Jesus Olivares-Mercado, Gabriel Sanchez-Perez, Keiji Yanai

Research output: Contribution to journalArticlepeer-review

11 Scopus citations

Abstract

Recent works of real-time semantic segmentation, remove or make use of light decoders from dense deep neural networks to achieve fast inference speed. This strategy helps to achieve real-time performance; however, the accuracy is significantly compromised in comparison to non-real-time methods. In this paper, we introduce two key modules aimed to design a high-performance decoder for real-time semantic segmentation, which also reduces the accuracy gap between real-time and non-real-time networks. The first module, Dilated Asymmetric Pyramidal Fusion (DAPF), is designed to increase the receptive field on the top of the last stage of the encoder, obtaining richer contextual features. The second module, Multi-resolution Dilated Asymmetric (MDA) module, fuses and refines detail and contextual information from multi-scale feature maps coming from early and deeper stages of the network. Both modules are designed to keep a low computational complexity by using asymmetric convolutions. With these modules, we propose a network entitled ``FASSD-Net,'' which is based on a light-weight CNN backbone. Running on a single Nvidia GTX 1080Ti, our model reaches 77.5% and 69.3% of mIoU, at 41 and 80 FPS on the Cityscapes and CamVid datasets, respectively. We present an extensive analysis of the accuracy-speed tradeoffs of three FASSD-Net variations on different embedded systems, demonstrating that a light version of our network can run on the low-power consumption Jetson Xavier NX, at 32 FPS reaching 74% of mIoU with full resolution (1024x 2048). The source code and pre-trained models are available at github.com/GibranBenitez/FASSD-Net.

Original languageEnglish
JournalIEEE Transactions on Intelligent Transportation Systems
DOIs
StateAccepted/In press - 2021

Keywords

  • Convolutional codes
  • Decoding
  • Embedded systems
  • HarDNet
  • Image segmentation
  • Jetson Xavier NX.
  • Real-time systems
  • Semantic segmentation
  • Semantics
  • Task analysis
  • embedded systems
  • fully convolutional networks
  • spatial pyramid pooling

Fingerprint

Dive into the research topics of 'FASSD-Net: Fast and Accurate Real-Time Semantic Segmentation for Embedded Systems'. Together they form a unique fingerprint.

Cite this