Unified multi-spectral pedestrian detection based on probabilistic fusion networks

Kihong Park, Seungryong Kim, Kwanghoon Sohn

Research output: Contribution to journalArticlepeer-review

57 Citations (Scopus)

Abstract

Despite significant progress in machine learning, pedestrian detection in the real-world is still regarded as one of the challenging problems, limited by occluded appearances, cluttered backgrounds, and bad visibility at night. This has caused detection approaches using multi-spectral sensors such as color and thermal which could be complementary to each other. In this paper, we propose a novel sensor fusion framework for detecting pedestrians even in challenging real-world environments. We design a convolutional neural network (CNN) architecture that consists of three-branch detection models taking different modalities as inputs. Unlike existing methods, we consider all detection probabilities from each modality in a unified CNN framework and selectively use them through a channel weighting fusion (CWF) layer to maximize the detection performance. An accumulated probability fusion (APF) layer is also introduced to combine probabilities from different modalities at the proposal-level. We formulate these sub-networks into a unified network, so that it is possible to train the whole network in an end-to-end manner. Our extensive evaluation demonstrates that the proposed method outperforms the state-of-the-art methods on the challenging KAIST, CVC-14, and DIML multi-spectral pedestrian datasets.

Original languageEnglish
Pages (from-to)143-155
Number of pages13
JournalPattern Recognition
Volume80
DOIs
Publication statusPublished - 2018 Aug

Bibliographical note

Funding Information:
This work was supported by Institute for Information &Communications Technology Promotion(IITP) grant funded by the Korea government(MSIP) (R0124-16-0002, Emotional Intelligence Technology to Infer Human Emotion and Carry on Dialogue Accordingly).

Funding Information:
This work was supported by Institute for Information &Communications Technology Promotion(IITP) grant funded by the Korea government( MSIP ) ( R0124-16-0002 , Emotional Intelligence Technology to Infer Human Emotion and Carry on Dialogue Accordingly).

Publisher Copyright:
© 2018 Elsevier Ltd

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Unified multi-spectral pedestrian detection based on probabilistic fusion networks'. Together they form a unique fingerprint.

Cite this