Multi-Task Self-Supervised Visual Representation Learning for Monocular Road Segmentation

Laehoon Cho, Youngjung Kim, Hyungjoo Jung, Changjae Oh, Jaesung Youn, Kwanghoon Sohn

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Training deep networks commonly follows the supervised learning paradigm, which requires large-scale semantically labeled data. Constructing such datasets is one of the major challenges in developing Advanced Driver Assistance Systems (ADAS) because human annotation is expensive. In this paper, we explore whether unsupervised stereo-based cues can be used to learn high-level semantics for monocular road detection. Specifically, we estimate drivable space and surface normals from stereo images and use them as pseudo ground truth to train a convolutional neural network (CNN) in a multi-task learning scheme. Combining these self-supervision tasks enables the CNN to jointly encode knowledge of obstacles and the ground plane into a single frame. We demonstrate that the feature representation learned by our multi-task approach synergistically provides rich knowledge of geometric characteristics. Experiments on the KITTI road dataset show that our representation outperforms state-of-the-art road detection approaches.
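The abstract does not state the training objective, so the following is only a minimal sketch of how two pseudo-label tasks could be combined into one loss. It assumes a binary cross-entropy term for the drivable-space mask and a cosine-distance term for surface normals; the function names, the weights `w_seg` and `w_normal`, and the choice of loss terms are all hypothetical and are not taken from the paper.

```python
import math

def bce_loss(pred_probs, pseudo_mask, eps=1e-7):
    """Mean binary cross-entropy between predicted drivable-space
    probabilities and a 0/1 pseudo ground-truth mask (flattened pixels)."""
    total = 0.0
    for p, y in zip(pred_probs, pseudo_mask):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(pred_probs)

def cosine_normal_loss(pred_normals, pseudo_normals, eps=1e-7):
    """Mean (1 - cosine similarity) between predicted and pseudo
    ground-truth surface normals, one 3-vector per pixel."""
    total = 0.0
    for a, b in zip(pred_normals, pseudo_normals):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(x * x for x in b))
        total += 1.0 - dot / max(norm_a * norm_b, eps)
    return total / len(pred_normals)

def multi_task_loss(pred_probs, pseudo_mask, pred_normals, pseudo_normals,
                    w_seg=1.0, w_normal=1.0):
    """Weighted sum of the two self-supervised task losses.
    The weights are hypothetical hyperparameters (assumption)."""
    seg = bce_loss(pred_probs, pseudo_mask)
    nrm = cosine_normal_loss(pred_normals, pseudo_normals)
    return w_seg * seg + w_normal * nrm
```

In a real network both terms would be computed on tensors from two output heads that share an encoder, so that gradients from both tasks shape the same feature representation; the list-based version here only illustrates the arithmetic.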

Original language: English
Title of host publication: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
Publisher: IEEE Computer Society
ISBN (Electronic): 9781538617373
DOIs: https://doi.org/10.1109/ICME.2018.8486472
Publication status: Published - 2018 Oct 8
Event: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018 - San Diego, United States
Duration: 2018 Jul 23 to 2018 Jul 27

Publication series

Name: Proceedings - IEEE International Conference on Multimedia and Expo
Volume: 2018-July
ISSN (Print): 1945-7871
ISSN (Electronic): 1945-788X

Conference

Conference: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
Country: United States
City: San Diego
Period: 18/7/23 to 18/7/27

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Cite this

Cho, L., Kim, Y., Jung, H., Oh, C., Youn, J., & Sohn, K. (2018). Multi-Task Self-Supervised Visual Representation Learning for Monocular Road Segmentation. In 2018 IEEE International Conference on Multimedia and Expo, ICME 2018 [8486472] (Proceedings - IEEE International Conference on Multimedia and Expo; Vol. 2018-July). IEEE Computer Society. https://doi.org/10.1109/ICME.2018.8486472