Abstract
Aerial-to-ground image synthesis is an emerging and challenging problem that aims to synthesize a ground image from an aerial image. Due to the highly different layout and object representation between the aerial and ground images, existing approaches usually fail to transfer the components of the aerial scene into the ground scene. In this paper, we propose a novel framework to explore the challenges by imposing enhanced structural alignment and semantic awareness. We introduce a novel semantic-attentive feature transformation module that allows to reconstruct the complex geographic structures by aligning the aerial feature to the ground layout. Furthermore, we propose semantic-aware loss functions by leveraging a pre-trained segmentation network. The network is enforced to synthesize realistic objects across various classes by separately calculating losses for different classes and balancing them. Extensive experiments including comparisons with previous methods and ablation studies show the effectiveness of the proposed framework both qualitatively and quantitatively.
Original language | English |
---|---|
Title of host publication | 2021 IEEE International Conference on Image Processing, ICIP 2021 - Proceedings |
Publisher | IEEE Computer Society |
Pages | 3862-3866 |
Number of pages | 5 |
ISBN (Electronic) | 9781665441155 |
DOIs | |
Publication status | Published - 2021 |
Event | 2021 IEEE International Conference on Image Processing, ICIP 2021 - Anchorage, United States Duration: 2021 Sept 19 → 2021 Sept 22 |
Publication series
Name | Proceedings - International Conference on Image Processing, ICIP |
---|---|
Volume | 2021-September |
ISSN (Print) | 1522-4880 |
Conference
Conference | 2021 IEEE International Conference on Image Processing, ICIP 2021 |
---|---|
Country/Territory | United States |
City | Anchorage |
Period | 21/9/19 → 21/9/22 |
Bibliographical note
Funding Information:This research was supported by the Yonsei University Research Fund of 2021 (2021-22-0001).
Funding Information:
∗Corresponding author This work was supported by Institute of Information communications Technology Planning & Evaluation (IITP) grand funded by the Korea government(MSIT) (No.2020-0-00056, To create AI systems that act appropriately and effectively in novel situations that occur in open worlds.)
Publisher Copyright:
© 2021 IEEE
All Science Journal Classification (ASJC) codes
- Software
- Computer Vision and Pattern Recognition
- Signal Processing