Label-Conditional Synthetic Satellite Imagery
A project with the Microsoft AI for Good Lab
Motivation
Generating high-quality synthetic satellite imagery matters because satellite imagery is a crucial type of data for training machine learning models that address global issues such as climate change and biodiversity estimation. There are also a variety of use cases in established industries such as urban planning, security, agriculture, and insurance. However, high-resolution satellite imagery is infrequently collected and expensive to access, making it a scarce resource. Moreover, licensing constraints often prohibit releasing high-resolution satellite images to the public. In contrast, synthetic satellite imagery can be abundant, low-cost, and high-quality at the same time.
Our Solution
We propose a label-conditional synthetic image generation model for creating synthetic satellite imagery datasets.
Given a dataset of real high-resolution imagery and accompanying land cover masks, we show that it is possible to train an upstream class-conditional image generator, use that generator to create synthetic imagery from the land cover masks, and then train a downstream model on the synthetic imagery and land cover masks that achieves test set performance similar to a model trained on the real imagery.
Further, we find that mixing real and synthetic imagery acts as a data augmentation method, producing better models than using real imagery alone.
Features and experiments
Our basic pipeline consists of an upstream task (training the generator), synthetic data generation, and a downstream task that tests the usability of the synthetic data. In general, the downstream task could be any machine learning model or task that uses satellite images; in this work, we choose segmentation as our main downstream task (a pipeline sketch follows the list below).
- We adapt a conditional GAN model with spatially-adaptive normalization (SPADE) to overcome the lack of diversity in the synthetic output.
- We use synthetic satellite images as a substitute for, or augmentation of, real datasets to build models of comparable quality.
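As a rough illustration of this three-stage pipeline, the sketch below wires together placeholder models: `SpadeGenerator` and `SegmentationModel` are stand-ins for the real architectures, and the tensor shapes, class count, and latent size are our assumptions, not the repository's actual configuration.

```python
import torch
import torch.nn as nn

NUM_CLASSES = 6   # number of land cover classes (assumed)
LATENT_DIM = 256  # generator latent size (assumed)

class SpadeGenerator(nn.Module):
    """Placeholder for the SPADE generator: class mask + latent -> RGB image."""
    def __init__(self):
        super().__init__()
        self.net = nn.Conv2d(NUM_CLASSES + LATENT_DIM, 3, kernel_size=3, padding=1)

    def forward(self, mask_onehot, z):
        # Broadcast the latent code spatially and condition on the mask.
        z_map = z[:, :, None, None].expand(-1, -1, *mask_onehot.shape[2:])
        return torch.tanh(self.net(torch.cat([mask_onehot, z_map], dim=1)))

class SegmentationModel(nn.Module):
    """Placeholder downstream segmenter: RGB image -> per-pixel class logits."""
    def __init__(self):
        super().__init__()
        self.net = nn.Conv2d(3, NUM_CLASSES, kernel_size=3, padding=1)

    def forward(self, image):
        return self.net(image)

# Stage 1 (upstream): the generator would be trained on real image/mask pairs.
generator = SpadeGenerator()

# Stage 2: generate synthetic imagery from existing land cover masks.
masks = torch.randint(0, NUM_CLASSES, (4, 64, 64))  # fake masks for the demo
masks_onehot = nn.functional.one_hot(masks, NUM_CLASSES).permute(0, 3, 1, 2).float()
z = torch.randn(4, LATENT_DIM)
synthetic_images = generator(masks_onehot, z).detach()

# Stage 3 (downstream): train a segmenter on (synthetic image, mask) pairs.
segmenter = SegmentationModel()
logits = segmenter(synthetic_images)
loss = nn.functional.cross_entropy(logits, masks)
loss.backward()
print(f"downstream loss on synthetic batch: {loss.item():.4f}")
```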
Label-conditional synthetic satellite imagery generation
We adapt a conditional GAN with SPADE to the synthetic satellite imagery task and optimize it to generate better images, prepare different datasets for training the downstream models, and evaluate the performance of the downstream segmentation models trained on each dataset.
We trained SPADE using real 3-channel very-high-resolution (VHR) satellite imagery with land cover and building segmentation masks as inputs. Because the baseline model tends to produce nearly identical synthetic images given the same mask, we add a loss term during training to increase output diversity.
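The paper specifies the exact form of this diversity term; as a hedged illustration, the sketch below implements a mode-seeking style regularizer that rewards output variation in proportion to latent variation, scaled by a weight `lambda_div` (the lambda swept in the table further down). The function name and shapes are ours, not the repository's.

```python
import torch

def diversity_loss(img1, img2, z1, z2, eps=1e-5):
    """Mode-seeking style regularizer (an assumed form, not necessarily the
    paper's exact loss): encourage two images generated from different latents
    but the same mask to differ in proportion to their latent distance.
    Minimizing the negative ratio pushes the generator toward diverse outputs.
    """
    img_dist = torch.mean(torch.abs(img1 - img2), dim=[1, 2, 3])  # per-sample L1
    z_dist = torch.mean(torch.abs(z1 - z2), dim=1)
    return -torch.mean(img_dist / (z_dist + eps))

# Hypothetical usage inside the generator update:
#   z1, z2 = torch.randn(B, D), torch.randn(B, D)
#   img1, img2 = G(mask, z1), G(mask, z2)
#   loss_G = adversarial_loss + lambda_div * diversity_loss(img1, img2, z1, z2)
```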
Our results suggest that increasing diversity leads to more photo-realistic synthetic images and better downstream segmentation performance. We generate synthetic satellite images at different levels of diversity for our downstream experiments (please refer to our paper for more details).
Above, each row shows three examples of synthetic images generated from random latent representations for a given class mask. The real imagery is shown for reference but is not used at inference time.
Downstream Tasks
To test whether our synthetic satellite imagery can serve as effective data augmentation, we design a series of experiments around a land cover segmentation task, in which a downstream model classifies each pixel of an image into one of 6 land cover labels.
We first explore the segmentation performance of models trained on synthetic imagery with different degrees of diversity, generated with different values of the diversity weight lambda.
lambda | mIoU | FID (1) | FID (2)
0 | 0.2894 | 72.29 | 73.31
2 | 0.3417 | 63.07 | 70.72
4 | 0.3827 | 61.70 | 61.38
6 | 0.4059 | 56.60 | 70.98
8 | 0.3572 | 58.09 | 63.46
10 | 0.3234 | 60.48 | 56.37
mIoU reflects the performance of the downstream model trained on images generated with diversity weight lambda. FID (1) is computed with synthetic test images generated without the trained encoder; FID (2) is computed with synthetic test images generated using the trained encoder.
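For reference, FID compares the statistics of Inception features extracted from real and synthetic images. A minimal sketch of the computation from precomputed feature matrices (the feature extractor is omitted; `real_feats` and `syn_feats` are assumed to be N x 2048 Inception activations):

```python
import numpy as np
from scipy import linalg

def frechet_inception_distance(real_feats, syn_feats):
    """FID between two sets of feature vectors (rows = samples):
    FID = ||mu_r - mu_s||^2 + Tr(C_r + C_s - 2 (C_r C_s)^(1/2))
    """
    mu_r, mu_s = real_feats.mean(axis=0), syn_feats.mean(axis=0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_s = np.cov(syn_feats, rowvar=False)
    covmean = linalg.sqrtm(cov_r @ cov_s)
    if np.iscomplexobj(covmean):  # numerical noise can leave tiny imaginary parts
        covmean = covmean.real
    diff = mu_r - mu_s
    return diff @ diff + np.trace(cov_r + cov_s - 2.0 * covmean)
```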
Synthetic tiles generated with lambda = 6 yield the highest mIoU score, 0.4059. We use this diversity value to generate the synthetic tiles for most of the following experiments, since it gives consistently better performance.
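Throughout, mIoU is the mean over classes of the per-class intersection-over-union. A small sketch of how the per-class IoU values reported in the tables here could be computed (the function names are ours):

```python
import numpy as np

def per_class_iou(pred, target, num_classes=6):
    """Per-class IoU for integer label maps `pred` and `target` of equal shape."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, target == c).sum()
        union = np.logical_or(pred == c, target == c).sum()
        ious.append(inter / union if union > 0 else float("nan"))
    return ious

def mean_iou(pred, target, num_classes=6):
    # nanmean skips classes absent from both prediction and ground truth.
    return np.nanmean(per_class_iou(pred, target, num_classes))
```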
To evaluate the usability of the synthetic images relative to the real ones, we trained four downstream segmentation models on the following datasets: 100% real, which contains 100 real satellite image tiles in a 3-channel RGB version; and 100% syn, 200% syn, and 300% syn, which contain 100, 200, and 300 synthetic RGB tiles respectively (we generate 3 different synthetic versions of each real tile by changing the random latent input during upstream generation). All synthetic tiles are generated with the upstream model using lambda = 6, and the downstream model randomly crops patches from the real and synthetic tiles during training.
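A hedged sketch of the random-crop training setup described above; the class name, patch size, and crops-per-tile count are our assumptions, not the repository's actual data pipeline.

```python
import torch
from torch.utils.data import Dataset

class RandomCropTiles(Dataset):
    """Serve random patches from (image tile, mask tile) pairs.

    `tiles` is a list of (image, mask) tensors: image (C, H, W) float,
    mask (H, W) long. The patch size of 256 is an assumption.
    """
    def __init__(self, tiles, patch_size=256, crops_per_tile=16):
        self.tiles = tiles
        self.patch = patch_size
        self.crops_per_tile = crops_per_tile

    def __len__(self):
        return len(self.tiles) * self.crops_per_tile

    def __getitem__(self, idx):
        image, mask = self.tiles[idx % len(self.tiles)]
        _, h, w = image.shape
        top = torch.randint(0, h - self.patch + 1, (1,)).item()
        left = torch.randint(0, w - self.patch + 1, (1,)).item()
        return (image[:, top:top + self.patch, left:left + self.patch],
                mask[top:top + self.patch, left:left + self.patch])
```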
The mIoU results below show that training on synthetic imagery alone, even in larger quantities, does not improve segmentation performance; in particular, performance on the water class drops significantly.
Training | Water | Forest | Low Vegetation | Barren Land | Impervious (other) | Impervious (road) | Mean
100% real | 0.6794 | 0.8386 | 0.7279 | 0.1205 | 0.5302 | 0.2443 | 0.5235
100% synthetic | 0.4001 | 0.7332 | 0.5642 | 0.0134 | 0.4085 | 0.3161 | 0.4059
200% synthetic | 0.5322 | 0.6956 | 0.5636 | 0.0125 | 0.3677 | 0.3288 | 0.4167
300% synthetic | 0.2432 | 0.7402 | 0.5479 | 0.0157 | 0.3316 | 0.2878 | 0.3611
100% synthetic (4-channel) | 0.9100 | 0.7476 | 0.7034 | 0.0177 | 0.4143 | 0.3097 | 0.5171
100% real (4-channel) | 0.9676 | 0.8532 | 0.8346 | 0.1456 | 0.5665 | 0.5137 | 0.6469
Since the NIR channel contains substantial information about water bodies, including it might improve segmentation performance, so we also trained two models on 100% real (4-channel) and 100% synthetic (4-channel) data. The results are shown above; please refer to our paper for more details. Even with only 10% of the data, the segmentation model trained on 4-channel synthetic images reaches performance comparable to the model trained on 3-channel real images.
Furthermore, we combined real and synthetic images in different proportions to explore empirically whether, and by how much, including synthetic satellite imagery is a better augmentation strategy for training downstream segmentation models. As shown in the figure below, the model trained on a dataset containing 50% synthetic images reaches a higher mIoU (0.5834), and hence better segmentation performance, than the model trained only on real images (0.5235).
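A hedged sketch of how such a fixed-proportion mix could be assembled; the helper name and API are ours, not the repository's.

```python
import random

def mix_tiles(real_tiles, synthetic_tiles, synthetic_fraction=0.5, total=100, seed=0):
    """Build a training set with a fixed fraction of synthetic tiles.

    `real_tiles` and `synthetic_tiles` are lists of (image, mask) pairs;
    a fraction of 0.5 with total=100 draws 50 tiles from each pool.
    """
    rng = random.Random(seed)
    n_syn = round(total * synthetic_fraction)
    n_real = total - n_syn
    mixed = rng.sample(real_tiles, n_real) + rng.sample(synthetic_tiles, n_syn)
    rng.shuffle(mixed)
    return mixed
```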
Our team
We are graduate students from the Harvard John A. Paulson School of Engineering and Applied Sciences. We thank Sarah Rathnam and Weiwei Pan for their help with coordination and communication.
We are proud to be working with the Microsoft AI for Good Lab on this project, and we thank Caleb Robinson, Simone Fobi Nsutezo, and Anthony Ortiz of the lab for their insightful advice. Their expertise in artificial intelligence and commitment to using technology for social good make them a perfect partner for us.
Sherry (Xinran) Tang: xinran_tang@g.harvard.edu
SM Student in Applied Computation
Mengyuan Li: mengyuan_li@g.harvard.edu
SM Student in Applied Computation
Chelsea (Zixi) Chen: zixichen@g.harvard.edu
SM Student in Applied Computation
Van Anh Le: vananhle@g.harvard.edu
SM Student in Applied Computation
Varshini Reddy: varshinibogolu@g.harvard.edu
SM Student in Applied Computation