GitHub - sustainlab-group/geography-aware-ssl: Official Repository for ICCV 2021 Paper Titled as "Geography-Aware Self-Supervised Learning"

Geography-Aware Self-Supervised Learning (ICCV 2021)

Kumar Ayush^*, Burak Uzkent^*, Chenlin Meng^*, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon.
Stanford University
In ICCV, 2021.

This is a PyTorch implementation of Geography-Aware Self-Supervised Learning. We use the the official implementation of MoCo-v2 for developing our methods.

fMoW Dataset

Functional Map of the Dataset can be downloaded from their website/repo. You can create csvs similar to the ones in the csvs/ folder.

Map showing distribution of the fMoW dataset.

Preparation

Install PyTorch and download the fMoW dataset.

Self-Supervised Training

Similar to official implementation of MoCo-v2, this implementation only supports multi-gpu, DistributedDataParallel training, which is faster and simpler; single-gpu or DataParallel training is not supported.

To do self-supervised pre-training of a ResNet-50 model on fmow using our MoCo-v2+Geo+TP model in an 4-gpu machine, run:

python moco_fmow/main_moco_geo+tp.py \ 
    -a resnet50 \
    --lr 0.03 \
    --dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
    --loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
    --save-dir ${PT_DIR} \
    --data fmow

To do self-supervised pre-training of a ResNet-50 model on fmow using our MoCo-v2+TP model in an 4-gpu machine, run:

python moco_fmow/main_moco_tp.py \ 
    -a resnet50 \
    --lr 0.03 \
    --dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
    --loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
    --save-dir ${PT_DIR} \
    --data fmow

Linear Classification

With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 4-gpu machine, run:

python moco_fmow/main_lincls.py \
    -a resnet50 \
    --lr 1 \
    --dist-url 'tcp://localhost:14653' --multiprocessing-distributed --world-size 1 --rank 0 -j 4 \
    --pretrained=${PT_DIR} \
    --save-dir ${PTDIR}/lincls \
    --data fmow --batch-size 256

Models

Our pre-trained ResNet-50 models can be downloaded as following:

	epochs	model
MoCo-v2	200	download
MoCo-v2-Geo	200	download
MoCo-v2-TP	200	download
MoCo-v2+Geo+TP	200	download

GeoImageNet

Download the GeoImageNet - The instructions to download GeoImageNet set are given here. Using this repository, we can download in the order of 2M images together with their coordinates. In the paper, we use 540k images for the GeoImageNet. The download process should download the images into their representative class folder. We recommend parallelizing the download process for efficiency.

Clustering - Once, we download the GeoImageNet dataset, we can use a clustering algorithm to cluster the images using their geo-coordinates. In the paper, we use K-means clustering to cluster 540k images into 100 clusters, however, any clustering algorithm can be used. After K-means clustering, we need to create a csv file similar to ones in the ./csvs/ folder.

Perform Self-Supervised Learning - After downloading the GeoImageNet and clustering the images, we can perform self-supervised learning. To do it, you can execute the following command :

python moco_fmow/main_moco_geo+tp.py \ 
    -a resnet50 \
    --lr 0.03 \
    --dist-url 'tcp://localhost:14653' --multiprocessing-distributed --moco-t 0.02 --world-size 1 --rank 0 --mlp -j 4 \
    --loss cpc --epochs 200 --batch-size 256 --moco-dim 128 --aug-plus --cos \
    --save-dir ${PT_DIR}

Linear Classification - After learning the representations with MoCo-v2-geo, we can train the linear layer to classify GeoImageNet images. With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 4-gpu machine, run:

python moco_fmow/main_lincls.py \
    -a resnet50 \
    --lr 1 \
    --dist-url 'tcp://localhost:14653' --multiprocessing-distributed --world-size 1 --rank 0 -j 4 \
    --pretrained=${PT_DIR} \
    --save-dir ${PTDIR}/lincls \
    --batch-size 256

Transfer Learning Experiments

We use Retina-Net implementation from this repository for object detection experiments on xView. We use PSANet implementation from this repository for semantic segmentation experiments on SpaceNet.

Citing

If you find our work useful, please consider citing:

@article{ayush2021geography,
      title={Geography-Aware Self-Supervised Learning},
      author={Ayush, Kumar and Uzkent, Burak and Meng, Chenlin and Tanmay, Kumar and Burke, Marshall and Lobell, David and Ermon, Stefano},
      journal={ICCV},
      year={2021}
    }

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github/images		.github/images
csvs		csvs
geoimagenet_downloader		geoimagenet_downloader
moco_fmow		moco_fmow
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/images

.github/images

csvs

csvs

geoimagenet_downloader

geoimagenet_downloader

moco_fmow

moco_fmow

.DS_Store

.DS_Store

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Geography-Aware Self-Supervised Learning (ICCV 2021)

fMoW Dataset

Preparation

Self-Supervised Training

Linear Classification

Models

GeoImageNet

Transfer Learning Experiments

Citing

About

Releases

Packages

Contributors 2

Languages

sustainlab-group/geography-aware-ssl

Folders and files

Latest commit

History

Repository files navigation

Geography-Aware Self-Supervised Learning (ICCV 2021)

fMoW Dataset

Preparation

Self-Supervised Training

Linear Classification

Models

GeoImageNet

Transfer Learning Experiments

Citing

About

Resources

Stars

Watchers

Forks

Languages