ICLR Machine Learning for Remote Sensing Workshop, 2025 (Oral)
This repo contains the official code for training and generation for the paper "Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model".
Steve Andreas Immanuel | Woojin Cho | Junhyuk Heo | Darongsae Kwon.
We introduce an image-conditioned, diffusion-based approach to create a diverse set of novel-class samples for semantic segmentation in few-shot settings in the remote sensing domain. By enforcing semantic consistency via cosine similarity between the generated samples and the conditioning image, and by using the Segment Anything Model (SAM) to obtain precise segmentation masks, our method can train off-the-shelf segmentation models with high-quality synthetic data, significantly improving performance in low-data scenarios.
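For intuition, the consistency check can be sketched as follows (a minimal illustration, not the repo's actual code; the CLIP backbone matches the version field in the training config below, but the threshold and placeholder images are assumptions):

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# CLIP image encoder; the same model id appears as `version` in the yaml config.
model = CLIPModel.from_pretrained("openai/clip-vit-large-patch14")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-large-patch14")

def embed(image: Image.Image) -> torch.Tensor:
    """Return the CLIP image embedding for a single PIL image."""
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        return model.get_image_features(**inputs)

# Placeholder images; in practice these are a generated sample and its
# conditioning image.
generated = Image.new("RGB", (512, 512))
condition = Image.new("RGB", (512, 512))

# Keep the sample only if it stays semantically close to the conditioning
# image (the 0.7 threshold is illustrative, not the paper's value).
similarity = torch.nn.functional.cosine_similarity(embed(generated), embed(condition))
keep = similarity.item() > 0.7
```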
Create a new conda environment and activate it with the following commands:
```bash
conda env create -f environment.yaml
conda activate rspaint
```
All required checkpoints are available on Hugging Face. Download sd_inpaint_samrs_ep74.ckpt and remoteclip.pt and save them to the checkpoints directory. Optionally, if you also want to perform mask refinement, download the SAM checkpoint and save it to the checkpoints directory as well.
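For reference, mask refinement with SAM typically looks like this (a minimal sketch using the official segment-anything API; the checkpoint filename, model type, placeholder image, and box prompt are assumptions, not values taken from this repo):

```python
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

# Load the downloaded SAM checkpoint; the model type must match the file.
sam = sam_model_registry["vit_h"](checkpoint="checkpoints/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# `image` is an HxWx3 uint8 RGB array, e.g. a generated sample.
image = np.zeros((512, 512, 3), dtype=np.uint8)  # placeholder
predictor.set_image(image)

# Prompt SAM with the inpainting bounding box to get a precise mask.
box = np.array([100, 100, 400, 400])  # placeholder [x0, y0, x1, y1]
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
refined_mask = masks[0]  # boolean HxW array
```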
Note that the current version is not optimized. Generating a single sample at a resolution of 512x512 takes about 9 GB of GPU memory. Using the CPU is possible, but takes significantly longer.
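If you are unsure whether your GPU has enough headroom, a quick check like the following can decide the device up front (a minimal sketch; the 9 GB figure comes from the note above):

```python
import torch

def pick_device(required_gb: float = 9.0) -> str:
    """Use the GPU only if it reports enough free memory; otherwise fall back to CPU."""
    if torch.cuda.is_available():
        free_bytes, _ = torch.cuda.mem_get_info()
        if free_bytes / 1024**3 >= required_gb:
            return "cuda"
    return "cpu"  # works, but generation is significantly slower

print(pick_device())
```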
Follow the steps in the provided notebook notebooks/generate_samples.ipynb to generate samples using the trained model.
To train the model, you can use any dataset with bounding box annotations.
The data structure must be like this:
```
.
├── bbox
│   ├── train
│   │   ├── 00001.txt
│   │   ├── 00003.txt
│   │   ├── 00004.txt
│   │   ├── ...
│   └── validation
│       ├── 00023.txt
│       ├── 00024.txt
│       ├── 00025.txt
│       ├── ...
└── images
    ├── train
    │   ├── 00001.png
    │   ├── 00003.png
    │   ├── 00004.png
    │   ├── ...
    └── validation
        ├── 00023.png
        ├── 00024.png
        ├── 00025.png
        ├── ...
```
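Each .txt file under bbox holds the box annotations for the image of the same name. The exact format depends on how your dataloader parses it; as a purely hypothetical example, one box per line as a class id followed by pixel coordinates:

```
# 00001.txt (hypothetical format: class_id x_min y_min x_max y_max)
3 120 85 310 240
1 400 12 480 90
```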
Update the following fields in the yaml config file:
```yaml
data:
  target: main.DataModuleFromConfig
  params:
    batch_size: 8
    wrap: False
    train:
      target: ldm.data.remote_sensing.RemoteSensingDataset
      params:
        state: train
        dataset_dir: <root dataset directory>
        arbitrary_mask_percent: 0.5
        image_size: 512
        version: openai/clip-vit-large-patch14
        bbox_ratio_range: [0.1, 0.25]
    validation:
      target: ldm.data.remote_sensing.RemoteSensingDataset
      params:
        state: validation
        dataset_dir: <root dataset directory>
        arbitrary_mask_percent: 0.5
        image_size: 512
        version: openai/clip-vit-large-patch14
        bbox_ratio_range: [0.1, 0.25]
```
Optionally, you can write a custom dataloader and place it in the ldm/data directory, then change the target key to point to it (see the sketch below).
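A skeleton along these lines should be enough to start from (a sketch only; the constructor arguments mirror the yaml above, but the returned dictionary keys and tensor conventions are assumptions to be checked against ldm/data/remote_sensing.py):

```python
import os

import numpy as np
import torch
from PIL import Image
from torch.utils.data import Dataset


class MyRemoteSensingDataset(Dataset):
    """Hypothetical custom dataset; mirrors the constructor args in the yaml above."""

    def __init__(self, state, dataset_dir, image_size=512, **kwargs):
        self.image_size = image_size
        image_dir = os.path.join(dataset_dir, "images", state)
        self.paths = sorted(
            os.path.join(image_dir, f)
            for f in os.listdir(image_dir)
            if f.endswith(".png")
        )

    def __len__(self):
        return len(self.paths)

    def __getitem__(self, idx):
        image = Image.open(self.paths[idx]).convert("RGB")
        image = image.resize((self.image_size, self.image_size))
        # Scale pixels to [-1, 1], channel-first, as diffusion models usually expect.
        tensor = torch.from_numpy(np.array(image)).permute(2, 0, 1).float() / 127.5 - 1.0
        # The dictionary keys expected by the training code are an assumption here;
        # check ldm/data/remote_sensing.py for the actual ones.
        return {"image": tensor}
```

Assuming the file is saved as ldm/data/my_dataset.py, the target key would then be ldm.data.my_dataset.MyRemoteSensingDataset.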
To start training, use the following command:
```bash
python -u main.py \
    --logdir <log path> --pretrained_model checkpoints/sd_inpaint_samrs_ep74.ckpt \
    --base <yaml config path> --scale_lr False --seed 20250110
```
If you find this work useful, please cite:

```bibtex
@article{immanuel2025tackling,
  title={Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model},
  author={Immanuel, Steve Andreas and Cho, Woojin and Heo, Junhyuk and Kwon, Darongsae},
  journal={arXiv preprint arXiv:2503.03785},
  year={2025}
}
```
This code is mainly based on Paint by Example. We thank the authors for their great work.