SRGAN

Based on the paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network".

Overview

The Super-Resolution Generative Adversarial Network (SRGAN) is a deep learning model that can generate high-resolution images from low-resolution inputs. The model is trained using a combination of adversarial and content loss functions, which help to produce photo-realistic images with enhanced details.

Architecture

The SRGAN architecture consists of two main components: a generator and a discriminator. The generator produces high-resolution images from low-resolution inputs, while the discriminator is trained to distinguish real high-resolution images from generated ones. The generator is trained with a combination of adversarial and content losses: the content loss keeps the output close to the ground-truth image, and the adversarial loss rewards outputs that the discriminator judges to be real.
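The way the two losses combine can be sketched in plain Python. This is only an illustration under stated assumptions, not the code from this repository: pixel-wise MSE stands in for the content loss (the paper also describes a VGG feature loss), the weight of 1e-3 on the adversarial term follows the original paper, and the function names are hypothetical. In a real training loop these would be tensor operations.

```python
import math


def mse(pred, target):
    """Mean squared error between two equal-length sequences of floats."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def adversarial_loss(d_fake_probs, eps=1e-12):
    """Generator-side adversarial term: mean of -log D(G(x)).

    d_fake_probs are the discriminator's probabilities that generated
    images are real; eps guards against log(0).
    """
    return -sum(math.log(max(p, eps)) for p in d_fake_probs) / len(d_fake_probs)


def generator_loss(sr, hr, d_fake_probs, adv_weight=1e-3):
    """Content loss plus weighted adversarial loss.

    adv_weight=1e-3 is the value from the SRGAN paper; the weight used
    in this repository is an assumption here.
    """
    return mse(sr, hr) + adv_weight * adversarial_loss(d_fake_probs)


def discriminator_loss(d_real_probs, d_fake_probs, eps=1e-12):
    """Binary cross-entropy: real images labelled 1, generated images 0."""
    real_term = -sum(math.log(max(p, eps)) for p in d_real_probs) / len(d_real_probs)
    fake_term = -sum(math.log(max(1.0 - p, eps)) for p in d_fake_probs) / len(d_fake_probs)
    return real_term + fake_term
```

With a small adversarial weight, the content term dominates early on, so the generator first learns to reconstruct the image and only then to sharpen textures.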

Dataset

The model is trained on the DIV2K dataset, whose training set contains 800 high-resolution images of various scenes and objects. The dataset provides separate training and validation sets, which are used to train the model and evaluate its performance.

You can also take a look at the Flickr2K dataset, which contains 2650 images at 2K resolution.

Training

The model is trained for 200 epochs using the Adam optimizer with a learning rate of 1e-4 and a batch size of 16. The training process consists of two main stages: pre-training and fine-tuning. During the pre-training stage, the generator is trained using the content loss function, while the discriminator is frozen. In the fine-tuning stage, the generator and discriminator are trained simultaneously using a combination of adversarial and content loss functions.
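The two-stage schedule can be sketched as follows. The epoch count, learning rate, and batch size come from the paragraph above; everything else is an assumption made for illustration, in particular `pretrain_epochs` (the README does not say where the stages split), the reading that the 200 epochs cover both stages, and the names `TrainConfig`, `stage_for_epoch`, and `active_losses`. The config file holds the real values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    epochs: int = 200            # from the README
    learning_rate: float = 1e-4  # from the README (Adam)
    batch_size: int = 16         # from the README
    pretrain_epochs: int = 50    # hypothetical: the split is not stated


def stage_for_epoch(epoch, cfg):
    """Return the training stage for a 0-based epoch index."""
    if not 0 <= epoch < cfg.epochs:
        raise ValueError(f"epoch must be in [0, {cfg.epochs})")
    return "pretrain" if epoch < cfg.pretrain_epochs else "finetune"


def active_losses(stage):
    """List which losses update which network in each stage."""
    if stage == "pretrain":
        # Generator learns from content loss only; discriminator is frozen.
        return {"generator": ["content"], "discriminator": []}
    if stage == "finetune":
        # Both networks are updated in every step.
        return {
            "generator": ["content", "adversarial"],
            "discriminator": ["adversarial"],
        }
    raise ValueError(f"unknown stage: {stage!r}")
```

Pre-training the generator on content loss alone gives the adversarial stage a reasonable starting point, which tends to make GAN training more stable.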

You can find all the training details in the config file.

Download the pre-trained model from here

Results

The model achieves good performance on the DIV2K dataset, producing high-quality images with enhanced details and sharpness. Because of limited resources, I trained the model for only 200 epochs, so the results are not as good as those reported in the original paper.

All training and results were produced on an NVIDIA GTX 1660 Ti GPU.

Input:

Output:

Usage

Install the required packages with the following command:

pip install -r requirements.txt

Training

Before training the model, I recommend reading the trainingDetails file to understand the training details and hyperparameters.

Inference

To test the model, use the inference script, which loads the pre-trained model and generates high-resolution images from low-resolution inputs. Every image in the inputs folder is passed through the model, and the generated images are saved in the outputs folder by default.

python inference.py --model_path model.pth --image_path inputs/ --save_path outputs/ --device cuda

--model_path is the path to the pre-trained model.

--image_path is the path to the input images.

--save_path is the path to save the generated images.

--device can be either 'cuda' or 'cpu' depending on the available resources.
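For readers who want to see how these flags might be wired up, here is a minimal sketch using the standard library. It is not the actual contents of inference.py: the defaults for `--image_path` and `--save_path` are inferred from the description above, while the required `--model_path`, the default device, the accepted image suffixes, and the helper names `build_parser` and `plan_outputs` are assumptions.

```python
import argparse
from pathlib import Path

# Assumed set of accepted image formats; the real script may differ.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".bmp"}


def build_parser():
    """Build a parser for the four flags described above."""
    parser = argparse.ArgumentParser(
        description="Upscale images with a pre-trained SRGAN generator."
    )
    parser.add_argument("--model_path", required=True,
                        help="path to the pre-trained model")
    parser.add_argument("--image_path", default="inputs/",
                        help="folder with the low-resolution input images")
    parser.add_argument("--save_path", default="outputs/",
                        help="folder where generated images are saved")
    parser.add_argument("--device", choices=["cuda", "cpu"], default="cuda",
                        help="device to run the model on")
    return parser


def plan_outputs(image_dir, save_dir):
    """Pair each input image with the path its upscaled copy would get."""
    image_dir, save_dir = Path(image_dir), Path(save_dir)
    return [
        (src, save_dir / src.name)
        for src in sorted(image_dir.iterdir())
        if src.is_file() and src.suffix.lower() in IMAGE_SUFFIXES
    ]
```

Restricting `--device` with `choices` makes a typo such as `--device gpu` fail at argument parsing instead of later, when the model is loaded.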

TODO

Train the model for more epochs to achieve better results.
Implement the ESRGAN model and compare its results with SRGAN.
