- Obtain images as .jpg files and masks as .png files
- The masks must be single-channel masks with the 3rd channel corresponding to class ID. If the masks are multichannel, run ./make_single_channel.py after specifying each color and its corresponding class ID in main
- Arrange the dataset as:
  - dataset/
    - imgs/
    - masks/
- Create a folder Segment_config/ in dataset/ and create 4 files: test.txt, trainval.txt, train.txt and val.txt
- Set the path to dataset/ in ./voc_annotation.py and ensure it runs and populates the .txt files. You can also set the train:val:test split in this file before running it
- Create the ./logs directory
- Set the path to dataset/ in train.py
- Check unet.py and train.py to set the configurations
- Set num_classes correctly, and make sure each class ID corresponds to the right color in the single-channel mask (you need to view the mask in an image viewer for this)
- Run train.py and monitor training
- To start TensorBoard, run the following in the root directory:
  tensorboard --logdir logs/
- In predict.py, set the "mode" accordingly. "predict" is the default mode
- In unet.py, set model_path, num_classes, input_shape and mix_type as needed
- Run predict.py
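
The mask preparation steps above (converting color masks to class IDs and choosing num_classes) can be checked with a short script. The following is a minimal sketch, not the contents of ./make_single_channel.py: the COLOR_TO_CLASS mapping and both function names are hypothetical, and it assumes each class is drawn in one exact RGB color and that the training code expects each pixel value to be a class ID.

```python
import numpy as np

# Hypothetical color-to-class mapping; replace with the colors used in your masks.
COLOR_TO_CLASS = {
    (0, 0, 0): 0,      # background
    (255, 0, 0): 1,    # example class drawn in red
    (0, 255, 0): 2,    # example class drawn in green
}


def color_mask_to_class_ids(rgb_mask, color_to_class):
    """Map an (H, W, 3) color mask to an (H, W) uint8 mask of class IDs."""
    rgb_mask = np.asarray(rgb_mask)
    if rgb_mask.ndim != 3 or rgb_mask.shape[2] < 3:
        raise ValueError("expected an (H, W, 3) color mask")
    rgb = rgb_mask[..., :3]  # ignore an alpha channel if present
    class_mask = np.zeros(rgb.shape[:2], dtype=np.uint8)
    matched = np.zeros(rgb.shape[:2], dtype=bool)
    for color, class_id in color_to_class.items():
        # True where all three channels equal this color
        hit = np.all(rgb == np.array(color), axis=-1)
        class_mask[hit] = class_id
        matched |= hit
    if not matched.all():
        raise ValueError("mask contains colors missing from the mapping")
    return class_mask


def class_ids_in_mask(class_mask):
    """Return the sorted class IDs present in a single-channel mask."""
    return [int(v) for v in np.unique(class_mask)]


if __name__ == "__main__":
    # Tiny 2x2 color mask standing in for a real .png file.
    demo = np.array(
        [[[0, 0, 0], [255, 0, 0]],
         [[0, 255, 0], [255, 0, 0]]],
        dtype=np.uint8,
    )
    ids = color_mask_to_class_ids(demo, COLOR_TO_CLASS)
    print(ids.tolist())
    print(class_ids_in_mask(ids))
```

To apply this to real files, a mask can be loaded with Pillow as np.array(Image.open(path).convert("RGB")) and the result saved with Image.fromarray(class_mask).save(out_path). If the IDs are contiguous and start at 0, num_classes would typically be the largest ID found across all masks plus one, but confirm this against the settings in unet.py and train.py.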