This repository is an "unofficial" implementation of Detector Cleanse. The implementation is based on the algorithm introduced in the paper BadDet: Backdoor Attacks on Object Detection.
The clean feature images used in this repository are composed of bounding box regions from the VOC2007 dataset's test set, converted into images. You can either create your own folder of JPG images as clean feature images or use the prepared set, which contains approximately 15,000 images.
You can download the prepared clean feature images from the following link: (https://drive.google.com/file/d/1Qj_5s84HXdgDWG3KRn-zW7oIRGvrouer/view?usp=sharing)
Alternatively, you can use the following commands in your bash terminal to download and unzip the clean feature images:
pip install gdown
gdown https://drive.google.com/file/d/1Qj_5s84HXdgDWG3KRn-zW7oIRGvrouer/view?usp=sharing
unzip clean_feature_images.zip
Ensure Python is installed along with the necessary packages.
pip install pytorch
pip install visdom scikit-image tqdm fire ipdb pprint matplotlib torchnet
start visdom for visualization
nohup python -m visdom.server &
To run the Detector Cleanse, use the following command-line arguments:
--n
: Number of features to randomly select from the clean feature images.--m
: Detection mean for identifying poison.--delta
: Detection threshold for identifying poison.--image_path
: Path to the image that needs to be analyzed.--clean_feature_path
: Path to the clean feature image folder. Default is 'clean_feature_images'.--weight
: Path to the weight file of the model.
You can analyze either a single image or multiple images in a folder using the same command:
python main.py --n 100 --m 0.51 --delta 0.25 --image_path 'path/to/image_or_folder' --clean_feature_path 'path/to/clean_feature_images' --weight 'path/to/model/weight.pth'
- If
--image_path
is set to a single jpg image, the script will analyze that image and output whether it is poisoned or clean. - If
--image_path
is set to a folder, the script will analyze all jpg images in the folder, classifying them as 'modified' or 'clean'. It will then output the False Accept Rate (FAR) and False Reject Rate (FRR) for the analyzed images.
Since this implementation of Detector Cleanse is based on a specific repository (simple-faster-rcnn-pytorch), additional modifications might be necessary to test it with your own model. To accommodate different object detection models or different setups, you may need to make changes in the code.
For detailed instructions on how to customize this implementation for your specific needs, please refer to the Customization Guide.
We acknowledge the use of code from the simple-faster-rcnn-pytorch repository, specifically for defining the Faster R-CNN model. This part of the code is borrowed under the terms of the MIT License of the original repository. We have adapted and extended it for our purposes. See LICENSE in the original repository for more details.
We welcome contributions to this project! If you have suggestions for improvements or want to contribute code, please feel free to open an issue or a pull request.
This project is licensed under the MIT License - see the LICENSE file for details.
The image below it the Detector Cleanse algorithm as outlined in the BadDet Appendix.