ConvNeXt and ConvNeXtV2

This repository is about an implementation of the research paper "A ConvNet of the 2020s" and "ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders" using Tensorflow.

ConvNeXtV1 : ConvNeXt, a pure ConvNet model constructed entirely from standard ConvNet modules. ConvNeXts compete favorably with Transformers in terms of accuracy and scalability, achieving 87.8% ImageNet top-1 accuracy and outperforming Swin Transformers on COCO detection and ADE20K segmentation, while maintaining the simplicity and efficiency of standard ConvNets.

ConvNeXtV2: The paper proposed a fully convolutional masked autoencoder framework (FCMAE) and a new Global Response Normalization (GRN) layer to original ConvNeXtV1 model to enhance inter-channel feature competition. This co-design of self-supervised learning techniques and architectural improvement results in a new model family called ConvNeXt V2, which significantly improves the performance of pure ConvNets on various recognition benchmarks.