This repository contains the code for the paper "COBRA: Contrastive Bi-Modal Representation Algorithm" (arXiv) by Vishaal Udandarao, Abhishek Maiti, Deepak Srivatsav, Suryatej Reddy, Yifang Yin and Rajiv Ratn Shah.
We present COBRA, a novel framework that trains two modalities (image and text) jointly, inspired by the Contrastive Predictive Coding (CPC) and Noise Contrastive Estimation (NCE) paradigms, preserving both inter- and intra-class relationships. We empirically show that this framework reduces the modality gap significantly and generates a robust, task-agnostic joint embedding space. We outperform existing work on four diverse downstream tasks spanning seven benchmark cross-modal datasets.
A visualisation of the loss function:
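For intuition, here is a minimal PyTorch sketch of an NCE-style cross-modal contrastive objective of the kind COBRA builds on. This is an illustrative approximation, not the paper's exact loss: COBRA's formulation also involves anchor points and a configurable number of negative samples (see the `num_anchors` and `num_negative_samples` options below), which this sketch replaces with simple in-batch negatives.

```python
import torch
import torch.nn.functional as F

def cross_modal_nce_loss(img_emb, txt_emb, temperature=0.1):
    """Symmetric InfoNCE-style loss over a batch of paired image/text
    embeddings, each of shape (batch_size, embed_dim). Matched pairs on
    the diagonal are positives; all other in-batch pairs act as negatives."""
    img = F.normalize(img_emb, dim=-1)
    txt = F.normalize(txt_emb, dim=-1)
    logits = img @ txt.t() / temperature                 # (batch, batch) cosine similarities
    targets = torch.arange(img.size(0), device=img.device)
    loss_i2t = F.cross_entropy(logits, targets)          # image -> text direction
    loss_t2i = F.cross_entropy(logits.t(), targets)      # text -> image direction
    return 0.5 * (loss_i2t + loss_t2i)
```

Minimizing a loss of this form pulls each matched image/text pair together in the joint embedding space while pushing mismatched pairs apart, which is what reduces the modality gap.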
The seven datasets used to empirically validate our results are:
- PKU-XMedia
- MS-COCO
- NUS-Wide 10k
- Wikipedia
- FakeNewsNet
- MeTooMA
- CrisisMMD
The code has been tested on Python 3.6.8 and PyTorch 1.5.1.
- Install all the dependencies using the following command: `pip install -r requirements.txt`
- Create a folder `features` to save the trained models.
- To train COBRA, use the following command: `python main.py`
- To switch between the NCE contrastive loss and the softmax contrastive loss, change the `use_nce` flag. To change the number of anchor points and the number of negative samples, modify `num_anchors` and `num_negative_samples` respectively (see the illustrative snippet below).
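The README does not specify whether these are command-line options or in-file variables, so the snippet below is only a hedged illustration that assumes they are module-level configuration values in `main.py`; the names come from this README, and the values are purely illustrative.

```python
# Hypothetical configuration block; locate the actual definitions in
# main.py before editing. The values below are illustrative only.
use_nce = True              # True: NCE contrastive loss; False: softmax contrastive loss
num_anchors = 10            # number of anchor points used by the loss
num_negative_samples = 15   # number of negative samples drawn per positive
```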
In case of any queries, please open an issue. We will respond as soon as possible.