US Visa Approval Prediction MLOps Production Pipeline

Introduction

This project develops a machine learning model that predicts the approval status of US visa applications. An end-to-end MLOps pipeline covers model development, deployment, and monitoring, with the goal of a robust and scalable solution.

Project Structure

The repository is organized as follows:

.github/workflows/: Contains GitHub Actions workflows for CI/CD.
config/: Configuration files for the project.
notebook/: Jupyter notebooks for data exploration and analysis.
static/css/: CSS files for the web interface.
templates/: HTML templates for the web interface.
us_visa/: Core package containing modules for data ingestion, validation, transformation, and model training.
.dockerignore: Specifies files and directories to ignore in Docker builds.
.gitignore: Specifies files and directories to ignore in Git.
Dockerfile: Instructions to build the Docker image.
LICENSE: License information.
README.md: Project documentation.
app.py: Entry point for the web application.
demo.py: Script for demonstrating the model.
requirements.txt: Python dependencies.
setup.py: Setup script for the Python package.
template.py: Template script for various utilities.

Installation

To set up the project locally, follow these steps:

Clone the repository:

git clone https://github.com/vijaybalamahalingam/US-Visa-Approval-Prediction-MLOps-Production-Pipeline.git
cd US-Visa-Approval-Prediction-MLOps-Production-Pipeline

Create and activate a virtual environment:

python3 -m venv venv
source venv/bin/activate

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

To run the web application:

python app.py

Features

Data Ingestion: Fetches and stores visa application data.
Data Validation: Ensures data integrity and quality.
Data Transformation: Preprocesses data for model training.
Model Training: Trains machine learning models to predict visa approval.
Model Evaluation: Assesses model performance using various metrics.
Model Deployment: Deploys the model as a web service.
CI/CD Pipeline: Automates testing, building, and deployment processes.
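
The stages listed above run in sequence, each consuming the output of the previous one. The following is a minimal sketch of that flow using only the standard library. It is not the code in us_visa/: every function name, the `wage` and `approved` record fields, the toy records, and the threshold "model" are hypothetical stand-ins chosen for illustration.

```python
from statistics import mean


def ingest():
    """Stand-in for data ingestion: returns hard-coded toy records."""
    return [
        {"wage": 90000.0, "approved": 1},
        {"wage": 30000.0, "approved": 0},
        {"wage": 75000.0, "approved": 1},
        {"wage": None, "approved": 0},  # invalid on purpose: missing wage
        {"wage": 40000.0, "approved": 0},
    ]


def validate(records):
    """Keep only records with a numeric wage and a 0/1 label."""
    return [
        r for r in records
        if isinstance(r.get("wage"), (int, float)) and r.get("approved") in (0, 1)
    ]


def transform(records):
    """Turn records into (feature, label) pairs, scaling wage to thousands."""
    return [(r["wage"] / 1000.0, r["approved"]) for r in records]


def train(pairs):
    """Toy 'model': a threshold halfway between the two class means."""
    approved = [x for x, y in pairs if y == 1]
    denied = [x for x, y in pairs if y == 0]
    return (mean(approved) + mean(denied)) / 2.0


def evaluate(threshold, pairs):
    """Accuracy of predicting approval when the feature exceeds the threshold."""
    correct = sum((x > threshold) == bool(y) for x, y in pairs)
    return correct / len(pairs)


def run_pipeline():
    """Chain the stages: ingest -> validate -> transform -> train -> evaluate."""
    pairs = transform(validate(ingest()))
    threshold = train(pairs)
    # Evaluated on the training pairs for brevity; a real pipeline would
    # score the model on held-out data.
    return threshold, evaluate(threshold, pairs)
```

Keeping each stage a plain function with explicit inputs and outputs makes it straightforward to test stages in isolation and to swap the toy model for a real estimator.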

Dependencies

The project relies on the following key technologies:

Python: Core programming language for development.
Jupyter Notebook: For data exploration and analysis.
MongoDB: Database for storing application data.
Evidently AI: For monitoring and analyzing data drift, concept drift, and other model performance metrics.
FastAPI: Web framework for building the API.
Docker: Containerization platform for deployment.
AWS Services: Including S3 for storage and EC2 for hosting.
GitHub Actions: For continuous integration and deployment.

Configuration

Before running the application, ensure that the following environment variables are set:

MONGODB_URL: Connection string for MongoDB.
AWS_ACCESS_KEY_ID: AWS access key ID.
AWS_SECRET_ACCESS_KEY: AWS secret access key.
AWS_DEFAULT_REGION: AWS region.
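
A quick pre-flight check can catch a missing variable before the application starts. The sketch below is an illustration, not code from this repository: the helper name `missing_env_vars` is hypothetical, and only the four variable names come from the list above.

```python
import os

# Variable names taken from the Configuration list above.
REQUIRED_VARS = (
    "MONGODB_URL",
    "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY",
    "AWS_DEFAULT_REGION",
)


def missing_env_vars(environ=None):
    """Return the required variables that are unset or empty.

    Hypothetical helper for illustration; not part of the us_visa package.
    Pass a dict to check something other than the current process environment.
    """
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

One way to use it is to call `missing_env_vars()` before starting the server and exit with a message naming the missing variables if the returned list is not empty.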

Troubleshooting

If you encounter issues during installation or execution, consider the following steps:

Ensure all dependencies are installed correctly.
Verify that environment variables are set appropriately.
Check the configuration files for any missing or incorrect settings.
Consult the logs for error messages and stack traces.
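
The dependency check in the first step can be automated. The sketch below is an illustration rather than a script shipped with this repository. It extracts package names from requirements-style lines and reports any that are not installed in the active environment. It assumes simple lines such as `name` or `name==version`, skips pip options such as `-e .`, and does not handle URL-based requirements; both function names are hypothetical.

```python
import re
from importlib import metadata


def requirement_names(lines):
    """Extract distribution names from requirements-style lines.

    Assumes simple specifiers like "pandas" or "pandas==2.0". Comments,
    blank lines, and pip options (lines starting with "-") are skipped.
    """
    names = []
    for line in lines:
        line = line.split("#", 1)[0].strip()
        if not line or line.startswith("-"):
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names that importlib.metadata cannot find as installed."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing
```

For example, opening requirements.txt and passing the file object to `requirement_names`, then passing the result to `missing_packages`, gives the list of packages that still need to be installed. The check only confirms presence, not that the installed version satisfies the pinned one.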

Contributors

Vijay Bala Mahalingam

License

This project is licensed under the MIT License. See the LICENSE file for more details.
