Zero-shot classification is a transfer learning method that uses a pre-trained language model to predict a class not seen during training. This project uses a new promising zero-shot text classification model ModernBERT-large-zeroshot-v2.0 from the Hugging Face model hub to classify text into different classes. Models in this series are designed for efficient zero-shot classification with the Hugging Face pipeline. These models can classify without training data and run on both GPUs and CPUs.
The project is structured as follows:
-
src\
: The folder that contains the source code for the project.-
modernbert\classifier.py
: The file that contains the function for classifying text into different classes using the ModernBERT zero-shot text classification model. -
config.py
: This file contains the configuration for the used model. -
logger.py
: This file contains the project's logging configuration. -
exception.py
: This file contains the exception handling for the project.
-
-
app.py
: This file contains the code for the Gradio application. -
requirements.txt
: This file contains the required dependencies for the project. -
LICENSE
: This file contains the license information for the project. -
README.md
: This file contains the information about the project. -
assets
: This folder contains the screenshots for working on the application.
- Python (for the programming language)
- PyTorch (for the deep learning framework)
- Hugging Face Transformers Library (for the zero-shot text classification model)
- Gradio (for the web application)
- Hugging Face Spaces (for hosting the gradio application)
To get started with this project, follow the steps below:
- Clone the repository:
git clone https://github.com/sitamgithub-MSIT/ClassyText.git
- Change the directory:
cd ClassyText
- Create a virtual environment:
python -m venv tutorial-env
- Activate the virtual environment:
tutorial-env\Scripts\activate
- Install the required dependencies:
pip install -r requirements.txt
- Run the Gradio application:
python app.py
Now, open up your local host and you should see the web application running. For more information, refer to the Gradio documentation here. Also, a live version of the application can be found here.
Once the application is up and running, you can interact with the interface to classify text into different classes of your choice. The model used in this project is a zero-shot text classification model, meaning it can classify text it has not seen during training. This is a powerful feature that can be used in a variety of applications, such as content moderation, sentiment analysis of new product reviews, topic classification of news articles, identifying the intent of customer support queries, and classifying social media posts based on their subject matter; essentially any situation where you want to quickly categorize text into predefined classes without a large labeled training dataset for each class.
Contributions are welcome! If you would like to contribute to this project, please raise an issue to discuss the changes you would like to make. Once the changes are approved, you can create a pull request.
This project is licensed under the MIT License.
If you have any questions or suggestions regarding the project, feel free to reach out to me on my GitHub profile.
Happy coding! 🚀