Cross Encoder - Zero Shot Classification

Our model is a Multilabel Classification Model designed to classify texts into various predefined categories using a bidirectional transformer encoder (BERT-like). By leveraging the cross encoder architecture, the model effectively considers the interaction between the text and the labels. This approach demonstrates superior performance compared to the previous bi-encoder architecture while maintaining efficient inference times. Consequently, it offers a practical alternative to Large Language Models (LLMs), which, despite their versatility, are often too costly and large for resource-constrained environments.

🤗 Available pretrained model

Usage

from model import BiEncoderModel

texts = [
    "The wildlife conservation program is focused on protecting endangered species in Africa.",
    "The government announced a new initiative to combat poverty in rural areas.",
    "A celebrity chef has opened a new restaurant specializing in vegan cuisine."
    ]
batch_labels = [

        ["Conservation", "Business", "Animals", "Africa"],
        ["Politics", "Social Issues", "Economy", "Technolgy"],
        ["Food", "Business","Politics", "Vegan"]

    ]
# Load the model
model = CrossEncoderModel("sabdou/cross-encoder-model", max_num_labels=6)
# Prediction with JSON output
predictions = model.forward_predict(texts, batch_labels)
print("Predictions:", predictions)

Expected Output

Predictions: [
    {'text': 'The wildlife conservation program is focused on protecting endangered species in Africa.',
  'scores': {'Conservation': 1.0,
             'Business': 0.0,
             'Animals': 1.0,
             'Africa': 0.99}},

 {'text': 'The government announced a new initiative to combat poverty in rural areas.',
  'scores': {'Politics': 1.0,
             'Social Issues': 0.99,
             'Economy': 1.0,
             'Technolgy': 0.0}},

 {'text': 'A celebrity chef has opened a new restaurant specializing in vegan cuisine.', 
 'scores': {'Food': 1.0,
            'Business': 1.0, 
            'Politics': 0.0, 
            'Vegan': 1.0}}]

Usecase

This model can be used in

Augmented user experience : To detect important informations in customer reviews.
Content moderation: Automatically flagging inappropriate or harmful content in social media posts.
News categorization: Classifying news articles into predefined categories for better organization and retrieval.
Sentiment analysis: Identifying the sentiment of customer feedback to improve products and services.
Document tagging: Automatically tagging documents with relevant keywords for easier search and retrieval.
Chatbot enhancement: Improving chatbot responses by understanding the context and intent of user queries.

Data

Synthetic data generated with gpt4-mini and gemini

Training Results

Training was conducted using two encoders: BERT base and RoBERTa. Results are available in \training results

Author 🧑‍💻

Salim ABDOU DAOURA

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
training results		training results
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
dataset.py		dataset.py
model.py		model.py
training.py		training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cross Encoder - Zero Shot Classification

Usage

Expected Output

Usecase

Data

Training Results

Author 🧑‍💻

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Cross Encoder - Zero Shot Classification

Usage

Expected Output

Usecase

Data

Training Results

Author 🧑‍💻

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages