Image Caption Generator with Transformers & Streamlit

Overview

This project is a modern and efficient Image Caption Generator built as an interactive web application using Streamlit. It leverages a state-of-the-art, pre-trained model from the Hugging Face Transformers library to automatically generate descriptive captions for any uploaded image.

How It Works

The application uses a powerful image-to-text pipeline powered by the ydshieh/vit-gpt2-coco-en model. This model combines a Vision Transformer (ViT) to understand the visual content of the image and a GPT-2 language model to generate a coherent, human-like caption.

The entire application is wrapped in a user-friendly interface created with Streamlit, allowing users to easily upload an image and view the generated caption in real-time.

Key Technologies

Streamlit: For building the interactive web UI.
Hugging Face Transformers: For accessing the pre-trained ViT-GPT2 model.
Pillow (PIL): For image processing.
PyTorch: As the backend framework for the model.

Setup and Usage

Install the required libraries:

pip install streamlit transformers torch Pillow

Save the code as a Python file (e.g., app.py).
Run the application from your terminal:
```
streamlit run app.py
```
Upload an image through the web interface to see the result.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitattributes		.gitattributes
Image Caption Generator.ipynb		Image Caption Generator.ipynb
README.md		README.md
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Caption Generator with Transformers & Streamlit

Overview

How It Works

Key Technologies

Setup and Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image Caption Generator with Transformers & Streamlit

Overview

How It Works

Key Technologies

Setup and Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages