vit-gpt2

Here are 9 public repositories matching this topic...

Ecolash / Image-Captioning-Model

Image Captioning with Occlusion Analysis | ViT-GPT2 | SmolVLM | BERT

image-captioning vision-transformer bert-classification image-caption-generation vit-gpt2 smolvlm occlusion-analysis

Updated May 3, 2025
Jupyter Notebook

ChaituRajSagar / video_to_narrative

Flask-based AI app that summarizes surveillance videos using Whisper (audio), ViT-GPT2 (frame captions), and Groq LLM (narratives). Produces both general and law enforcement-style summaries.

python opencv flask ffmpeg image-captioning whisper groq llm openai-whisper generative-ai vit-gpt2 video-summary law-enforcement-ai surveillance-ai bodycam-analysis

Updated Jul 14, 2025
Python

Divy005 / image_caption_generator

AI-powered image captioning using InceptionV3+LSTM and ViT-GPT2 models. Trained on the Flickr8k dataset, with an interactive Streamlit interface.

image-captioning nlp-machine-learning keras-tensorflow depplearning computer-vison flicker8k-dataset streamlit-webapp vit-gpt2

Updated Jan 7, 2026
Jupyter Notebook

PrachiPatel15 / Multimodal-Visual-AI-Chatbot

A powerful Streamlit application that analyzes images using multiple vision models and responds to queries about visual content through conversational AI.

blip conversational-ai multimodal-large-language-models vit-gpt2

Updated Feb 26, 2025
Python

mo-the-creator / Detect-and-Describe

image-captioning object-detection yolov5 vit-gpt2

Updated Oct 29, 2024
HTML

PrachiPatel15 / AI-Image-Captioning

An AI-powered image captioning app built with Streamlit, using ViT-GPT2 for caption generation and YOLOv8 for object detection. The app provides enhanced captions by integrating detected objects into the generated text.

computer-vision image-processing transformers streamlit yolov8 vit-gpt2

Updated Feb 21, 2025
Python
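
The AI-Image-Captioning description above mentions integrating detected objects into the generated text. The repository's actual method is not shown on this page, so the following is only a minimal sketch of one way such a merge could work, assuming the caption (e.g. from ViT-GPT2) and the detector labels (e.g. from YOLOv8) are already available as plain strings. The function name `enhance_caption` and the "Detected objects:" suffix are hypothetical.

```python
import re


def enhance_caption(caption: str, detected_labels: list[str]) -> str:
    """Append detector labels that the caption does not already mention.

    Hypothetical helper: not taken from the repository.
    """
    caption_clean = caption.strip().rstrip(".")
    caption_lower = caption_clean.lower()

    missing: list[str] = []
    for label in detected_labels:
        key = label.strip().lower()
        # Whole-word match so that "car" does not match "carpet".
        already_mentioned = re.search(rf"\b{re.escape(key)}\b", caption_lower)
        if key and not already_mentioned and key not in missing:
            missing.append(key)

    if not missing:
        return caption_clean + "."
    return f"{caption_clean}. Detected objects: {', '.join(missing)}."


if __name__ == "__main__":
    print(enhance_caption("a man riding a bike.", ["person", "bike", "Dog", "dog"]))
```

Appending the labels as a suffix leaves the model's caption untouched, which keeps the sketch simple; rewriting the sentence itself would need a language model or templates.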

ramyacp14 / Image-Caption-Generator

Developed an image captioning system using the BLIP model to generate detailed, context-aware captions. Achieved an average BLEU score of 0.72, providing rich descriptions that enhance accessibility and inclusivity.

machine-learning tensorflow imagenet blip coco-dataset cnn-rnn vision-transformer vit-gpt2

Updated Sep 6, 2024
Jupyter Notebook
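
The Image-Caption-Generator description above reports an average BLEU score. The page does not say which BLEU variant was used (n-gram order, smoothing, sentence or corpus level), so the following is only a simplified sketch of sentence-level BLEU with clipped n-gram precision and a brevity penalty, assuming a single reference, whitespace tokenization, and no smoothing. The function names and the `max_n=2` default are assumptions for illustration.

```python
import math
from collections import Counter


def ngram_counts(tokens: list[str], n: int) -> Counter:
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(candidate: str, reference: str, max_n: int = 2) -> float:
    """Unsmoothed sentence-level BLEU against one reference (illustrative only)."""
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngram_counts(cand, n)
        ref_counts = ngram_counts(ref, n)
        # Clipped overlap: each candidate n-gram counts at most as often
        # as it appears in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if total == 0 or overlap == 0:
            return 0.0  # no smoothing: any zero precision gives a zero score
        log_precisions.append(math.log(overlap / total))

    # Brevity penalty punishes candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    print(simple_bleu("a dog runs on grass", "a dog runs on the grass"))
```

An average score like the one reported would then be the mean of such per-caption scores over an evaluation set, though established implementations with smoothing are usually preferred for published numbers.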

armanjscript / Argonz-Image-Captioning-Extension

A Chrome extension that takes input images and generates captions for them.

nodejs chrome-extension webpack postcss image-captioning tailwindcss image-caption-generator xenova-transformers vit-gpt2

Updated Dec 5, 2024
JavaScript

Sana-Salmo / NLP-Smart-Glasses-Image-Captioning-TTS

NLP and Computer Vision prototype for smart-glasses visual assistance using ViT-GPT2 image captioning and text-to-speech.

nlp text-to-speech computer-vision pytorch image-captioning gradio assistive-technology gtts smart-glasses huggingface flickr8k vit-gpt2

Updated May 21, 2026
Jupyter Notebook
