Skip to content

kindo-tk/Image_Scrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Image Scraper Tool

Python Streamlit DuckDuckGo Requests Threading License

Overview

This is a robust and efficient Image Scraper built with Python and Streamlit. It leverages the DuckDuckGo Search API to fetch high-quality images without the rate limiting and complexity often associated with scraping Google Images directly.

The tool provides a clean, user-friendly interface to search, preview, and download images in bulk, ensuring a seamless experience for gathering datasets or visual assets.

Features

  • Reliable Backend: Uses ddgs (DuckDuckGo Search) for consistent result fetching.
  • High-Resolution Images: Automatically filters and attempts to download the best quality available.
  • Interactive UI: Clean Streamlit interface with a responsive grid layout.
  • Bulk Download: Download all scraped images as a single .zip file.
  • Concurrent Downloading: Uses multi-threading to speed up the download process.

Technologies Used

  • Python
  • Streamlit
  • Requests
  • ddgs (DuckDuckGo Search)

Installation

  1. Clone the Repository

    git clone https://github.com/kindo-tk/Image_Scrapper.git
  2. Navigate to the Project Directory

    cd Image_Scrapper
  3. Create a Virtual Environment

    python -m venv .venv
  4. Activate the Virtual Environment

    • For Windows
    .venv\Scripts\activate
    • For macOS/Linux
    source .venv/bin/activate
  5. Install Dependencies

    pip install -r requirements.txt

Usage

  1. Run the Application

    streamlit run app.py
  2. Workflow

    • Configure: Open the sidebar settings to choose the Target Image Count (1-50).
    • Search: Enter your keyword in the main search bar (e.g., "Cat").
    • Scrape: Click the Search button. The app will fetch URLs and download images in the background, showing a progress bar.
    • Preview: Once finished, images will be displayed in a responsive grid. Click "Load more" to see additional results if available.
    • Download: Click the Download ZIP button at the bottom to save the entire collection to your local machine.

Project Structure

Image_Scrapper/
├── app.py              # Main Streamlit application
├── scraper.py          # Logic for searching and downloading images
├── requirements.txt    # Project dependencies
├── .gitignore          # Files to exclude from Git
└── README.md           # Documentation

Contributing

Contributions are welcome! Please fork the repository and submit a pull request for any enhancements.


License

This project is licensed under the MIT License. See LICENSE for details.


Contact

For any inquiries or feedback, please contact:


Demo

Visit the live app: Click here

About

A fast, Streamlit-based image scraping tool that fetches high-quality images using DuckDuckGo, with concurrent downloads, live progress tracking, and bulk ZIP export.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages