Skip to content

webfoolery/python-pdf-image-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF Image Extractor

It's a short Python script to parse PDF files & export the images found inside in their original formats.

Setup

  1. Clone the repo
  2. Create the virtual environment - python3 -m venv env
  3. Start the virtual environment, ie: . env/Scripts/activate (or whatever your system needs to do so)
  4. pip install -r ./requirements.txt
  5. End the virtual environment, ie: deactivate

Usage

  1. Place a PDF file in the same directory as the script
    • Linux: Run_pdf-image.py
    • Windows: As above or run _get-pdf-image.bat
  2. Check in the same directory to find the image/s. Image file names are based upon the original file name with page number, image number & file type appended

CHANGELOG

2022-06-23

  • Adds command line arguments for --single or --multiple (default) images
  • Adds extra .bat file for running single image task

About

Extracts images from PDF files in original format

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors