PDF utilities using PyPDF2 for extracting text, merging PDFs, splitting by page range, and reading metadata.
pip install -r requirements.txt
python pdf_tools.py
- PyPDF2.PdfReader for reading PDFs
- PyPDF2.PdfWriter for creating PDFs
- Text extraction from pages
- PDF merging and splitting
- Metadata extraction
You will learn how to programmatically manipulate PDF documents and build document processing automation.
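The operations listed above could be sketched roughly as follows. This is a minimal illustration, not the contents of pdf_tools.py: the function names (parse_page_range, extract_text, merge_pdfs, split_pdf) and the "3-5" range syntax are assumptions made for this example. It relies only on PdfReader, PdfWriter, add_page, write, and extract_text, and assumes a PyPDF2 release that provides add_page (2.x or later).

```python
def parse_page_range(spec: str, total_pages: int) -> list[int]:
    """Turn a 1-based inclusive range such as "3-5" into 0-based page indices.

    The "start-end" syntax is an assumption for this sketch; a single
    number such as "7" selects just that page.
    """
    start_text, _, end_text = spec.partition("-")
    start = int(start_text)
    end = int(end_text) if end_text else start
    if not 1 <= start <= end <= total_pages:
        raise ValueError(f"page range {spec!r} is outside 1-{total_pages}")
    return list(range(start - 1, end))


def extract_text(path: str) -> str:
    """Return the text of every page, joined with newlines."""
    # Imported inside the function so parse_page_range stays usable
    # even where PyPDF2 is not installed.
    from PyPDF2 import PdfReader

    reader = PdfReader(path)
    # extract_text() can return an empty result for scanned pages.
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def merge_pdfs(paths: list[str], output: str) -> None:
    """Append the pages of each input file, in order, to one output file."""
    from PyPDF2 import PdfReader, PdfWriter

    writer = PdfWriter()
    for path in paths:
        for page in PdfReader(path).pages:
            writer.add_page(page)
    with open(output, "wb") as handle:
        writer.write(handle)


def split_pdf(path: str, spec: str, output: str) -> None:
    """Write only the pages selected by spec (for example "3-5") to output."""
    from PyPDF2 import PdfReader, PdfWriter

    reader = PdfReader(path)
    writer = PdfWriter()
    for index in parse_page_range(spec, len(reader.pages)):
        writer.add_page(reader.pages[index])
    with open(output, "wb") as handle:
        writer.write(handle)
```

For example, split_pdf("document.pdf", "3-5", "excerpt.pdf") would copy pages 3 through 5 into a new file, and merge_pdfs(["a.pdf", "b.pdf"], "combined.pdf") would concatenate two files. Keeping the range parsing separate from the file handling makes the page arithmetic easy to check on its own.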
42-pdf-tools/
  README.md
  pdf_tools.py
  requirements.txt
Install the required packages before running:
pip install PyPDF2
PDF Tools
---------
1. Extract Text
2. Merge PDFs
3. Split PDF
4. Get Metadata
Choice: 4
File: document.pdf
Title: Annual Report 2024
Author: John Smith
Pages: 42
Creator: Microsoft Word
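A report like the "Get Metadata" output above might be produced along these lines. This is a sketch under stated assumptions rather than the actual code in pdf_tools.py: format_report and read_metadata are hypothetical names, and the "(not set)" placeholder for missing fields is an invented convention. It uses PdfReader.metadata, which can be None when a file carries no document information.

```python
from typing import Optional


def format_report(filename: str, fields: dict, page_count: int) -> str:
    """Lay out metadata in the same order as the sample session."""

    def shown(key: str) -> str:
        # "(not set)" is a placeholder chosen for this sketch.
        return fields.get(key) or "(not set)"

    return "\n".join(
        [
            f"File: {filename}",
            f"Title: {shown('title')}",
            f"Author: {shown('author')}",
            f"Pages: {page_count}",
            f"Creator: {shown('creator')}",
        ]
    )


def read_metadata(path: str) -> str:
    """Read title, author, creator, and page count from a PDF file."""
    # Imported here so format_report can be used without PyPDF2 installed.
    from PyPDF2 import PdfReader

    reader = PdfReader(path)
    info = reader.metadata  # None if the PDF has no info dictionary
    fields: dict[str, Optional[str]] = {
        "title": info.title if info else None,
        "author": info.author if info else None,
        "creator": info.creator if info else None,
    }
    return format_report(path, fields, len(reader.pages))
```

Calling print(read_metadata("document.pdf")) would then print one labelled line per field, with the page count taken from len(reader.pages).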