doclin

Here is 1 public repository matching this topic...

A Streamlit app with a FastAPI backend that extracts structured data (text, images, and tables) from websites and PDFs. Processed data is stored in AWS S3 and rendered in a standardized Markdown format. The APIs are deployed on Google Cloud Run.

  • Updated Jan 31, 2025
  • Jupyter Notebook
