Skip to content

feat(extraction): modularize PDF-Extract-Kit integration and enhance …#3

Merged
shmukit merged 1 commit into
mainfrom
fix/kit-mixin-type-stubs
May 3, 2026
Merged

feat(extraction): modularize PDF-Extract-Kit integration and enhance …#3
shmukit merged 1 commit into
mainfrom
fix/kit-mixin-type-stubs

Conversation

@shmukit
Copy link
Copy Markdown
Owner

@shmukit shmukit commented May 3, 2026

…extraction capabilities

  • Introduced a structured wrapper for PDF-Extract-Kit, facilitating modular extraction processes.
  • Added new mixins for image handling, page processing, question building, and text utilities.
  • Updated configuration examples to support LiteLLM model routing.
  • Enhanced extraction settings with validation for segmentation presets.
  • Improved UI to display active configuration snapshots and allow for dynamic updates.

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue)

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

  • Unit Tests
  • Integration Tests
  • Manual UI Verification

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • Any dependent changes have been merged and published in downstream modules

…extraction capabilities

- Introduced a structured wrapper for PDF-Extract-Kit, facilitating modular extraction processes.
- Added new mixins for image handling, page processing, question building, and text utilities.
- Updated configuration examples to support LiteLLM model routing.
- Enhanced extraction settings with validation for segmentation presets.
- Improved UI to display active configuration snapshots and allow for dynamic updates.
@shmukit shmukit merged commit 3f65acb into main May 3, 2026
3 checks passed
shmukit added a commit that referenced this pull request May 3, 2026
Merge pull request #3 from shmukit/fix/kit-mixin-type-stubs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant