refactor: AI SDK Tools by esafwan · Pull Request #198 · tridz-dev/huf

esafwan · 2026-03-10T01:55:13Z

This PR performs a comprehensive refactor of the huf/ai/sdk_tools.py module, which had grown into a 2,682-line "god module" with multiple overlapping responsibilities. The refactor improves modularity, removes significant code duplication, and establishes a cleaner architecture for tool handlers.

Key Changes

Modularization: Split sdk_tools.py into specialized handlers in huf/ai/handlers/:
- crud.py: Handles tool creation and basic management.
- conversation_data.py: Manages data extraction and conversation indexing.
- media.py: Handles Image Generation, TTS, STT, and OCR.
- agent_runner.py: Orchestrates agent execution logic.
Single Source of Truth: Created tool_types.py to consolidate tool-type mappings, eliminating identical 20-entry if/elif chains in sdk_tools.py and flow_tool_executor.py.
Boilerplate Reduction: Extracted shared logic for media handling into media_utils.py, removing ~200 lines of redundant code across 4 handlers.
Cleanup & Bug Fixes:
- Removed duplicate function definitions in tool_functions.py.
- Fixed filename typo: cilent_side_tool.py -> client_side_tool.py.
- Deleted redundant conversation_data_tools.py.
- Standardized error logging and indentation across modified modules.
Backward Compatibility: sdk_tools.py has been slimmed down (~250 lines) but maintains re-exports for existing integrations.

Impact

Reduced sdk_tools.py size by ~90%.
Eliminated ~400 lines of dead code and duplicated boilerplate.
Improved maintainability by separating SDK wiring from business logic.

Refactor Phases

Foundation: Registry and mapping consolidation.
Extraction: Separation of concerns into logical handlers.
Slimming: Reducing the main entry point to a dispatcher/wire-up layer.
Integration: Updating references across flow_tool_executor.py, install.py, and agent_chat.py.
Quality Assurance: Standardization of linting and logging.

fix: harden tool creation functions for migration safety

feat: Update Project documentation

…nd add model field for STT

- Implement _resolve_stt_config helper to robustly resolve STT models and API keys,mirroring the architecture of _resolve_tts_config. - Update handle_transcribe_audio logic to actively save the identified stt_model to the generated 'Agent Message' database record for better tracking. - Add '_get_default_stt_model' to cleanly handle provider-specific model fallbacks.

feat: refactor audio transcription to support cross-provider overrides

…sponses - Fixed early exit bug in 'litellm.run_stream' upon receiving the 'stop' finish reason, allowing the stream to fully process and capture OpenAI's delayed usage metadata chunk. - Improved 'cached_tokens' extraction in agent_integration.py to support LiteLLM's internal 'cache_hit_tokens' alias. - Fixed a scoping bug in run_agent_stream where cost calculation and conversation DB metrics updates were incorrectly nested inside a fallback block.

- Fix token extraction in agent_integration.py sync & stream to handle explicit None values. - Fix litellm.py tracking logic to prevent '+= None' operations. - Ensure 'cached_tokens' always falls back to 0 to prevent SQL cannot be null constraints and comparison errors in cost calculation.

Fix: Resolve Gemini Prompt Caching & Streaming Cost Accumulation

fix: audio transcription issue in gemini

Feat/agent voice example

Comprehensive analysis and plan to break the 2,682-line sdk_tools.py god module into focused, testable modules. Key changes: extract CRUD handlers, media handlers, and conversation data handlers into separate files; create a single source of truth for tool-type mappings; eliminate ~200 lines of duplicated media boilerplate; fix dead code, duplicate definitions, and filename typos. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

The huf/ai/ directory was missing an __init__.py file, making it not a proper Python package. Sub-packages (knowledge/, providers/, orchestration/) already had __init__.py files. Adding this ensures consistent import behavior and allows tools like pytest to discover tests correctly. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

…tions.py The function was defined twice (lines 359 and 461). Python silently overwrites the first definition with the second, making the first one dead code. The second definition (which accepts **kwargs for multi-file attachment) is the correct and actively used version. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

The filename had a typo ("cilent" instead of "client"). Updated the corresponding function_path reference in sdk_tools.py to match. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Move all document CRUD handler functions (_sanitize_for_doctype, handle_create_document, handle_delete_document, handle_get_list, handle_update_document, handle_get_document, handle_*_documents, handle_submit/cancel_document, handle_get/set_value, handle_get_report_result, handle_attach_file_to_document) from sdk_tools.py into a dedicated handlers/crud.py module. Also removes unused factory functions (create_get_function, create_create_function, create_update_function, create_delete_function, create_list_function, wrap_frappe_function) that were never called — create_agent_tools() uses the handle_* pattern exclusively. Backward-compatible re-exports are kept in sdk_tools.py so existing function_path values in the database continue to resolve. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Move conversation data handlers (_now_iso_utc, _load_state, handle_get/set/load_conversation_data) from sdk_tools.py into huf/ai/handlers/conversation_data.py. Delete huf/ai/conversation_data_tools.py which was a redundant copy of the same functions already present in sdk_tools.py. Backward-compatible re-exports kept in sdk_tools.py. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

…r.py Move handle_run_agent() from sdk_tools.py into a dedicated module. This handler has a unique dependency on frappe.utils.background_jobs.enqueue that no other handler needs — keeping it separate reduces coupling. Backward-compatible re-export kept in sdk_tools.py. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Create huf/ai/media_utils.py with shared helpers that eliminate ~200 lines of duplicated boilerplate across image/audio/OCR handlers: - get_agent_provider_config(): load agent, provider, API key - get_next_conversation_index(): sequential message ordering - create_agent_message(): Agent Message document creation - save_media_file(): file save with message/conversation attachment - emit_message_event(): realtime socket event publishing - update_conversation_total_messages(): conversation counter update Extract all media handlers to huf/ai/handlers/media.py: - handle_generate_image (image generation via LiteLLM) - handle_ocr_document (OCR via LiteLLM OCR/vision endpoints) - handle_generate_audio (TTS via LiteLLM speech) - handle_transcribe_audio (STT via LiteLLM transcription) - All associated helper/config functions (_resolve_tts_config, etc.) Also fixes: replace print() with frappe.logger().debug() in image generation (was print("Returned images: ", images)). Backward-compatible re-exports kept in sdk_tools.py. The file is now reduced from 2,682 lines to 465 lines. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Add huf/ai/tool_types.py as the single source of truth for tool-type- to-handler-path mapping. This eliminates the duplicated if/elif chain that was maintained in both sdk_tools.py and flow_tool_executor.py. Refactor sdk_tools.py: - Replace 40-line if/elif chain with TOOL_TYPE_HANDLERS dict lookup - Replace inline DOCTYPE_BOUND_TYPES list with shared constant - Clean up duplicate imports (base64 x2, save_file x2) - Remove unused imports (io, requests, hashlib, enqueue, client, etc.) - Remove unused datetime/timedelta module-level imports - Remove inline 'import inspect' (use module-level import) - Extract _call_with_filtered_args() helper from on_invoke_tool closure - Standardize frappe.log_error() to use keyword arguments - Use consistent tab indentation throughout - Consolidate all re-exports at bottom of file sdk_tools.py is now ~280 lines of core logic (down from 2,682). https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Replace the hardcoded type_to_handler dict and inline DOCTYPE_BOUND_TYPES list with imports from huf.ai.tool_types. This ensures both sdk_tools.py and flow_tool_executor.py share a single source of truth for tool-type mappings — adding a new tool type now requires updating only one file. Also reuse _call_with_filtered_args() from sdk_tools instead of duplicating the inspect-based parameter filtering logic. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

Update function_path references in tool creation/migration functions to point to the new canonical handler locations: - huf.ai.sdk_tools.handle_generate_image → huf.ai.handlers.media - huf.ai.sdk_tools.handle_ocr_document → huf.ai.handlers.media - huf.ai.sdk_tools.handle_generate_audio → huf.ai.handlers.media - huf.ai.sdk_tools.handle_transcribe_audio → huf.ai.handlers.media New installations will store the correct paths. Existing installations continue to work via backward-compatible re-exports in sdk_tools.py. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

… type -Added import for get_documents from tool_functions. -Implemented handle_get_documents(reference_doctype, document_ids, **kwargs). -Added validation for missing doctype and invalid/empty document_ids. -Standardized success/error response shape to match existing handlers. -Restored compatibility with tool_types.py mapping for 'Get Multiple Documents'.

-Updated success message text from Set '{name}' match successfully to Set '{name}' successfully. -No behavior or schema change only response clarity/readability improvement.

-Fixed extra-arg injection for HTTP tools to use tool_doc['tool_name'] instead of document name. -Prevents mismatch in handler-side policy/logging logic that depends on the actual tool name. -Keeps deterministic flow tool execution behavior aligned with SDK tool execution path.

-Fixed indentation in on_invoke_tool block that could cause runtime failure. -Removed duplicate context injection path and kept one canonical context extraction using _frappe_run_context_dict(ctx). -Preserved propagation of conversation_id, agent_run_id, and agent_name into tool args. -Kept backward-compatible re-export list intact and added missing CRUD plural handler export.

-Use context values as fallbacks in tool invocation. This fixes self-triggering behavior where an agent incorrectly invoked itself instead of the configured downstream agent. -Merge context with setdefault in on_invoke_tool

-make submit and cancel handler resilient to missing doctype args -return clear validation errors for missing document_id/doctype

- Added link filed for selecting report

- Updated 'create_agent_tools' to load all tools assigned to the agent without filtering. - Updated 'create_function_tool' signature to accept the 'tool_doc'. - Added a strict runtime permission check inside 'on_invoke_tool' using PermissionAwareToolRegistry._can_use_tool.

Sanjusha-tridz and others added 27 commits March 4, 2026 11:43

Merge pull request #180 from tridz-dev/fix/tool-migrate

37d0480

fix: harden tool creation functions for migration safety

feat: Update Project documentation

ae645a0

Merge pull request #181 from tridz-dev/feat/update-claude-doc

05e0f6f

feat: Update Project documentation

feat: add stt_model field for dedicated audio transcription config

a6fee6f

fix: local variable 'user_content' referenced before assignment issue

8b4e335

refactor: reorganize Agent Message doctype with structured sections a…

db744b9

…nd add model field for STT

Merge pull request #182 from tridz-dev/fix/stt-model-selection

d7569ec

feat: refactor audio transcription to support cross-provider overrides

Merge pull request #185 from tridz-dev/fix/prompt-caching

9e2ec65

Fix: Resolve Gemini Prompt Caching & Streaming Cost Accumulation

fix: audio transcription issue in gemini

a63a961

Merge pull request #186 from tridz-dev/fix/audio-transcription

c24cc26

fix: audio transcription issue in gemini

feat(example): Audio chat example

50316d8

fix: cleanup and sending audio on send click

ef10f3c

Merge pull request #187 from tridz-dev/feat/agent_voice_example

2a78bf7

Feat/agent voice example

fix: rename cilent_side_tool.py to client_side_tool.py (typo fix)

3779cf2

The filename had a typo ("cilent" instead of "client"). Updated the corresponding function_path reference in sdk_tools.py to match. https://claude.ai/code/session_01HQEq8q8LJ2H6XJF9AGYGEo

esafwan changed the title ~~Refactor: AI SDK Tools~~ refactor: AI SDK Tools Mar 19, 2026

Sanjusha-tridz added 2 commits March 27, 2026 12:57

Merge branch 'develop' into refactor/refactor-ai-sdk-tools

a0abdc3

Sanjusha-tridz added 9 commits April 1, 2026 09:58

chore: clean up set_conversation_data success message wording

95089fd

-Updated success message text from Set '{name}' match successfully to Set '{name}' successfully. -No behavior or schema change only response clarity/readability improvement.

fix: preserve Run Agent target by preventing context overwrite

4b87a09

-Use context values as fallbacks in tool invocation. This fixes self-triggering behavior where an agent incorrectly invoked itself instead of the configured downstream agent. -Merge context with setdefault in on_invoke_tool

fix: add Submit and Cancel to doctype-bound tool set

ea7ff5a

-make submit and cancel handler resilient to missing doctype args -return clear validation errors for missing document_id/doctype

fix: update reference doctype validation for Get Report Result Tool

c3010bd

- Added link filed for selecting report

fix: update frappe.log_error to use named parameters

dc2ed48

Merge branch 'develop' into refactor/refactor-ai-sdk-tools

b886ead

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: AI SDK Tools#198

refactor: AI SDK Tools#198
esafwan wants to merge 38 commits into
developfrom
refactor/refactor-ai-sdk-tools

esafwan commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

esafwan commented Mar 10, 2026

Key Changes

Impact

Refactor Phases

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants