Build software better, together

tznthou / claude-prism

Cross-provider AI code review for Claude Code — evidence-based confidence scoring with Codex, Gemini & Claude

bash gemini code-review fact-checking codex multi-provider github-actions ai-code-review claude-code confidence-scoring cross-provider

Updated May 18, 2026
Shell

goergen95 / seapig

Star

Confidence based selection of compatible inputs

deep-learning pytorch remote-sensing selective-prediction torchgeo geospatial-ai confidence-scoring

Updated May 18, 2026
Python

obielin / llm-extract

Star

Extract structured data from any document — PDF, DOCX, HTML, CSV, plain text — using LLMs with Pydantic schema validation, per-field confidence scores, and source grounding.

python nlp pdf extraction structured-output pydantic llm document-parsing anthropic confidence-scoring

Updated Apr 5, 2026
Python

metareason-ai / metareason-core

Star

Open-source LLM evaluation engine with statistical confidence scoring

statistical-analysis bayesian-inference ai-governance llm-evaluation confidence-scoring

Updated Mar 24, 2026
Python

laundromatic / shopgraph

Star

The extraction API that shows its work. Product data extraction with per-field confidence scoring and extraction provenance. REST API + MCP server. 50 free calls/month.

ecommerce ucp schema-org structured-data ai-agents product-data mcp-server confidence-scoring agent-commerce stripe-mpp shopgraph extraction-provenance

Updated May 17, 2026
TypeScript

lorenzespinosa / n8n-ai-agent-delegator

Star

Multi-agent AI task delegation architecture for n8n: orchestrator routes natural-language commands to specialist agents with confidence scoring and human-in-the-loop gates.

automation orchestration multi-agent openai ai-agents n8n llm confidence-scoring

Updated Mar 31, 2026

seljicom / selji-zero-noise

Star

Zero-Noise utilities for safer product research and review signal analysis.

ecommerce decision-support consumer-research review-analysis product-research confidence-scoring zero-noise buyer-tools shopping-tools

Updated Feb 7, 2026
JavaScript

SouravUpadhyay7 / self_correcting_rag

Star

Research-grade Self-Correcting RAG agent built with LangGraph that retrieves knowledge, generates answers, evaluates grounding/relevance/completeness, and iteratively self-improves with confidence scoring and memory.

python rag streamlit langchain llm-agent openrouter hallucination-detection langgraph knowledge-retrieval huggingface-embeddings confidence-scoring self-correcting-ai

Updated Mar 20, 2026
Python

theangelofwill / CrossModel-Consensus

Star

System that aggregates outputs from multiple Large Language Models (GPT-4, Claude-3, custom models) to generate reliable, high-confidence results through consensus-based reasoning evaluation. Demonstrates sophisticated AI orchestration with 92.7% accuracy improvement over single-model.

python api docker portfolio machine-learning ai deep-learning orchestration pytorch neural-networks multi-model consensus-algorithm model-comparison mlflow fastapi ai-engineering llm prompt-engineering confidence-scoring

Updated Dec 22, 2025
Python

simply-mihir / nistula-technical-assessment

Star

AI-powered concierge that normalises guest messages from WhatsApp, Booking.com, Airbnb, Instagram and direct channels, drafts a reply with Claude, and routes responses through a deterministic confidence-scoring pipeline. Built with FastAPI + Claude Sonnet 4.

Updated May 18, 2026
Python

m2ai-portfolio / hallucination-hunter-vscode-extension-for-real-time-ai-answer-validation

Star

Catch AI‑code hallucinations instantly: real‑time sandbox validation scores suggestions, flags low‑confidence snippets, so solo devs avoid wasted debugging and regain trust in assistants.

vscode-extension cli-tool trust-in-ai real-time-validation confidence-scoring sandbox-testing solo-developers linter-integration reduce-rework ai-assistant-users

Updated Apr 16, 2026
Python

obinexus / gating

Star

wjddusrb03 / docforge

Star

Smart Document Conversion for the AI Era - CPU-only, fast, with confidence scoring. Converts PDF, DOCX, PPTX, HTML, EPUB to Markdown, JSON, HTML, Text.

Updated Mar 29, 2026
Python

JLHC-AI-portfolio / community-fair-supplier-packet-review

Star

Supplier PDF-to-Excel/CSV workflow with structured extraction, confidence scoring, validation flags, and human-review cues.

nodejs express validation data-cleaning csv-export excel-automation pdf-extraction document-automation confidence-scoring ai-assisted-extraction

Updated Apr 28, 2026
JavaScript

selfradiance / memledger

Star

Append-only CLI ledger for structured agent memory claims with provenance, confidence, contestability, and immutable history.

nodejs cli typescript sqlite provenance developer-tools ai-agents audit-trail append-only zod local-first agent-memory confidence-scoring memory-integrity claim-ledger

Updated Apr 28, 2026
TypeScript

Jh-justinHarmon / knowledge-ingestion-engine

Star

Ingestion pipelines with artifact lineage, replayable stages, and append-only persistence.

telemetry systems-engineering append-only pipeline-architecture confidence-scoring knowledge-ingestion deterministic-processing artifact-lineage replayable-pipeline

Updated Apr 6, 2026
Python

raksh-dev / inventory-data-standardization

Star

A modular AI-driven pipeline for cleaning, normalizing, and standardizing large-scale inventory data with automated SKU generation, confidence scoring, and human-in-the-loop validation.

python machine-learning pandas data-engineering data-normalization data-cleaning human-in-the-loop ai-agents etl-pipeline fastapi sku-generation google-gemini confidence-scoring inventory-standardization

Updated Jan 26, 2026
Python

sirmaxworld / ai-solver

Star

AI-powered problem solver using dual-AI validation with 88%+ confidence scoring. By Yourox.ai

ai agpl developer-tools problem-solving gpt claude confidence-scoring dual-ai

Updated Aug 13, 2025
HTML

hrswatirai-debug / HR-RAG-Multi-Files-AI-Agent

Star

Production-grade HR document intelligence system built on n8n, Pinecone, OpenAI, and PostgreSQL. Automatically detects and processes multiple files from a Google Drive folder, then answers natural-language queries against your HR documents with cited, confidence-scored responses — complete with query logging, caching, and error handling.

webhook postgresql google-drive multi-agent openai deduplication audit-trail pinecone rag n8n generative-ai enterprise-ai hr-automation confidence-scoring document-intelligence-rag llm-reranking source-citation query-caching auto-ingestion

Updated Apr 28, 2026

MafeTech24 / n8n-procesamientoDocsEnd2End

Star

Backend document processing pipeline using n8n and Gemini AI. Receives files via webhook, extracts structured data, calculates confidence scores and stores results in Supabase and Google Sheets.

javascript api webhooks automation ocr ai backend google-sheets gemini workflow-automation document-processing process-automation rpa n8n supabase document-intelligence ai-automation gemini-ai confidence-scoring

Updated Feb 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

confidence-scoring

Here are 23 public repositories matching this topic...

tznthou / claude-prism

goergen95 / seapig

obielin / llm-extract

metareason-ai / metareason-core

laundromatic / shopgraph

lorenzespinosa / n8n-ai-agent-delegator

seljicom / selji-zero-noise

SouravUpadhyay7 / self_correcting_rag

theangelofwill / CrossModel-Consensus

simply-mihir / nistula-technical-assessment

m2ai-portfolio / hallucination-hunter-vscode-extension-for-real-time-ai-answer-validation

obinexus / gating

wjddusrb03 / docforge

JLHC-AI-portfolio / community-fair-supplier-packet-review

selfradiance / memledger

Jh-justinHarmon / knowledge-ingestion-engine

raksh-dev / inventory-data-standardization

sirmaxworld / ai-solver

hrswatirai-debug / HR-RAG-Multi-Files-AI-Agent

MafeTech24 / n8n-procesamientoDocsEnd2End

Improve this page

Add this topic to your repo