Maintenance in progress. Documentation and research surfaces are being standardized before this repo is shared as a stable reference.
Polinko is a human-led research system for inspecting AI behavior through fail-first evaluation, evidence-preserving method work, and repo-native publishing.
The website is a doorway. The repository is the research surface.
Most AI projects foreground polished outputs and hide the failure structure. Polinko keeps failure visible enough to inspect, classify, and improve. That matters anywhere confidence can outrun source evidence.
The method is intentionally small and strict: preserve the artifact, decide
pass or fail, retain useful failures, evict noise, and let the tracked
evidence change the next run.
Beta 2.3 is the frozen method snapshot. pre-Beta 2.4 is now staged
as the next research-model contract before new evidence is cut.
The active read is:
- OCR is the mature green lane, stabilized on the current image set and moving into broader generalization pressure.
- The discarded run-level rollup path is not being carried forward; the next research-model question is how source artifacts, row/case judgment, manual evals, lane summaries, and repeated lane signal can carry claims.
- Co-reasoning is the first promoted non-OCR lane, supported by tracked style and soak evidence.
- Retrieval, response behaviour, uncertainty boundary, and hallucination boundary are operationalized support surfaces.
- Operator burden is the active thin lane because it is still producing distinct evidence pressure.
- The research surface is open: current lane status is explicit, but the method claim is still under pressure.
For the maintained research map, start with docs/research/README.md.
Use the public path when you want the shortest reviewer-facing read:
Use the operator path when you need to run, inspect, or change the system:
- FastAPI API and CLI runtime
- Manual eval workbench: notebooks, local evidence databases, chat artifacts, feedback, checkpoints, and exports
- Fail-first eval surfaces
- OCR and non-OCR method lanes
- Export-backed behaviour backlog mining
- Tracked research docs, diagrams, and eval context
- Repo-local engineering and validation workflow
make deps-install
cp .env.example .env
# set OPENAI_API_KEY in .env
make doctor-env
make docsThis prints the local API docs URL. Use make docs-open only when you want to
launch the system browser.
src/polinko/cli.pycontains the packaged CLI chat implementation;make chat,polinko-chat, and rootmain.pylaunch it.src/polinko/asgi.pyconstructs the FastAPI ASGI app and chat-facing manual eval workbench endpoints; rootserver.pypreservesuvicorn server:app.pyproject.tomlandsrc/polinko/provide the editable-install rail for the runtime package.src/polinko/config.pycontains the canonical config implementation; the legacy rootconfig.pyshim is retired.src/polinko/api/contains the canonical API implementation; legacy rootapi/shims are retired.src/polinko/core/contains canonical runtime logic; legacy rootcore/shims are retired.tools/contains eval and maintenance scripts.tests/contains the test suite.docs/public/contains the curated public reading path.docs/research/contains the compact research surface.docs/eval/contains eval evidence and phase context.docs/governance/contains charter, state, and decisions.docs/runtime/contains runbook and architecture.
Apache-2.0. See LICENSE.