TraceLens Examples

Run these examples from the repository root after installing TraceLens:

uv pip install -e ".[dev]"

Example Ladder

| Step | File | What It Shows |
| --- | --- | --- |
| 1 | `hello_world.py` | The smallest possible local eval: task, adapter, grader, runner. |
| 2 | `contract_eval.py` | Generate graders from a behavior contract. |
| 3 | `http_agent_eval.py` | Evaluate an agent exposed as an HTTP JSON endpoint. |
| 4 | `noise_aware_regression.py` | Compare runs with different infrastructure fingerprints. |

Hello World

python examples/hello_world.py
tracelens report --results examples/reports/hello_world_report.json --format markdown

Expected output:

tracelens hello-world
--------------------
trials run : 9
pass rate  : 100%
report json: examples/reports/hello_world_report.json
sample md  : examples/reports/hello_world_report.md

Use this file as the template when you want to evaluate a normal Python function or local agent loop. The generated sample report at examples/reports/hello_world_report.md shows tasks, trials, pass@k, pass^k, graders, baseline comparison, regression result, and CI summary.
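As a rough illustration of the four pieces this example wires together, and of the pass@k and pass^k figures in the sample report, here is a framework-free sketch. It does not use the TraceLens API: `Task`, `adapter`, `exact_match_grader`, `run_trials`, `pass_at_k`, and `pass_hat_k` are hypothetical names chosen for this sketch. The formulas are the commonly used combinatorial definitions (at least one of k sampled trials passes; all k sampled trials pass) and may differ from what TraceLens actually computes.

```python
from dataclasses import dataclass
from math import comb
from typing import Callable


@dataclass
class Task:
    """Hypothetical task record: an input prompt and the expected output."""
    task_id: str
    prompt: str
    expected: str


def adapter(prompt: str) -> str:
    """Hypothetical adapter: wraps the function under test (here, str.upper)."""
    return prompt.upper()


def exact_match_grader(output: str, task: Task) -> bool:
    """Pass when the output equals the expected string exactly."""
    return output == task.expected


def run_trials(
    tasks: list[Task],
    call: Callable[[str], str],
    grade: Callable[[str, Task], bool],
    trials_per_task: int = 3,
) -> dict[str, list[bool]]:
    """Run every task several times and record one pass/fail per trial."""
    return {
        task.task_id: [grade(call(task.prompt), task) for _ in range(trials_per_task)]
        for task in tasks
    }


def pass_at_k(n: int, c: int, k: int) -> float:
    """Chance that at least one of k trials, sampled from n with c passes, passes."""
    return 1.0 - comb(n - c, k) / comb(n, k)


def pass_hat_k(n: int, c: int, k: int) -> float:
    """Chance that all k trials, sampled from n with c passes, pass (pass^k)."""
    return comb(c, k) / comb(n, k)


if __name__ == "__main__":
    tasks = [Task("greet", "hello", "HELLO"), Task("shout", "ok", "OK")]
    results = run_trials(tasks, adapter, exact_match_grader)
    for task_id, outcomes in results.items():
        n, c = len(outcomes), sum(outcomes)
        print(task_id, pass_at_k(n, c, 2), pass_hat_k(n, c, 2))
```

The two metrics diverge as soon as an agent is flaky: with 2 passes out of 3 trials and k = 2, pass@k is 1.0 while pass^k is 1/3, which is why a report that shows both is more informative than a single pass rate.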

HTTP Agent

python examples/http_agent_eval.py

This starts a local stdlib HTTP server, evaluates it with HTTPAPIAdapter, and grades the JSON response shape.
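The flow described above can be approximated with the standard library alone. The sketch below is not the contents of `http_agent_eval.py` and does not use `HTTPAPIAdapter`, whose interface is not shown here. The handler, the `call_agent` and `grade_shape` helpers, and the `answer`/`confidence` response fields are all assumptions made for illustration.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class AgentHandler(BaseHTTPRequestHandler):
    """Hypothetical agent: upper-cases the prompt and returns it as JSON."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length) or b"{}")
        body = json.dumps(
            {"answer": str(payload.get("prompt", "")).upper(), "confidence": 1.0}
        ).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Silence per-request logging so the eval output stays readable.
        pass


def call_agent(url: str, prompt: str) -> dict:
    """POST a JSON prompt to the agent and decode its JSON reply."""
    request = urllib.request.Request(
        url,
        data=json.dumps({"prompt": prompt}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # An empty ProxyHandler keeps the localhost call from going through a proxy.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(request, timeout=5) as response:
        return json.loads(response.read())


def grade_shape(response: dict, required: dict) -> bool:
    """Pass when every required key is present with the expected type."""
    return all(
        key in response and isinstance(response[key], expected_type)
        for key, expected_type in required.items()
    )


def main():
    # Port 0 asks the OS for any free port.
    server = HTTPServer(("127.0.0.1", 0), AgentHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        url = f"http://127.0.0.1:{server.server_address[1]}/"
        response = call_agent(url, "hello")
        return response, grade_shape(response, {"answer": str, "confidence": float})
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(main())
```

Grading only the response shape keeps the check independent of the agent's wording, so it works as a first smoke test before stricter content graders are added.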

Contract Eval

python examples/contract_eval.py

The script generates graders from a behavior contract, which is the fastest way to encode strict output rules without writing every grader by hand.
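To make the idea of contract-generated graders concrete, here is a minimal sketch that turns a declarative contract into a set of grader functions. The contract keys (`max_chars`, `must_include`, `must_not_include`, `matches`) and the `graders_from_contract` and `evaluate` helpers are hypothetical. The real contract format used by `contract_eval.py` is not shown in this README and may look quite different.

```python
import re
from typing import Callable

Grader = Callable[[str], bool]


def graders_from_contract(contract: dict) -> dict[str, Grader]:
    """Build one named grader per rule in a (hypothetical) contract dict."""
    graders: dict[str, Grader] = {}
    if "max_chars" in contract:
        limit = contract["max_chars"]
        graders["max_chars"] = lambda output, limit=limit: len(output) <= limit
    for phrase in contract.get("must_include", []):
        # Default arguments bind the current phrase to each lambda.
        graders[f"must_include:{phrase}"] = (
            lambda output, phrase=phrase: phrase in output
        )
    for phrase in contract.get("must_not_include", []):
        graders[f"must_not_include:{phrase}"] = (
            lambda output, phrase=phrase: phrase not in output
        )
    if "matches" in contract:
        pattern = re.compile(contract["matches"])
        graders["matches"] = (
            lambda output, pattern=pattern: pattern.fullmatch(output) is not None
        )
    return graders


def evaluate(output: str, graders: dict[str, Grader]) -> dict[str, bool]:
    """Apply every grader to one output and report pass/fail per rule."""
    return {name: grader(output) for name, grader in graders.items()}


if __name__ == "__main__":
    contract = {
        "max_chars": 40,
        "must_include": ["order"],
        "must_not_include": ["sorry"],
        "matches": r"[A-Z].*\.",
    }
    graders = graders_from_contract(contract)
    print(evaluate("Your order has shipped.", graders))
```

Keeping one named grader per rule means a failing output reports exactly which clause of the contract it violated, rather than a single combined verdict.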

Noise-Aware Regression

python examples/noise_aware_regression.py

This demonstrates how TraceLens separates agent regressions from small infrastructure-driven differences.
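One simple way to picture this separation is to allow a slightly larger pass-rate drop when the two runs were produced on different infrastructure. The sketch below is an assumption-laden illustration, not TraceLens's actual algorithm: `RunSummary`, `fingerprint`, `classify`, and the `tolerance` and `noise_allowance` thresholds are all hypothetical.

```python
import hashlib
import json
from dataclasses import dataclass


@dataclass
class RunSummary:
    """Hypothetical summary of one eval run plus the infrastructure it ran on."""
    passed: int
    total: int
    infra: dict

    @property
    def pass_rate(self) -> float:
        return self.passed / self.total


def fingerprint(infra: dict) -> str:
    """Stable short hash of the infrastructure description."""
    canonical = json.dumps(infra, sort_keys=True)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]


def classify(
    baseline: RunSummary,
    candidate: RunSummary,
    tolerance: float = 0.02,
    noise_allowance: float = 0.05,
) -> str:
    """Label a pass-rate drop as stable, infrastructure noise, or a regression."""
    drop = baseline.pass_rate - candidate.pass_rate
    if drop <= tolerance:
        return "stable"
    same_infra = fingerprint(baseline.infra) == fingerprint(candidate.infra)
    # Only runs on different infrastructure get the extra allowance.
    if not same_infra and drop <= tolerance + noise_allowance:
        return "infra_noise"
    return "regression"


if __name__ == "__main__":
    laptop = {"python": "3.12", "os": "macos"}
    ci = {"python": "3.12", "os": "linux"}
    baseline = RunSummary(passed=95, total=100, infra=laptop)
    print(classify(baseline, RunSummary(91, 100, laptop)))  # same infrastructure
    print(classify(baseline, RunSummary(91, 100, ci)))      # different infrastructure
```

In this sketch the same four-point drop is treated as a regression when the fingerprints match and as noise when they differ, which is the distinction the example script is meant to demonstrate.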

Coverage Notes

These four examples are intentionally small and dependency-light. They are enough to teach the core framework and support the first public release.

Future examples should focus on scenarios that are documented but not yet represented as runnable scripts:

- LLM-as-judge using a fake or recorded provider.
- Multi-step tool-use transcript review.
- Human calibration against grader output.
- Downstream project CI that installs TraceLens from PyPI.
