majorcontext · andybons · May 11, 2026 · May 11, 2026 · May 11, 2026 · May 11, 2026
diff --git a/AGENTS.md b/AGENTS.md
@@ -0,0 +1,127 @@
+# AGENTS.md
+
+Instructions for AI coding agents working in this repository.
+
+## Project Overview
+
+Gatekeeper is a standalone credential-injecting TLS-intercepting proxy. It transparently injects authentication headers (tokens, API keys) into proxied HTTPS requests based on hostname matching. Clients never see raw credentials — they route traffic through the proxy, which handles credential resolution, injection, and TLS interception.
+
+Key capabilities:
+
+- **Credential injection** — Resolve credentials from environment variables, static values, or AWS Secrets Manager, then inject them as HTTP headers for matching hosts
+- **TLS interception** — MITM proxy with per-host certificate generation from a configured CA
+- **MCP relay** — Forward Model Context Protocol requests with credential injection and SSE streaming
+- **Network policy** — Allow/deny traffic by host pattern
+- **LLM policy** — Evaluate Anthropic API responses against Keep policy rules
+- **Host gateway** — Route synthetic container hostnames to the actual host IP
+- **OpenTelemetry** — Distributed traces, request metrics, and slog-to-OTel logs bridge; configured entirely via standard `OTEL_*` environment variables
+
+## Architecture
+
+```
+proxy/              Core TLS-intercepting proxy engine
+  proxy.go            Main proxy: CONNECT handling, TLS interception, credential injection
+  ca.go              CA certificate loading and per-host cert generation
+  hosts.go           Hostname matching (glob patterns, port stripping)
+  mcp.go             MCP relay handler (SSE streaming, tool credential injection)
+  llmpolicy.go       LLM response policy evaluation via Keep
+  relay.go           HTTP relay for non-CONNECT requests
+  otel.go            OpenTelemetry handler wrapper, metrics instruments, span helpers
+  server.go          Proxy server lifecycle (start/stop/listen)
+
+gatekeeper.go       Standalone server wiring (config → proxy + credential sources)
+config.go           YAML config parsing (proxy, TLS, credentials, network, log)
+config_credential.go  Credential source resolution (env, static, AWS Secrets Manager)
+
+credentialsource/   Pluggable credential backends
+  source.go           Source interface
+  env.go             Environment variable source
+  static.go          Literal value source
+  awssecretsmanager.go  AWS Secrets Manager source
+
+cmd/gatekeeper/     CLI entry point (--config flag)
+
+examples/           Sample config, CA generation script, and test harness
+```
+
+### Key Types
+
+- **`proxy.Proxy`** — The core proxy. Handles HTTP CONNECT, TLS interception, credential injection, network policy, and request logging.
+- **`proxy.RunContextData`** — Per-caller credential and policy context. Holds credentials, network policy, MCP servers, host gateway config, and Keep engines for a single caller.
+- **`proxy.ContextResolver`** — Function type (`func(token string) (*RunContextData, bool)`) that resolves a proxy auth token to per-caller context. Standalone mode uses a single static context; moat's daemon maps each registered run to its own scoped context.
+- **`gatekeeper.Server`** — Standalone server that loads config, resolves credential sources, and wires up the proxy.
+
+### How Credential Injection Works
+
+1. Client sends `CONNECT host:443` through the proxy (via `HTTP_PROXY` env var)
+2. Proxy establishes TLS with the client using a dynamically-generated certificate for that host
+3. Proxy reads the plaintext HTTP request from the client
+4. `RunContextData.Credentials` is checked — if a credential matches the request host, the configured header (default: `Authorization`) is injected
+5. Proxy forwards the request to the real server over a separate TLS connection
+6. Response streams back to the client
+
+### Host Gateway
+
+The `HostGateway` field in `RunContextData` maps a synthetic hostname (used inside containers) to the host machine's IP. When `HostGatewayIP` resolves to a loopback address, the proxy also matches `localhost`/`127.0.0.1`/`::1` as equivalent — so credentials configured for the gateway hostname also apply to direct loopback connections.
+
+### OpenTelemetry Instrumentation
+
+OTel integration uses a callback-based architecture — the proxy core (`proxy/proxy.go`) has no OTel imports. Instrumentation is layered on externally:
+
+- **`proxy.OTelHandler`** wraps the proxy as HTTP middleware, creating root spans and recording request duration/count metrics. Its `statusRecorder` implements `http.Hijacker` so CONNECT requests still work after hijack.
+- **Request/policy loggers** (set in `gatekeeper.go`) attach span events and record credential injection/policy denial metrics via exported functions `proxy.RecordCredentialInjection` and `proxy.RecordPolicyDenial`.
+- **slog bridge** — `gatekeeper.go` uses a `multiHandler` to fan out log records to both the configured slog handler and `otelslog.NewHandler`, correlating logs with trace context.
+- **Provider setup** — `cmd/gatekeeper/main.go` creates OTLP HTTP exporters for traces, metrics, and logs, registering them as global providers. All configuration is via standard `OTEL_*` env vars (no YAML knobs).
+
+## Development Commands
+
+```bash
+# Build
+go build ./...
+
+# Run tests (includes race detector)
+go test -race ./...
+
+# Run a single test
+go test -race -run TestName ./proxy/
+
+# Vet
+go vet ./...
+
+# Build the binary
+go build -o gatekeeper ./cmd/gatekeeper/
+```
+
+## Code Style
+
+- Follow standard Go conventions and `go fmt` formatting
+- Use `go vet` to catch common issues
+- No `internal/` packages — this is a library module meant to be imported
+
+## Git Commits
+
+- Use [Conventional Commits](https://www.conventionalcommits.org/) format: `type(scope): description`
+  - Types: `feat`, `fix`, `docs`, `style`, `refactor`, `test`, `chore`, `build`, `ci`, `perf`
+  - Scope is optional but encouraged (e.g., `feat(proxy): add header injection`)
+- Do not include `Co-Authored-By` lines for AI agents in commit messages
+
+## Security Considerations
+
+This proxy handles sensitive credentials. When making changes:
+
+- Never log credential values (tokens, keys, secrets) — log host/grant names only
+- Credentials must not appear in error messages returned to clients
+- The CA private key must stay in memory only — never written to temp files
+- Validate that TLS interception cannot be bypassed (e.g., via malformed CONNECT requests)
+- Host matching must be exact or use explicit glob patterns — no accidental wildcard leaks
+- Auth token comparison must be constant-time to prevent timing attacks
+
+## Relationship to Moat
+
+This module (`github.com/majorcontext/gatekeeper`) was extracted from moat's `internal/proxy/` package. Moat imports gatekeeper as a dependency and provides the daemon layer (per-run registration, token-scoped contexts, Unix socket management API). Gatekeeper has no knowledge of moat — it's a general-purpose credential-injecting proxy.
+
+## Creating Pull Requests
+
+- Use `gh pr create` with default flags only (no `--base`, `--head`, etc.)
+- If `gh pr create` fails, report the error to the operator immediately
+- Do not attempt to work around failures by adding flags or changing configuration
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -4,6 +4,13 @@ Gatekeeper is a standalone credential-injecting TLS-intercepting proxy. It trans
 
 Gatekeeper is pre-1.0. The configuration schema and credential source interface may change between minor versions.
 
+## v0.10.0 — 2026-05-11
+
+### Added
+
+- **`capture_headers` log config** — new `log.capture_headers` field captures specified request headers as structured attributes in the canonical `"request"` log entry; matched headers are stripped before forwarding upstream; header names are logged as lowercase with hyphens converted to underscores (e.g., `X-Workspace-Slug` → `x_workspace_slug`); values are truncated at 256 characters; sensitive headers (`Authorization`, `Proxy-Authorization`, `Cookie`) are rejected at startup; max 10 headers allowed
+- **User ID in canonical request log** — the proxy auth username (from `HTTP_PROXY=http://user:token@host`) is now logged as `user_id` in the canonical request log entry and included in OTel span attributes
+
 ## v0.9.1 — 2026-04-26
 
 ### Fixed

diff --git a/CLAUDE.md b/CLAUDE.md
@@ -1,132 +1 @@
-# CLAUDE.md
-
-This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
-
-## Project Overview
-
-Gatekeeper is a standalone credential-injecting TLS-intercepting proxy. It transparently injects authentication headers (tokens, API keys) into proxied HTTPS requests based on hostname matching. Clients never see raw credentials — they route traffic through the proxy, which handles credential resolution, injection, and TLS interception.
-
-Key capabilities:
-
-- **Credential injection** — Resolve credentials from environment variables, static values, or AWS Secrets Manager, then inject them as HTTP headers for matching hosts
-- **TLS interception** — MITM proxy with per-host certificate generation from a configured CA
-- **MCP relay** — Forward Model Context Protocol requests with credential injection and SSE streaming
-- **Network policy** — Allow/deny traffic by host pattern
-- **LLM policy** — Evaluate Anthropic API responses against Keep policy rules
-- **Host gateway** — Route synthetic container hostnames to the actual host IP
-- **OpenTelemetry** — Distributed traces, request metrics, and slog-to-OTel logs bridge; configured entirely via standard `OTEL_*` environment variables
-
-## Architecture
-
-```
-proxy/              Core TLS-intercepting proxy engine
-  proxy.go            Main proxy: CONNECT handling, TLS interception, credential injection
-  ca.go              CA certificate loading and per-host cert generation
-  hosts.go           Hostname matching (glob patterns, port stripping)
-  mcp.go             MCP relay handler (SSE streaming, tool credential injection)
-  llmpolicy.go       LLM response policy evaluation via Keep
-  relay.go           HTTP relay for non-CONNECT requests
-  otel.go            OpenTelemetry handler wrapper, metrics instruments, span helpers
-  server.go          Proxy server lifecycle (start/stop/listen)
-
-gatekeeper.go       Standalone server wiring (config → proxy + credential sources)
-config.go           YAML config parsing (proxy, TLS, credentials, network, log)
-config_credential.go  Credential source resolution (env, static, AWS Secrets Manager)
-
-credentialsource/   Pluggable credential backends
-  source.go           Source interface
-  env.go             Environment variable source
-  static.go          Literal value source
-  awssecretsmanager.go  AWS Secrets Manager source
-
-cmd/gatekeeper/     CLI entry point (--config flag)
-
-examples/           Sample config, CA generation script, and test harness
-```
-
-### Key Types
-
-- **`proxy.Proxy`** — The core proxy. Handles HTTP CONNECT, TLS interception, credential injection, network policy, and request logging.
-- **`proxy.RunContextData`** — Per-caller credential and policy context. Holds credentials, network policy, MCP servers, host gateway config, and Keep engines for a single caller.
-- **`proxy.ContextResolver`** — Function type (`func(token string) (*RunContextData, bool)`) that resolves a proxy auth token to per-caller context. Standalone mode uses a single static context; moat's daemon maps each registered run to its own scoped context.
-- **`gatekeeper.Server`** — Standalone server that loads config, resolves credential sources, and wires up the proxy.
-
-### How Credential Injection Works
-
-1. Client sends `CONNECT host:443` through the proxy (via `HTTP_PROXY` env var)
-2. Proxy establishes TLS with the client using a dynamically-generated certificate for that host
-3. Proxy reads the plaintext HTTP request from the client
-4. `RunContextData.Credentials` is checked — if a credential matches the request host, the configured header (default: `Authorization`) is injected
-5. Proxy forwards the request to the real server over a separate TLS connection
-6. Response streams back to the client
-
-### Host Gateway
-
-The `HostGateway` field in `RunContextData` maps a synthetic hostname (used inside containers) to the host machine's IP. When `HostGatewayIP` resolves to a loopback address, the proxy also matches `localhost`/`127.0.0.1`/`::1` as equivalent — so credentials configured for the gateway hostname also apply to direct loopback connections.
-
-### OpenTelemetry Instrumentation
-
-OTel integration uses a callback-based architecture — the proxy core (`proxy/proxy.go`) has no OTel imports. Instrumentation is layered on externally:
-
-- **`proxy.OTelHandler`** wraps the proxy as HTTP middleware, creating root spans and recording request duration/count metrics. Its `statusRecorder` implements `http.Hijacker` so CONNECT requests still work after hijack.
-- **Request/policy loggers** (set in `gatekeeper.go`) attach span events and record credential injection/policy denial metrics via exported functions `proxy.RecordCredentialInjection` and `proxy.RecordPolicyDenial`.
-- **slog bridge** — `gatekeeper.go` uses a `multiHandler` to fan out log records to both the configured slog handler and `otelslog.NewHandler`, correlating logs with trace context.
-- **Provider setup** — `cmd/gatekeeper/main.go` creates OTLP HTTP exporters for traces, metrics, and logs, registering them as global providers. All configuration is via standard `OTEL_*` env vars (no YAML knobs).
-
-Key env vars for deployment:
-- `OTEL_EXPORTER_OTLP_ENDPOINT` — Collector endpoint (e.g., `https://host.betterstackdata.com`)
-- `OTEL_EXPORTER_OTLP_HEADERS` — Auth headers (e.g., `Authorization=Bearer <token>`)
-- `OTEL_RESOURCE_ATTRIBUTES` — Resource tags (e.g., `environment=production`)
-
-## Development Commands
-
-```bash
-# Build
-go build ./...
-
-# Run tests (includes race detector)
-go test -race ./...
-
-# Run a single test
-go test -race -run TestName ./proxy/
-
-# Vet
-go vet ./...
-
-# Build the binary
-go build -o gatekeeper ./cmd/gatekeeper/
-```
-
-## Code Style
-
-- Follow standard Go conventions and `go fmt` formatting
-- Use `go vet` to catch common issues
-- No `internal/` packages — this is a library module meant to be imported
-
-## Git Commits
-
-- Use [Conventional Commits](https://www.conventionalcommits.org/) format: `type(scope): description`
-  - Types: `feat`, `fix`, `docs`, `style`, `refactor`, `test`, `chore`, `build`, `ci`, `perf`
-  - Scope is optional but encouraged (e.g., `feat(proxy): add header injection`)
-- Do not include `Co-Authored-By` lines for Claude in commit messages
-
-## Security Considerations
-
-This proxy handles sensitive credentials. When making changes:
-
-- Never log credential values (tokens, keys, secrets) — log host/grant names only
-- Credentials must not appear in error messages returned to clients
-- The CA private key must stay in memory only — never written to temp files
-- Validate that TLS interception cannot be bypassed (e.g., via malformed CONNECT requests)
-- Host matching must be exact or use explicit glob patterns — no accidental wildcard leaks
-- Auth token comparison must be constant-time to prevent timing attacks
-
-## Relationship to Moat
-
-This module (`github.com/majorcontext/gatekeeper`) was extracted from moat's `internal/proxy/` package. Moat imports gatekeeper as a dependency and provides the daemon layer (per-run registration, token-scoped contexts, Unix socket management API). Gatekeeper has no knowledge of moat — it's a general-purpose credential-injecting proxy.
-
-## Creating Pull Requests
-
-- Use `gh pr create` with default flags only (no `--base`, `--head`, etc.)
-- If `gh pr create` fails, report the error to the operator immediately
-- Do not attempt to work around failures by adding flags or changing configuration
+@AGENTS.md
diff --git a/config.go b/config.go
@@ -87,9 +87,10 @@ type NetworkConfig struct {
 
 // LogConfig configures logging.
 type LogConfig struct {
-	Level  string `yaml:"level"`  // Log level (e.g., "debug", "info", "warn", "error")
-	Format string `yaml:"format"` // Output format ("json" or "text")
-	Output string `yaml:"output"` // Destination ("stderr", "stdout", or a file path; default: stderr)
+	Level          string   `yaml:"level"`                      // Log level (e.g., "debug", "info", "warn", "error")
+	Format         string   `yaml:"format"`                     // Output format ("json" or "text")
+	Output         string   `yaml:"output"`                     // Destination ("stderr", "stdout", or a file path; default: stderr)
+	CaptureHeaders []string `yaml:"capture_headers,omitempty"`  // Request headers to log and strip before forwarding
 }
 
 // ParseConfig parses a Gate Keeper config from YAML bytes.

diff --git a/config_test.go b/config_test.go
@@ -1,9 +1,13 @@
 package gatekeeper
 
 import (
+	"fmt"
 	"os"
 	"path/filepath"
+	"strings"
 	"testing"
+
+	"github.com/majorcontext/gatekeeper/proxy"
 )
 
 func TestParseConfig_Full(t *testing.T) {
@@ -167,3 +171,79 @@ func TestLoadConfig_NotFound(t *testing.T) {
 		t.Fatal("expected error for missing file")
 	}
 }
+
+func TestParseConfig_CaptureHeaders(t *testing.T) {
+	yaml := `
+log:
+  capture_headers:
+    - X-Workspace-Slug
+    - X-Request-Source
+`
+	cfg, err := ParseConfig([]byte(yaml))
+	if err != nil {
+		t.Fatalf("ParseConfig: %v", err)
+	}
+	if len(cfg.Log.CaptureHeaders) != 2 {
+		t.Fatalf("CaptureHeaders len = %d, want 2", len(cfg.Log.CaptureHeaders))
+	}
+	if cfg.Log.CaptureHeaders[0] != "X-Workspace-Slug" {
+		t.Errorf("CaptureHeaders[0] = %q, want X-Workspace-Slug", cfg.Log.CaptureHeaders[0])
+	}
+}
+
+func TestValidateCaptureHeaders_MaxExceeded(t *testing.T) {
+	headers := make([]string, 11)
+	for i := range headers {
+		headers[i] = fmt.Sprintf("X-Header-%d", i)
+	}
+	err := proxy.ValidateCaptureHeaders(headers)
+	if err == nil {
+		t.Fatal("expected error for >10 headers")
+	}
+	if !strings.Contains(err.Error(), "max 10") {
+		t.Errorf("error = %q, want mention of max 10", err.Error())
+	}
+}
+
+func TestValidateCaptureHeaders_SensitiveRejected(t *testing.T) {
+	tests := []string{"Authorization", "proxy-authorization", "Cookie"}
+	for _, h := range tests {
+		t.Run(h, func(t *testing.T) {
+			err := proxy.ValidateCaptureHeaders([]string{h})
+			if err == nil {
+				t.Fatalf("expected error for sensitive header %q", h)
+			}
+			if !strings.Contains(err.Error(), "sensitive") {
+				t.Errorf("error = %q, want mention of sensitive", err.Error())
+			}
+		})
+	}
+}
+
+func TestValidateCaptureHeaders_Valid(t *testing.T) {
+	err := proxy.ValidateCaptureHeaders([]string{"X-Workspace-Slug", "X-Request-Source"})
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+}
+
+func TestValidateCaptureHeaders_Empty(t *testing.T) {
+	err := proxy.ValidateCaptureHeaders(nil)
+	if err != nil {
+		t.Fatalf("unexpected error for nil: %v", err)
+	}
+	err = proxy.ValidateCaptureHeaders([]string{})
+	if err != nil {
+		t.Fatalf("unexpected error for empty: %v", err)
+	}
+}
+
+func TestValidateCaptureHeaders_Duplicate(t *testing.T) {
+	err := proxy.ValidateCaptureHeaders([]string{"X-Workspace-Slug", "x-workspace-slug"})
+	if err == nil {
+		t.Fatal("expected error for duplicate headers")
+	}
+	if !strings.Contains(err.Error(), "duplicate") {
+		t.Errorf("error = %q, want mention of duplicate", err.Error())
+	}
+}