pii-proxy

Privacy layer for AI agents. Mask PII before it reaches any LLM — unmask when writing back to your systems. PII detection runs locally and never leaves your infrastructure.

Your data ──→ [pii-proxy] ──→ LLM sees only fake data ──→ [pii-proxy] ──→ Real data restored
              mask()           plausible fakes                unmask()       perfect round-trip

Works with Node.js, Bun, and any OpenAI-compatible API (Claude, GPT, local models).

Local detection. Fine-tuned gliner_small-v2.1 on Nemotron-PII healthcare: 96.1% F1 coarse, 94.9% F1 fine, 144ms/record — within 1.3pp of NVIDIA's flagship gliner-PII at 1.5x the speed (verified on the same test set). Or use the BERT classifier path for 93.9% F1 at 26ms. All local, no cloud dependency. Reproducible benchmarks →

Why

Your AI agent processes patient records, insurance claims, customer data. You don't want real names, emails, and ID numbers hitting Claude or GPT. But token-based masking (PERSON_1, EMAIL_2) degrades fluency — LLMs lose track of meaningless placeholders across long contexts.

pii-proxy replaces PII with plausible fake values — the model parses realistic-looking text fluently, and a bijective map reverses every fake when you write back. (Fluency, not correctness — see When this works for failure modes.)

Install

npm install pii-proxy

Quick start

import { PrivacyProxy } from 'pii-proxy';

const proxy = new PrivacyProxy();

// Mask PII with plausible fakes
const masked = await proxy.mask(
  "Ship order to alex@example.com, tracking AETH0000345323DY"
);
// → "Ship order to alex@johnson.net, tracking BFUI0000482918EZ"

// Send masked.text to your LLM...

// Reverse all fakes back to real values
const real = proxy.unmask(llmResponse);
// "I'll notify alex@johnson.net" → "I'll notify alex@example.com"

Local LLM detection

Regex catches emails, IPs, tracking numbers. But what about "Patient: Marcus Weber"? That's a name — no regex will reliably find it.

v0.2 adds a local LLM detection layer. A model running on your machine (via Ollama) detects names, organizations, locations, and domain-specific entities. PII never leaves your network — not even for detection.

import { PrivacyProxy } from 'pii-proxy';

// Regex detectors + local LLM via Ollama
const proxy = PrivacyProxy.withLocalLlm({ model: 'qwen3:1.7b' });

const masked = await proxy.mask(
  "Patient Marcus Weber, treated at Universitätsklinikum Heidelberg. Contact: marcus.weber@gmail.com"
);
// → "Patient James Thompson, treated at Bradtke Medical. Contact: lizeth53@yahoo.com"

Setup:

# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull a model (qwen3:1.7b is fast and great for entity detection, ~1.4GB)
ollama pull qwen3:1.7b

Layered pipeline

Detection runs in layers — fast regex first, then LLM for what regex can't catch:

Text ──→ [Regex Layer] ──→ [Local LLM Layer] ──→ Deduplicated detections
           emails            person names
           phones             organizations
           IPs                locations
           UUIDs              medical records
           credit cards       insurance IDs
           tracking #s        custom entities

Overlapping detections are deduplicated automatically (regex wins ties).

Configuring the LLM detector

import { PrivacyProxy, LlmDetector, defaultDetectors } from 'pii-proxy';

// Use a different model
const proxy = PrivacyProxy.withLocalLlm({ model: 'qwen3:0.6b' });

// Point to a remote Ollama instance or any OpenAI-compatible API
const proxy2 = PrivacyProxy.withLocalLlm({
  endpoint: 'https://your-server.com/v1/chat/completions',
  model: 'gpt-4o-mini',
});

// Detect only specific entity types (faster, more focused)
const proxy3 = PrivacyProxy.withLocalLlm({
  entityTypes: ['person_name', 'organization'],
});

// Full control — compose your own detector stack
const proxy4 = new PrivacyProxy({
  detectors: [
    ...defaultDetectors,                          // regex layer
    new LlmDetector({ model: 'qwen3:1.7b' }),    // LLM layer
    // add more layers here — custom NER, dictionary lookup, etc.
  ],
});

Detector order = priority. Each detector returns Detection[] (or Promise<Detection[]> for async). The first detector to claim a span wins; later detectors that overlap that span are dropped. Put your most-specific detectors first.

How it works

Detect — layered pipeline finds PII entities (regex + optional local LLM).
Replace — each entity is replaced with a plausible fake of the same type (an email becomes another email, a name becomes another name).
Map — a bijective map ensures the same real value always maps to the same fake, and vice versa. Consistent within a session, reversible at any time.

Real:   "Contact Marcus Weber at marcus@example.com"
         ↓ mask()
Fake:   "Contact James Thompson at cornell62@hotmail.com"
         ↓ send to LLM → get response
LLM:    "I've drafted an email to James Thompson"
         ↓ unmask()
Real:   "I've drafted an email to Marcus Weber"

When this works (and when it doesn't)

Like-for-like replacement preserves fluency, not correctness. The model parses realistic-looking text without losing entity tracking over many opaque tokens — but it's reasoning about the fake, not the real.

Works well for:

Drafting, replying, summarization, extraction, routing
Multi-entity tracking where opaque PERSON_1 tokens degrade attention
Any task where the entity is a referent, not analyzed for its surface properties

Breaks (silently) for:

Surface inference. The model infers from the fake's surface — locale, gender, demographics. Marcus Weber → Mei Chen is a legal swap; "draft this in the patient's likely language" picks the wrong one.
Cross-entity coherence. Marcus Weber and Anna Weber get severed; the model loses the family relationship.
Generated new PII. The model can invent associated names ("Dr. Schmidt") that were never in the map — unmask leaves them in, and hallucinated PII leaks through.

If your task leans on entity surface properties, treat pii-proxy as a fluency layer, not a correctness layer. For high-stakes inference, defense in depth: pii-proxy + structured output schema + post-hoc validation.

Entity types

Type	Detection	Fake replacement
Email	Regex	Realistic fake email
Phone	Regex	Format-preserving fake
Credit card	Regex + Luhn	Valid fake card number
IP address	Regex	Random valid IP
UUID	Regex	Random UUID
URL	Regex	Sanitized URL
Tracking number	Regex (UPS, USPS, DHL, etc.)	Format-preserving fake
Person name	Local LLM	Faker name
Organization	Local LLM	Faker company
Location	Local LLM	Faker address/city
Date of birth	Local LLM	Format-preserving fake date
Medical record	Local LLM	Format-preserving fake
Insurance ID	Local LLM	Format-preserving fake
Custom	Local LLM	Format-preserving fallback

Structured data

Mask entire objects (e.g., tool call inputs):

const { masked } = await proxy.maskObject({
  to: "alex@example.com",
  subject: "Order update",
  body: "Tracking: AETH0000345323DY",
  metadata: { ip: "10.0.0.1" }
});

// masked.to → "alex@johnson.net"
// masked.subject → "Order update" (no PII, unchanged)
// masked.body → "Tracking: BFUI0000482918EZ"
// masked.metadata.ip → "172.45.123.89"

// Reverse everything
const original = proxy.unmaskObject(masked);

Custom detectors

Any object with a detect(text) method is a detector. Use this to add domain-specific patterns, call external NER APIs, or integrate your own models:

import { PrivacyProxy, defaultDetectors, LlmDetector } from 'pii-proxy';

// Domain-specific: detect German health insurance numbers (Versichertennummer)
const germanInsuranceDetector = {
  detect(text) {
    const re = /\b[A-Z]\d{9}\b/g;
    const results = [];
    let m;
    while ((m = re.exec(text)) !== null) {
      results.push({ type: 'insurance_id', value: m[0], start: m.index, end: m.index + m[0].length });
    }
    return results;
  }
};

// Stack: regex → your domain detector → LLM for everything else
const proxy = new PrivacyProxy({
  detectors: [
    ...defaultDetectors,
    germanInsuranceDetector,
    new LlmDetector({ model: 'qwen3:1.7b' }),
  ],
});

You can also add custom generators for your entity types:

const proxy = new PrivacyProxy({
  detectors: [...defaultDetectors, new LlmDetector()],
  generators: {
    // Custom replacement for your entity type
    insurance_id: (real) => 'X' + Math.random().toString().slice(2, 11),
  },
});

Security model

pii-proxy is designed so that real PII never reaches the cloud LLM.

Data flow:

┌─────────────────────────────────────────────────────┐
│  Your infrastructure (on-prem / VPC)                │
│                                                     │
│  Real data ──→ Regex detection (in-process)         │
│            ──→ Local LLM detection (Ollama, local)  │
│            ──→ Fake replacement (in-process)         │
│                        │                            │
│                        ▼                            │
│              Masked data (fakes only)               │
└────────────────────────┬────────────────────────────┘
                         │ only fake data crosses this boundary
                         ▼
               ┌──────────────────┐
               │  Cloud LLM API   │
               │  (Claude, GPT)   │
               └──────────────────┘

Detection is local. Regex runs in-process. The LLM detector calls a model on your machine or your private network — never a cloud API.
The bijective map is sensitive. It maps real values to fakes — treat it like the data itself. Encrypt at rest, scope per session, and control access. Use proxy.getMap().serialize() for persistence; the format is a JSON array of [real, fake] pairs.
Unmask is deterministic. Same map always produces the same reversal. No network calls, no side effects.
Round-trip integrity. Every mask() → unmask() cycle restores the original text exactly. This is tested on every commit.

What pii-proxy does NOT do:

It does not guarantee 100% PII detection — regex has known patterns, the LLM layer catches most names/orgs/locations, but novel entity types may slip through. Defense in depth is recommended.
It does not encrypt the map for you — integrate with your existing secrets management (Vault, KMS, encrypted storage).
It does not log or audit automatically — call proxy.getMap().entries() to inspect or log what was masked per session.

Persistence

⚠ The map IS the PII. It maps every real value to its fake — anyone with the map can reverse every masked record. Encrypt before storing. See Security model.

Save and restore the map across sessions:

// Save — encrypt the serialized map before storing
const data = proxy.getMap().serialize();
await redis.set('pii-session:123', encrypt(data));  // bring your own encryption (Vault, KMS, libsodium)

// Restore in a new process
const proxy2 = new PrivacyProxy();
proxy2.loadMap(decrypt(await redis.get('pii-session:123')));
proxy2.unmask(text); // works with the same mappings

Examples

Health data with local LLM (examples/health-data.ts)

Full round-trip — local LLM detects patient names and providers, Claude analyzes the masked record, unmask restores real data:

export ANTHROPIC_API_KEY=sk-...
bun run examples/health-data.ts

Anthropic SDK integration (examples/anthropic-agent.ts)

export ANTHROPIC_API_KEY=sk-...
bun run examples/anthropic-agent.ts

Benchmarks

Evaluation on NVIDIA's Nemotron-PII healthcare subset, same held-out 100-record test set across all methods. All numbers independently verified (verify.py).

Truly fair head-to-head — same labels, same vocabulary, same query strategy:

Method	Fine F1	Coarse F1	Latency	Weights
nvidia/gliner-PII	96.2%	96.7%	211ms	1699MB
Our fine-tuned `gliner_small` (exp 004)	94.9%	96.1%	144ms	582MB
Our BERT classifier (exp 002)	—	93.9%	26ms	438MB
Claude Sonnet (proper prompt)	—	92.5%	2.5s	cloud
`gliner_small-v2.1` (zero-shot baseline)	54.8%	—	115ms	582MB

The honest takeaway: NVIDIA's flagship is still slightly more accurate. We're 1.3pp behind at fine granularity and 0.6pp behind at coarse granularity — but 1.5x faster and on a 3x smaller base model. The BERT classifier is the fastest option at 26ms when zero-shot capability isn't needed.

Three valid optima depending on your constraints:

Need maximum accuracy?         nvidia/gliner-PII   (96.7% coarse, 211ms, 1.7GB)
Need balance + zero-shot?      Our gliner_small     (96.1% coarse, 144ms, 582MB)
Need sub-30ms latency?         Our BERT classifier  (93.9% coarse,  26ms, 438MB)

The compile pipeline works — gliner_small-v2.1 (54.8% baseline) → 96.1% coarse F1 after 15 minutes of fine-tuning on a laptop. Within striking distance of NVIDIA's flagship.

Methodology pitfall (worth knowing): GLiNER bi-encoder models are extremely sensitive to query string choice. NVIDIA's gliner-PII scores 79.8% with simple coarse queries ("person_name"), 90.4% with natural-language ("person name"), and 97.2% with fine native labels ("first_name", "last_name"). The same model — three very different scores. Anyone benchmarking GLiNER must report the query strategy. See verify_labels.py.

Recommended training pattern: train on fine labels, collapse to coarse post-hoc. Beats training directly on coarse (96.1% vs 95.5% in our experiments — richer supervision signal during training).

See experiments/ for every script, every result, every caveat.

Comparison with alternatives

	pii-proxy	Presidio	Private AI	Nightfall
Data leaves your infra	No	No	Yes	Yes
Round-trip unmask	Yes	No	No	No
Replacement	Plausible fakes	Tokens (`<PERSON>`)	Tokens	Tokens
Custom entity types	Pluggable detectors	Custom recognizers	Limited	Limited
License	MIT	MIT	Commercial	Commercial

Roadmap

v0.1 — Regex detection, faker replacement, bijective round-trip
v0.2 — Pluggable entity detection — bring your own detectors (local LLM, custom regex). Layered pipeline: fast regex first, LLM for names/locations/domain-specific entities
v0.3 — Tool-aware selective masking (keep location real for hotel search, mask for email)
v0.4 — Persistent map backends (Redis, SQLite)
v0.5 — Anthropic/OpenAI SDK middleware (drop-in agent integration)

License

MIT — built by Daslab.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/workflows		.github/workflows
bin		bin
examples		examples
experiments		experiments
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pii-proxy

Why

Install

Quick start

Local LLM detection

Layered pipeline

Configuring the LLM detector

How it works

When this works (and when it doesn't)

Entity types

Structured data

Custom detectors

Security model

Persistence

Examples

Health data with local LLM (examples/health-data.ts)

Anthropic SDK integration (examples/anthropic-agent.ts)

Benchmarks

Comparison with alternatives

Roadmap

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pii-proxy

Why

Install

Quick start

Local LLM detection

Layered pipeline

Configuring the LLM detector

How it works

When this works (and when it doesn't)

Entity types

Structured data

Custom detectors

Security model

Persistence

Examples

Health data with local LLM (examples/health-data.ts)

Anthropic SDK integration (examples/anthropic-agent.ts)

Benchmarks

Comparison with alternatives

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages