fix: flush pgstat counters from worker so autovacuum sees its writes by utkarash2991 · Pull Request #254 · supabase/pg_net

utkarash2991 · 2026-05-17T11:46:54Z

The `pg_net` background worker performs DML on `net._http_response` and `net.http_request_queue` via 
SPI but never calls `pgstat_report_stat()`. As a result, per-write counters (`n_tup_ins`, `n_tup_del`,
`n_mod_since_analyze`) for the worker's writes never reach shared stats. Autovacuum/autoanalyze
read those counters to decide when to vacuum — when they stay at zero, the launcher never schedules a
run, and `net._http_response` accumulates bloat indefinitely.
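
To see why zeroed counters starve autovacuum, here is a minimal sketch of the threshold check, assuming PostgreSQL's documented formula (base threshold plus scale factor times `reltuples`) and stock default settings. `TableStats`, `needs_vacuum` and `needs_analyze` are hypothetical names for illustration, not PostgreSQL or pg_net APIs, and `n_dead_tup` stands in for the dead-tuple count that reported deletes feed.

```python
from dataclasses import dataclass


@dataclass
class TableStats:
    reltuples: float          # row estimate from pg_class.reltuples
    n_dead_tup: int           # dead tuples visible in shared stats
    n_mod_since_analyze: int  # changes reported since the last analyze


def needs_vacuum(s, base_threshold=50, scale_factor=0.2):
    # Defaults mirror autovacuum_vacuum_threshold / autovacuum_vacuum_scale_factor.
    return s.n_dead_tup > base_threshold + scale_factor * s.reltuples


def needs_analyze(s, base_threshold=50, scale_factor=0.1):
    # Defaults mirror autovacuum_analyze_threshold / autovacuum_analyze_scale_factor.
    return s.n_mod_since_analyze > base_threshold + scale_factor * s.reltuples


# Worker writes never flushed: shared stats still read zero, so nothing triggers.
unreported = TableStats(reltuples=0, n_dead_tup=0, n_mod_since_analyze=0)
# The same workload once the counters reach shared stats (illustrative numbers).
reported = TableStats(reltuples=1000, n_dead_tup=5000, n_mod_since_analyze=6000)
```

With the zeroed counters both checks return `False` no matter how much the table has actually churned, which matches the symptom described above.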
  
User-backend INSERTs into `net.http_request_queue` (via `net.http_get/post/delete`) keep that 
table's autovacuum cadence healthy in practice — user backends flush pgstat automatically via the main
loop in `tcop/postgres.c`. So the customer-visible failure is specific to `net._http_response`, which 
has no user-backend traffic to compensate. Eventually the bloated `_http_response_created_idx` 
makes the worker's expiry query (`ORDER BY created LIMIT $batch`) walk huge stretches of dead index 
entries, yielding 20–100s DELETEs and IO-wait spikes.

What kind of change does this PR introduce?

Bug fix

What is the current behavior?

Autovacuum and autoanalyze never trigger on `net._http_response`, because the worker's writes to it are never reported to shared stats.

Fix

Call pgstat_report_stat(false) after each worker transaction commits. Per pgstat.c:

"Must be called by processes that performs DML: tcop/postgres.c, logical receiver processes, SPI worker, etc. to flush pending statistics updates to shared memory."

Regular user backends invoke it from the main loop after each query; background workers have no equivalent and must flush themselves.
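
As a rough mental model only (this is not the C change in this PR), the flush can be pictured as publishing process-local pending counts into shared stats. `ToyBackend` and its methods are hypothetical; `report_stat` merely plays the role of `pgstat_report_stat()` in this toy.

```python
from collections import Counter


class ToyBackend:
    """Toy stand-in for a process that keeps pending stats locally."""

    def __init__(self, shared):
        self.shared = shared      # stand-in for the shared stats area
        self.pending = Counter()  # local counts, invisible to other processes

    def insert(self, table, rows=1):
        # DML only bumps the local pending counter.
        self.pending[(table, "n_tup_ins")] += rows

    def report_stat(self):
        # Toy analogue of the flush: add pending counts to shared, then clear.
        self.shared.update(self.pending)
        self.pending.clear()


shared = Counter()
worker = ToyBackend(shared)
worker.insert("net._http_response", rows=30)
seen_before_flush = shared[("net._http_response", "n_tup_ins")]
worker.report_stat()
seen_after_flush = shared[("net._http_response", "n_tup_ins")]
```

In this model a process that never calls `report_stat` leaves the shared count at zero indefinitely, which is the situation the worker was in before the fix.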

Tests

Two regression tests added to test/test_worker_behavior.py:

test_worker_writes_increment_pgstat_counters — drives 30 requests through the worker and asserts pg_stat_user_tables.n_tup_ins > 0 on net._http_response. Before the fix, this stays at 0; with the fix, it reflects the worker's INSERTs.
test_worker_writes_trigger_autoanalyze_on_http_response — end-to-end test that proves the customer-impacting symptom is resolved. Configures autovacuum_naptime=1s plus low per-table thresholds, drives traffic, and waits for autoanalyze_count > 0.
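
The "waits for autoanalyze_count > 0" step could be sketched as a small polling helper. This is an assumption about the shape of such a test, not the code added in this PR: `wait_until` and `AUTOANALYZE_SQL` are hypothetical names, and the probe is passed in as a callable so the sketch does not depend on any particular database driver.

```python
import time

# Query a real test might run through its own connection fixture (assumption).
AUTOANALYZE_SQL = """
select autoanalyze_count
from pg_stat_user_tables
where schemaname = 'net' and relname = '_http_response'
"""


def wait_until(probe, timeout=30.0, interval=0.5,
               clock=time.monotonic, sleep=time.sleep):
    """Call probe() until it returns a truthy value or the timeout expires."""
    deadline = clock() + timeout
    while True:
        value = probe()
        if value:
            return value
        if clock() >= deadline:
            raise TimeoutError("condition not met within %.1fs" % timeout)
        sleep(interval)
```

A test would then call something like `wait_until(lambda: fetch_scalar(AUTOANALYZE_SQL))`, where `fetch_scalar` is whatever helper the test suite uses to run a query and return a single value. Polling with a deadline keeps the test from hanging if autoanalyze never fires.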

Linear: https://linear.app/supabase/issue/PSQL-1216

Background workers performing DML via SPI must call pgstat_report_stat() themselves; regular user backends get this for free via tcop/postgres.c's main loop. Without it, per-write counters (n_tup_ins, n_tup_del, n_mod_since_analyze) for the worker's writes to net._http_response and net.http_request_queue never reach shared stats. Autovacuum/autoanalyze never see anything to vacuum, and net._http_response (written only by the worker) silently bloats - eventually surfacing as 20-100s expiry DELETEs on a bloated `created` index.

za-arthur

Good catch! LGTM. I have just a suggestion below.

utkarash2991 requested review from olirice, steve-chavez and za-arthur May 17, 2026 11:46

za-arthur approved these changes May 18, 2026

Comment thread test/test_worker_behavior.py

utkarash2991 merged commit 3736d0f into master May 18, 2026
28 checks passed

utkarash2991 deleted the fix/pgstat-flush-in-worker branch May 18, 2026 17:26

This was referenced May 18, 2026

fix: report worker activity to pg_stat_activity #255

Merged

bump to 0.20.3 #256

Merged

chore: bump to pg_net 0.20.3 supabase/postgres#2159

Merged
