device: reduce reconnect delay after server restart from 120 s to ~5 s by full-bars · Pull Request #2 · urnetwork/userwireguard

full-bars · 2026-05-12T00:09:00Z

Problem

When the WireGuard server is restarted or redeployed, clients hold session keypairs that the new instance knows nothing about. WireGuard has no protocol-level "server is going away" message, so clients sit idle until their session expires naturally at RekeyAfterTime (120 s). During that window packets are silently dropped.

Raised in the URnetwork Discord: the userspacewireguard fork was identified as the right place to fix this because the kernel implementation has no equivalent hook.

Changes

1. Server-initiated handshake on startup (`device.go` — `upLocked`)

SendHandshakeInitiation is now called for every configured peer when the device comes up, in addition to the existing persistent-keepalive send. The new server proactively reaches out to all peers; clients respond within RekeyTimeout (5 s). Reconnect time after a restart drops from up to 120 s to under 5 s for peers with a known endpoint.

2. `DrainPeers` method + `Config.Drain` flag (`device.go` / `uapi.go`)

DrainPeers() calls ExpireCurrentKeypairs() on every peer — already implemented on Peer, just not wired up at the device level. Exhausting the send nonce makes the client's very next outbound packet trigger a fresh handshake instead of silently failing.

Config.Drain bool exposes this via IpcSet2 so deployment scripts can signal a drain through the existing IPC path before bringing the old process down:

device.IpcSet2(&device.Config{Drain: true})

Behaviour summary

Scenario	Before	After
Server restarts, peer has known endpoint	up to 120 s	~5 s
Server restarts, peer has dynamic endpoint	up to 120 s	up to `PersistentKeepalive` + 15 s
Graceful drain before restart	up to 120 s	immediate re-handshake on next send

Notes

bindtest compile errors are pre-existing on master and unrelated to these changes (./device/... builds cleanly).
A proper protocol-level CloseNotify message would be the cleanest long-term fix, but these two hooks address the problem without any protocol changes and are backwards-compatible.

Without a way to notify clients of a server restart, clients wait up to RekeyAfterTime (120 s) before re-handshaking with the new instance. Two changes to close that gap: - upLocked: call SendHandshakeInitiation for every peer when the device comes up. The new server proactively reaches out to all configured peers so they can re-establish in under RekeyTimeout (5 s) rather than waiting for natural session expiry. - DrainPeers / Config.Drain: add an explicit drain signal. Calling DrainPeers() (or setting Drain: true in IpcSet2) expires all current keypairs so the next send from each client triggers an immediate re-handshake instead of silently using a session the restarted server no longer knows about.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

device: reduce reconnect delay after server restart from 120 s to ~5 s#2

device: reduce reconnect delay after server restart from 120 s to ~5 s#2
full-bars wants to merge 1 commit into
urnetwork:masterfrom
full-bars:fix/session-rotation-on-drain-startup

full-bars commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

full-bars commented May 12, 2026

Problem

Changes

1. Server-initiated handshake on startup (device.go — upLocked)

2. DrainPeers method + Config.Drain flag (device.go / uapi.go)

Behaviour summary

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

1. Server-initiated handshake on startup (`device.go` — `upLocked`)

2. `DrainPeers` method + `Config.Drain` flag (`device.go` / `uapi.go`)