
feat: SpikeAsync #42

Open
Kelvin-Ng wants to merge 2 commits into main from feat/spike-async

Conversation

@Kelvin-Ng Kelvin-Ng commented Mar 13, 2026

Overview

See ASYNC_API.md for a detailed description of the API and design.

The current implementation is tested and ready to merge.

Known Limitations: Head-of-Line Blocking

The scheduler can exhibit head-of-line (HoL) blocking, which may be surprising to users.

In practice, I've worked around this by manually adding extra dependencies between operations to constrain the scheduler's decision space. However, this makes real-world usage of SpikeAsync more involved than the simple model described in ASYNC_API.md.

Why It Happens

The scheduler performs a topological sort at scheduling time. When only logical dependencies are specified, multiple valid topological orderings exist — and some of them are suboptimal. Different scheduling algorithms will select different orderings among the valid ones.
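
To see how quickly the valid orderings multiply, here is a small generic sketch (not SpikeAsync code): even a DAG with just two independent two-step chains admits six valid topological orders, and only some of them interleave the chains.

```python
# Generic illustration: enumerate every topological order of a tiny DAG.
# `deps` maps each node to its set of prerequisites.
def all_topo_orders(deps):
    def rec(done, remaining):
        if not remaining:
            yield []
            return
        for node in sorted(remaining):
            if deps[node] <= done:  # all prerequisites already completed
                for rest in rec(done | {node}, remaining - {node}):
                    yield [node] + rest
    yield from rec(set(), set(deps))

# Two independent chains: a -> b and c -> d
deps = {"a": set(), "b": {"a"}, "c": set(), "d": {"c"}}
orders = list(all_topo_orders(deps))
print(len(orders))  # → 6

# Both the "piled-up" order and the interleaved order are valid:
assert ["a", "c", "b", "d"] in orders
assert ["a", "b", "c", "d"] in orders
```

A scheduler that only honors the logical edges is free to pick any of these, which is exactly where the suboptimal choices come from.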

The current algorithm uses FIFO ordering. Consider the following example:

```python
async def foo():
    await use_resource_a()  # X
    await use_resource_a()  # Y
    await use_resource_b()  # Z
    await use_resource_b()  # W
```

Running 3 instances concurrently:

```python
for i in range(3):
    spike_async.submit(foo())
```

When use_resource_*() is called, the request is immediately pushed to the corresponding resource's FIFO queue. Because all 3 instances of X are enqueued before any instance of Y, the resulting schedule is:

```
A: [X0][X1][X2][Y0][Y1][Y2]
B:                 [Z0][Z1][Z2][W0][W1][W2]
```

Resource B sits idle while resource A drains the backlog — a classic HoL blocking scenario. The ideal interleaving would be:

```
A: [X0][Y0][X1][Y1][X2][Y2]
B:         [Z0][W0][Z1][W1][Z2][W2]
```
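
To make the gap concrete, here is a toy discrete-time simulation of the FIFO behavior. This is not the actual SpikeAsync scheduler; unit-cost operations, one FIFO queue per resource, and issue-order tie-breaking within a tick are all simplifying assumptions.

```python
from collections import deque

# Toy model: unit-time ops, one FIFO queue per resource.
# Task names X/Y/Z/W and resources A/B mirror the example above.
def fifo_schedule(n):
    tasks, deps, res, stage = [], {}, {}, {}
    for i in range(n):
        for s, (name, r) in enumerate([("X", "A"), ("Y", "A"),
                                       ("Z", "B"), ("W", "B")]):
            tid = f"{name}{i}"
            tasks.append(tid)
            res[tid], stage[tid] = r, s
            deps[tid] = [] if s == 0 else [f"{'XYZW'[s - 1]}{i}"]

    queues = {"A": deque(), "B": deque()}
    pending, done, start, t = set(tasks), set(), {}, 0
    for tid in tasks:  # initially-ready tasks enqueue in submission order
        if not deps[tid]:
            queues[res[tid]].append(tid)
            pending.discard(tid)
    while len(done) < len(tasks):
        ran = [q.popleft() for q in queues.values() if q]
        for tid in ran:
            start[tid] = t
        t += 1
        done.update(ran)
        # Push newly-ready requests; within a tick, earlier pipeline
        # stages first (models requests issued in program order).
        ready = sorted((tid for tid in pending
                        if all(d in done for d in deps[tid])),
                       key=lambda tid: (stage[tid], tid))
        for tid in ready:
            queues[res[tid]].append(tid)
            pending.discard(tid)
    return start, t

start, makespan = fifo_schedule(3)
print(f"Z0 starts at t={start['Z0']}, makespan={makespan}")
# → Z0 starts at t=4, makespan=10
```

Under these assumptions, resource B idles for the first 4 ticks and the whole batch takes 10 ticks, versus 8 for the ideal interleaving above.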

Workaround

We can achieve optimal interleaving without modifying the scheduler by adding cross-instance dependency edges:

  1. X(i+1) depends on Y(i) — prevents X's from piling up ahead of Y's on resource A
  2. Z(i+1) depends on W(i) — prevents Z's from piling up ahead of W's on resource B
```python
prev_y = None
prev_w = None

for i in range(3):
    # X: cross-instance edge 1 — wait on the previous instance's Y
    x = spike_async.use_resource_a(deps=[prev_y] if prev_y else [])
    # Y: logical dependency on X
    y = spike_async.use_resource_a(deps=[x])
    # Z: logical dependency on Y, plus cross-instance edge 2 on the
    # previous instance's W
    z = spike_async.use_resource_b(deps=[y] + ([prev_w] if prev_w else []))
    # W: logical dependency on Z
    w = spike_async.use_resource_b(deps=[z])
    prev_y = y
    prev_w = w

w.wait()  # wait for the last instance's final operation
```
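
A quick way to see why this works, sketched with Python's standard `graphlib` (the task names are the hypothetical X/Y/Z/W from the example, not the SpikeAsync API): the cross-instance edges make each resource's tasks totally ordered, so every valid topological order — regardless of the scheduler's tie-breaking — is forced into the interleaved schedule.

```python
from graphlib import TopologicalSorter

n = 3
deps = {}
for i in range(n):
    # Logical edges: Y after X, Z after Y, W after Z.
    # Cross-instance edges: X(i) after Y(i-1), Z(i) after W(i-1).
    deps[f"X{i}"] = {f"Y{i-1}"} if i else set()
    deps[f"Y{i}"] = {f"X{i}"}
    deps[f"Z{i}"] = {f"Y{i}"} | ({f"W{i-1}"} if i else set())
    deps[f"W{i}"] = {f"Z{i}"}

order = list(TopologicalSorter(deps).static_order())
a_order = [t for t in order if t[0] in "XY"]  # resource A's tasks
b_order = [t for t in order if t[0] in "ZW"]  # resource B's tasks
print(a_order)  # ['X0', 'Y0', 'X1', 'Y1', 'X2', 'Y2']
print(b_order)  # ['Z0', 'W0', 'Z1', 'W1', 'Z2', 'W2']
```

Because each per-resource sequence is now a chain in the DAG, the restriction of any topological order to one resource is unique — there is no bad ordering left for FIFO to pick.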

Future Work

I'm considering alternative scheduling algorithms, but there is no one-size-fits-all solution — different algorithms perform best under different workload patterns. I believe this warrants further discussion and is best treated as future work.

@Kelvin-Ng Kelvin-Ng changed the title SpikeAsync implementation feat: SpikeAsync Mar 13, 2026
@Kelvin-Ng Kelvin-Ng requested a review from vgene March 13, 2026 01:08
@Kelvin-Ng Kelvin-Ng self-assigned this Mar 13, 2026
@Kelvin-Ng Kelvin-Ng added the enhancement New feature or request label Mar 13, 2026
@Kelvin-Ng Kelvin-Ng force-pushed the feat/spike-async branch 2 times, most recently from 3c31730 to 742f369 on March 13, 2026 22:50
@Kelvin-Ng Kelvin-Ng marked this pull request as ready for review March 13, 2026 22:52
@Kelvin-Ng Kelvin-Ng requested a review from a team March 13, 2026 22:52
