Add replay graph core skeleton by ProfRandom92 · Pull Request #150 · ProfRandom92/Comptextv7

ProfRandom92 · 2026-05-20T15:43:54Z

Motivation

Provide a minimal, deterministic, pure-Python graph utility layer to reason about replay-critical relations for future replay-validation work while keeping the change small and test-covered.

Description

Summary: Added a new deterministic replay graph utility package under src/comptext_v7/graph/ implementing topology, ordering, reachability, and deterministic edge-diff evidence with a small public export surface.
Changed files: src/comptext_v7/graph/__init__.py, src/comptext_v7/graph/topology.py, src/comptext_v7/graph/ordering.py, src/comptext_v7/graph/reachability.py, src/comptext_v7/graph/evidence.py, tests/test_replay_graph_core.py.
Key technical changes: added normalize_edges, nodes_from_edges, adjacency_map (topology); find_order_violations (ordering); reachable_nodes, has_path (reachability); ReplayGraphDiff dataclass and compare_edges (evidence); all outputs are immutable/tuple-based and deterministic.
Scope note: graph utility skeleton only; no artifacts changed; no fixtures changed; no README/docs changed; no runtime/orchestration behavior added.

Testing

Ran pytest tests/test_replay_graph_core.py -q and the module tests passed (10 passed).
Ran pytest tests/test_failure_taxonomy.py -q (4 passed) and pytest tests/test_fixture_manifest.py -q (8 passed).
Ran full CI-style check npm run check which completed and ran the test suite; full test run reported 247 passed across the repository.
All automated checks executed here passed successfully.

Codex Task

gemini-code-assist

Code Review

This pull request introduces a set of deterministic graph utility functions for replay validation, including edge normalization, adjacency mapping, reachability analysis, and sequence ordering checks. The feedback identifies several logic and performance improvements: handling duplicate nodes in sequence ordering by using the first occurrence, removing redundant normalization calls when calculating node sets, optimizing reachability queries via optional pre-computed adjacency maps, and simplifying the return logic in the reachability helper.

gemini-code-assist · 2026-05-20T15:45:35Z

+    required_before: Iterable[Edge],
+) -> tuple[Edge, ...]:
+    """Return lexicographically sorted order violations."""
+    positions = {node: index for index, node in enumerate(sequence)}


The current implementation overwrites the index for duplicate nodes in the sequence, effectively using the last occurrence. In replay sequences where events or states might repeat, this can lead to incorrect violation reports. For example, in a sequence ['A', 'B', 'A'], a requirement that A precedes B would be flagged as a violation because the last A (index 2) is after B (index 1). Using the first occurrence is generally more appropriate for causal ordering checks.

Suggested change

positions = {node: index for index, node in enumerate(sequence)}

positions: dict[str, int] = {}

for index, node in enumerate(sequence):

if node not in positions:

positions[node] = index

gemini-code-assist · 2026-05-20T15:45:36Z

+    original_nodes = set(nodes_from_edges(original))
+    replay_nodes = set(nodes_from_edges(replay))
+
+    missing_nodes = tuple(sorted(original_nodes - replay_nodes))
+    added_nodes = tuple(sorted(replay_nodes - original_nodes))


Calling nodes_from_edges here is redundant because it internally calls normalize_edges, which has already been performed on lines 29-30. Since original and replay are already normalized tuples of edges, you can extract the nodes more efficiently using a set comprehension.

Suggested change

original_nodes = set(nodes_from_edges(original))

replay_nodes = set(nodes_from_edges(replay))

missing_nodes = tuple(sorted(original_nodes - replay_nodes))

added_nodes = tuple(sorted(replay_nodes - original_nodes))

original_nodes = {n for edge in original for n in edge}

replay_nodes = {n for edge in replay for n in edge}

missing_nodes = tuple(sorted(original_nodes - replay_nodes))

added_nodes = tuple(sorted(replay_nodes - original_nodes))

gemini-code-assist · 2026-05-20T15:45:36Z

+def reachable_nodes(edges: Iterable[Edge], start: str) -> tuple[str, ...]:
+    """Return sorted reachable nodes from start.
+
+    The start node is excluded unless it is reachable through a cycle.
+    """
+    adjacency = adjacency_map(edges)


Reconstructing the adjacency map on every call to reachable_nodes (and has_path) is inefficient, especially for multiple queries on the same graph. Consider allowing an optional pre-computed adjacency map to be passed in to avoid the $O(E \log E)$ overhead of normalization and map construction.

Suggested change

def reachable_nodes(edges: Iterable[Edge], start: str) -> tuple[str, ...]:

"""Return sorted reachable nodes from start.

The start node is excluded unless it is reachable through a cycle.

"""

adjacency = adjacency_map(edges)

def reachable_nodes(

edges: Iterable[Edge],

start: str,

adjacency: dict[str, tuple[str, ...]] | None = None,

) -> tuple[str, ...]:

"""Return sorted reachable nodes from start.

The start node is excluded unless it is reachable through a cycle.

"""

if adjacency is None:

adjacency = adjacency_map(edges)

gemini-code-assist · 2026-05-20T15:45:36Z

+
+    if start in seen:
+        return tuple(sorted(seen))
+    return tuple(sorted(node for node in seen if node != start))


The conditional check node != start is redundant. If start is not in seen (no cycle), then the condition is always true for all elements in seen. If start is in seen (cycle), it is already handled by the if start in seen block on line 32. Simplifying this to return tuple(sorted(seen)) improves clarity.

Suggested change

return tuple(sorted(node for node in seen if node != start))

return tuple(sorted(seen))

Add replay graph core skeleton

b36fc73

ProfRandom92 added the codex label May 20, 2026 — with ChatGPT Codex Connector

gemini-code-assist Bot reviewed May 20, 2026

View reviewed changes

ProfRandom92 merged commit 2ff1e98 into main May 20, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add replay graph core skeleton#150

Add replay graph core skeleton#150
ProfRandom92 merged 1 commit into
mainfrom
codex/add-replay-graph-core-skeleton

ProfRandom92 commented May 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-    positions = {node: index for index, node in enumerate(sequence)}
+    positions: dict[str, int] = {}
+    for index, node in enumerate(sequence):
+        if node not in positions:
+            positions[node] = index

	return tuple(sorted(node for node in seen if node != start))
	return tuple(sorted(seen))

Conversation

ProfRandom92 commented May 20, 2026

Motivation

Description

Testing

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant