Refactor workflow to DAG architecture with expanded testing, CI modernisation, and validated numerical equivalence by harryswift01 · Pull Request #281 · CCPBioSim/CodeEntropy

harryswift01 · 2026-02-25T16:52:32Z

Summary

This PR introduces a major architectural refactor that transitions CodeEntropy to a DAG-based execution model while preserving numerical behaviour. The refactor improves modularity, maintainability, and testability, and modernises the project’s testing, CI, and tooling infrastructure.

All systems and component-level contributions match the previous implementation within floating-point tolerance (maximum absolute difference 2.45e-08).

No breaking changes to user-facing behaviour are expected.

Motivation

The previous workflow was primarily procedural, which made it harder to reason about execution order, extend functionality, and test individual stages. Moving to a DAG model improves separation of concerns, enables clearer data flow, and provides a stronger foundation for future development while maintaining numerical equivalence.

Changes

DAG-based workflow architecture

Refines orchestration by separating static setup, per-frame execution, and reduction stages.
Decomposes workflow into smaller nodes (detection, bead construction, covariance, reducers).
Standardises shared context passing between nodes.
Implements incremental reduction (streaming mean) for covariance accumulation.
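As a rough illustration of the node decomposition and shared context described above, the following is a minimal sketch and not the engine in this PR. It uses the standard-library `graphlib` for ordering; the node functions, the `run_dag` helper, and the context keys are all hypothetical names chosen for the example.

```python
from graphlib import TopologicalSorter
from typing import Any, Callable, Dict, Iterable

Context = Dict[str, Any]
Node = Callable[[Context], None]


def run_dag(nodes: Dict[str, Node], deps: Dict[str, Iterable[str]], ctx: Context) -> Context:
    """Run each node once, after its dependencies, passing one shared context."""
    # TopologicalSorter expects a mapping of node -> set of predecessors.
    graph = {name: set(deps.get(name, ())) for name in nodes}
    order = list(TopologicalSorter(graph).static_order())
    for name in order:
        nodes[name](ctx)
    ctx["execution_order"] = order
    return ctx


# Hypothetical nodes, loosely named after the stages listed above.
def detect_levels(ctx: Context) -> None:
    ctx["levels"] = ["united_atom", "residue", "polymer"]


def build_beads(ctx: Context) -> None:
    ctx["beads"] = {level: [] for level in ctx["levels"]}


def covariance(ctx: Context) -> None:
    ctx["covariance_levels"] = sorted(ctx["beads"])


# Registration order is deliberately scrambled; the dependency map decides the order.
nodes = {"covariance": covariance, "detect_levels": detect_levels, "build_beads": build_beads}
deps = {"build_beads": ["detect_levels"], "covariance": ["build_beads"]}
result = run_dag(nodes, deps, {})
print(result["execution_order"])
```

Each node reads only what earlier nodes placed in the context, which is what makes individual stages testable in isolation.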

Frame-level covariance computation

Introduces FrameCovarianceNode for per-frame second-moment matrices.
Adds optional combined force-torque block matrix generation at the highest level.
Standardises axis handling via axes_manager.
Improves robustness for missing beads and metadata.
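The combination of per-frame second-moment matrices with a streaming mean can be sketched as follows. This is an illustrative stand-in written with plain lists so that it runs without dependencies; `second_moment` and `StreamingMean` are hypothetical names, and the real `FrameCovarianceNode` may differ in structure.

```python
from typing import List, Optional

Matrix = List[List[float]]


def second_moment(vector: List[float]) -> Matrix:
    """Outer product v v^T for one frame's (hypothetical) force or torque vector."""
    return [[a * b for b in vector] for a in vector]


class StreamingMean:
    """Running mean of matrices; holds one matrix instead of every frame."""

    def __init__(self) -> None:
        self.count = 0
        self.mean: Optional[Matrix] = None

    def update(self, matrix: Matrix) -> None:
        self.count += 1
        if self.mean is None:
            self.mean = [row[:] for row in matrix]
            return
        # Incremental update: mean_n = mean_{n-1} + (x_n - mean_{n-1}) / n
        for i, row in enumerate(matrix):
            for j, value in enumerate(row):
                self.mean[i][j] += (value - self.mean[i][j]) / self.count


frames = [[1.0, 2.0], [3.0, 4.0]]  # made-up per-frame vectors
acc = StreamingMean()
for vector in frames:
    acc.update(second_moment(vector))
print(acc.mean)
```

The incremental form gives the same mean as summing all frames and dividing at the end, up to floating-point rounding, while keeping memory use independent of trajectory length.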

CLI and job-folder execution model

Ensures consistent job folder creation for each run.
Guarantees output artifacts land in job directories.
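One way to obtain a fresh job folder per run is sketched below. The `jobNNN` naming follows the `job001` path in the example output further down; the `create_job_dir` helper and its numbering rule are assumptions for illustration, not the PR's implementation.

```python
import re
from pathlib import Path

_JOB_PATTERN = re.compile(r"job(\d{3,})")


def create_job_dir(parent: Path) -> Path:
    """Create and return the next free jobNNN directory under parent (hypothetical helper)."""
    parent.mkdir(parents=True, exist_ok=True)
    used = []
    for entry in parent.iterdir():
        match = _JOB_PATTERN.fullmatch(entry.name)
        if entry.is_dir() and match:
            used.append(int(match.group(1)))
    job_dir = parent / f"job{max(used, default=0) + 1:03d}"
    job_dir.mkdir()  # raises FileExistsError if another process created it first
    return job_dir
```

Output files can then be written relative to the returned path so that every artifact of a run stays in one place.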

ResultsReporter improvements

Reworks JSON output into grouped structure:
groups: { "<group_id>": { components: {...}, total: ... } }
Groups console tables by Group ID.
Adds metadata sections (args, provenance).
Adds utilities for argument serialization and git SHA detection.

Example output JSON.

{
  "args": {
    "top_traj_file": [
      "/home/tdo96567/BioSim/test_data/dna/md_A4_dna.tpr",
      "/home/tdo96567/BioSim/test_data/dna/md_A4_dna_xf.trr"
    ],
    "force_file": null,
    "file_format": null,
    "kcal_force_units": false,
    "selection_string": "all",
    "start": 0,
    "end": 1,
    "step": 1,
    "bin_width": 30,
    "temperature": 298.0,
    "verbose": false,
    "output_file": "/home/tdo96567/BioSim/temp/refactor/1-frame/job001/output_file.json",
    "force_partitioning": 0.5,
    "water_entropy": true,
    "grouping": "molecules",
    "combined_forcetorque": true,
    "customised_axes": true
  },
  "provenance": {
    "python": "3.14.0",
    "platform": "Linux-6.6.87.2-microsoft-standard-WSL2-x86_64-with-glibc2.39",
    "codeentropy_version": "1.0.7",
    "git_sha": "cb22762349b99c149f13392d9280acb4dffec976"
  },
  "groups": {
    "0": {
      "components": {
        "united_atom:Transvibrational": 0.0,
        "united_atom:Rovibrational": 0.002160679012128457,
        "residue:Transvibrational": 0.0,
        "residue:Rovibrational": 3.376800684085249,
        "polymer:FTmat-Transvibrational": 12.341104347192612,
        "polymer:FTmat-Rovibrational": 0.0,
        "united_atom:Conformational": 7.269386795471401,
        "residue:Conformational": 0.0
      },
      "total": 22.989452505761392
    },
    "1": {
      "components": {
        "united_atom:Transvibrational": 0.0,
        "united_atom:Rovibrational": 0.01846427765949586,
        "residue:Transvibrational": 0.0,
        "residue:Rovibrational": 2.3863201082544565,
        "polymer:FTmat-Transvibrational": 11.11037253388596,
        "polymer:FTmat-Rovibrational": 0.0,
        "united_atom:Conformational": 6.410455987098191,
        "residue:Conformational": 0.46183561256411515
      },
      "total": 20.387448519462218
    }
  }
}
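A report with this shape could be assembled along the following lines. This is a hedged sketch: `to_jsonable`, `git_sha`, and `build_report` are hypothetical names, the `codeentropy_version` field is omitted, and computing `total` as the sum of the components is an assumption that is consistent with the example values above.

```python
import json
import platform
import subprocess
from pathlib import Path
from typing import Any, Dict, Optional


def to_jsonable(value: Any) -> Any:
    """Convert argument values (paths, tuples, nested dicts) into JSON-friendly types."""
    if isinstance(value, Path):
        return str(value)
    if isinstance(value, (list, tuple)):
        return [to_jsonable(v) for v in value]
    if isinstance(value, dict):
        return {str(k): to_jsonable(v) for k, v in value.items()}
    return value


def git_sha(repo_dir: str = ".") -> Optional[str]:
    """Return the current commit hash, or None outside a git checkout or without git."""
    try:
        out = subprocess.run(
            ["git", "rev-parse", "HEAD"],
            cwd=repo_dir, capture_output=True, text=True, check=True,
        )
    except (OSError, subprocess.CalledProcessError):
        return None
    return out.stdout.strip() or None


def build_report(args: Dict[str, Any], groups: Dict[Any, Dict[str, float]]) -> Dict[str, Any]:
    return {
        "args": to_jsonable(args),
        "provenance": {
            "python": platform.python_version(),
            "platform": platform.platform(),
            "git_sha": git_sha(),
        },
        "groups": {
            str(gid): {"components": dict(comps), "total": sum(comps.values())}
            for gid, comps in groups.items()
        },
    }


report = build_report(
    {"output_file": Path("output_file.json"), "start": 0},
    {0: {"residue:Rovibrational": 1.0, "united_atom:Conformational": 2.5}},
)
print(json.dumps(report, indent=2))
```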

Testing architecture overhaul

Expands unit test coverage across previously untested branches.
Adds regression test harness running CLI in isolated temp directories.
Ensures deterministic job folder creation during tests.
Adds baseline comparison against stored JSON outputs.

Regression dataset system

Automatic dataset download from CCPBioSim HTTPS filestore.
Local caching in .testdata/.
Intelligent detection of required files.
No manual setup required.
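The download-and-cache behaviour can be pictured with a small sketch. The base URL below is a placeholder rather than the real CCPBioSim filestore address, and `ensure_dataset_file` and its injectable `fetch` argument are hypothetical names introduced for the example.

```python
import urllib.request
from pathlib import Path
from typing import Callable

BASE_URL = "https://example.org/regression-data"  # placeholder, not the real filestore
CACHE_DIR = Path(".testdata")


def _http_fetch(url: str, dest: Path) -> None:
    """Default fetcher: download url to dest over HTTPS."""
    urllib.request.urlretrieve(url, dest)


def ensure_dataset_file(
    name: str,
    cache_dir: Path = CACHE_DIR,
    fetch: Callable[[str, Path], None] = _http_fetch,
) -> Path:
    """Return the cached path for name, downloading it only if it is missing."""
    target = cache_dir / name
    if target.exists():
        return target
    target.parent.mkdir(parents=True, exist_ok=True)
    # Download to a temporary name first so an interrupted transfer
    # is never mistaken for a complete cached file.
    partial = target.with_name(target.name + ".part")
    fetch(f"{BASE_URL}/{name}", partial)
    partial.replace(target)
    return target
```

Passing the fetcher as a parameter keeps the caching logic testable without network access.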

Quick vs slow regression separation

Introduces slow marker for long-running systems.
Quick regression suite excludes slow tests for fast feedback.
Full regression suite runs in weekly workflows.

Developer and tooling improvements

Standardises pytest markers and commands.
Adds --update-baselines workflow.
Improves debug diagnostics and artifact capture.
Migrates linting and formatting to Ruff.
Removes Black, Flake8, and isort.
Updates pre-commit configuration.
Simplifies optional dependencies.
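The idea behind an `--update-baselines` style workflow is that the same check either compares a fresh result with the stored JSON or overwrites the stored JSON. The sketch below shows that split under stated assumptions: the function name, the `update` flag plumbing, and the `1e-6` tolerance are all illustrative and are not taken from the PR.

```python
import json
import math
from pathlib import Path
from typing import Dict, List


def _flatten(report: dict) -> Dict[str, float]:
    """Map 'group/component' and 'group/total' to their values."""
    flat = {}
    for gid, group in report["groups"].items():
        for name, value in group["components"].items():
            flat[f"{gid}/{name}"] = value
        flat[f"{gid}/total"] = group["total"]
    return flat


def check_against_baseline(
    result: dict, baseline_path: Path, update: bool = False, abs_tol: float = 1e-6
) -> List[str]:
    """Return a list of mismatches; an empty list means the result matches."""
    if update:
        # Update mode: the current result becomes the new baseline.
        baseline_path.parent.mkdir(parents=True, exist_ok=True)
        baseline_path.write_text(json.dumps(result, indent=2, sort_keys=True))
        return []
    expected = _flatten(json.loads(baseline_path.read_text()))
    actual = _flatten(result)
    problems = [f"key mismatch: {key}" for key in sorted(expected.keys() ^ actual.keys())]
    for key in sorted(expected.keys() & actual.keys()):
        if not math.isclose(actual[key], expected[key], rel_tol=0.0, abs_tol=abs_tol):
            problems.append(f"{key}: expected {expected[key]!r}, got {actual[key]!r}")
    return problems
```

A regression test would then assert that the returned list is empty and print its contents on failure.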

CI/CD modernisation

Multi-OS testing (Linux, macOS, Windows).
Python matrix (3.12–3.14).
Quick regression tests on PRs.
Weekly full regression workflow.
Weekly docs build across Python versions.
Daily validation workflow.
Artifact upload on failures.
Dataset caching in CI.

Documentation pipeline

Docs build validation on PRs.
Weekly docs compatibility checks.
Updates the developer guide to reflect the new workflows.

Logging and error handling improvements

Eliminates duplicate traceback logging.
Centralises error boundary in CLI.
Improves exception chaining.
Adds argument logging on runtime failures.
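The pattern behind these points is that inner layers raise chained exceptions without logging, and a single boundary in the CLI logs the traceback once together with the arguments. The following is a minimal sketch of that pattern; `run_workflow`, `main`, and the logger name are hypothetical and do not mirror the actual CodeEntropy entry points.

```python
import logging
from typing import Any, Dict

logger = logging.getLogger("codeentropy_sketch")  # hypothetical logger name


def run_workflow(args: Dict[str, Any]) -> float:
    """Stand-in for the runtime layer: raises a clean error, never logs."""
    try:
        if args.get("end", 0) < args.get("start", 0):
            raise ValueError("end frame precedes start frame")
        return 0.0
    except ValueError as exc:
        # 'from exc' keeps the original exception as __cause__.
        raise RuntimeError("workflow failed") from exc


def main(args: Dict[str, Any]) -> int:
    """Single error boundary: the only place a traceback is logged."""
    try:
        run_workflow(args)
    except Exception:
        # logger.exception records the full chained traceback exactly once.
        logger.exception("Run failed with arguments: %s", args)
        return 1
    return 0
```

Because only `main` logs, a failure produces one traceback that still shows the original `ValueError` as the cause of the `RuntimeError`.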

Impact

Improves maintainability through DAG decomposition.
Increases confidence via expanded unit and regression coverage.
Provides faster CI feedback with quick regression separation.
Improves reproducibility with provenance metadata.
Simplifies developer setup with automatic datasets.
Modernises tooling stack with faster linting.
Improves cross-platform reliability via expanded CI matrices.
Provides clearer debugging through improved logging and artifacts.

Regression validation results

CodeEntropy Graph Implementation.xlsx

Entropy outputs from the refactored DAG workflow were compared against the previous implementation across all systems and component types. All values agree within floating-point tolerance.

Maximum absolute difference across all systems: 2.45e-08

This comparison confirms agreement across all individual contributions, not only group totals. Observed differences are consistent with expected floating-point variation introduced by consolidating numerical operations into NumPy.
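A maximum absolute difference of this kind can be computed per component rather than per total, as in the sketch below. The function name and the toy numbers are made up for illustration; the reported 2.45e-08 figure comes from the attached spreadsheet, not from this code.

```python
from typing import Dict, Optional, Tuple


def max_abs_difference(old: Dict, new: Dict) -> Tuple[Optional[str], float]:
    """Return (label, diff) for the largest per-component difference between two reports."""
    worst_label, worst = None, 0.0
    for gid, group in old["groups"].items():
        for name, old_value in group["components"].items():
            diff = abs(new["groups"][gid]["components"][name] - old_value)
            if diff > worst:
                worst_label, worst = f"group {gid} / {name}", diff
    return worst_label, worst


# Toy reports in the grouped JSON layout; the values are illustrative only.
previous = {"groups": {"0": {"components": {"a": 1.0, "b": 2.0}, "total": 3.0}}}
refactored = {"groups": {"0": {"components": {"a": 1.0, "b": 2.25}, "total": 3.25}}}
label, diff = max_abs_difference(previous, refactored)
print(label, diff)
```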

- `VibrationalEntropy` class -> own dedicated file within `CodeEntropy/entropy/nodes/vibrational_entropy.py`
- `ConformationalEntropy` class -> own dedicated file within `CodeEntropy/entropy/nodes/configurational_entropy.py`
- `OrientationalEntropy` class -> own dedicated file within `CodeEntropy/entropy/nodes/orientational_entropy.py`
- Created a placeholder for the new graph builder `CodeEntropy/entropy/entropy_graph.py`

Implement regression framework with:
- baseline JSON comparisons
- automatic dataset download from filestore
- .testdata cache
- slow test markers
- config-driven system tests
- CI workflows for quick PR checks and weekly full regression

This provides reproducible validation of scientific results across releases.

…ession: Run unit tests across all supported OS and Python versions, add quick regression suite to PRs with .testdata caching, and configure weekly workflow to run full regression including slow tests. Simplify docs builds to latest environment.

- Add ResultsReporter progress context manager
- Propagate optional progress sink through workflow orchestration
- Add progress reporting for:
  - Conformational state construction (per group)
  - Frame processing stage (per frame)
- Keep entropy graph execution silent due to fast runtime
- Update runtime tests to reflect wrapped RuntimeError behavior

…tooling:
- Add instructions for unit vs regression test suites
- Document slow test markers and how to run them
- Explain automatic regression dataset downloads via filestore
- Add guidance for updating regression baselines
- Update coding standards to use Ruff instead of Black/Flake8/isort
- Document multi-OS and multi-Python CI workflows
- Clarify developer setup and testing commands
- Remove outdated tooling references

harryswift01 added 30 commits November 10, 2025 11:29

restructure current levels file into abstracted and defined new classes and files

8ef0ac1

Refinements to the layout of the levels module:

2f943e5

- Renamed `CodeEntropy/levels/structual_analysis.py` -> `CodeEntropy/levels/dihedral_analysis.py` to more accurately describe what this class does
- Introduced a new class within the module `CodeEntropy/levels/neighbours.py`

Abstract out water entropy calculations into dedicated file and class

955abd3

setup components for the entropy_graph which will orchestrate the execution of entropy calculations

61c6f26

refine entropy_graph.py to setup for DAG execution structure

c9a0a1e

Create levels/nodes/detect_levels.py node that wraps `LevelHierarchy.select_levels()`

b5461c5

Create levels/nodes/build_beads.py node that wraps `LevelHierarchy.get_beads()`

c088fd4

Create hierarchy_graph.py with a generic DAG engine

6ceb4a0

update levels/level_manager.py to coordinate the DAG approach

c01e785

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

12c4323

restructure files from main branch merge

a2760a7

move main.py and remove CodeEntropy/cli folder

e058a9d

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

d3f263a

updated level_manager.py to reflect graph implementation

1818b3f

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

fabfc81

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

dc9ba03

update networkx and matplotlib to dependency range within `pyproject.toml`

287e239

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

e7be5fb

build minimal graph nodes for the levels functionality

5750327

update hierarchy_graph to match levels/nodes

7d5b861

update level_manager.py to match changes within hierarchy_graph.py

7500f80

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

03c3fe5

update execute() within entropy_manager.py to use DAG system

5010a8c

change execute() within entropy_manager.py to use LevelDAG rather than `LevelGraph`

edc3d86

refined entropy_graph.py

af816e0

Merge remote-tracking branch 'origin/main' into 173-refactor-levels

2eb7271

ensure shared_data is not being replaced within `EntropyManager.execute()`

24ef756

updated LevelDAG.execute() to use shared_data functionality correctly

c13531e

ensure LevelsDAG is a pure dataflow and not coupled

052aa57

harryswift01 added 19 commits February 23, 2026 16:33

ensure job*** folders are not created within test execution

a2361ad

add polymer branch test cases for test_frame_covariance_node.py

9481193

tidy tests within CodeEntropy/tests/unit/entropy

4142f1f

add unit test test_to_1d_array_returns_none_when_states_is_none

6edaadd

add unit tests to ensure _solute_id_to_resname is covered

c71ee1d

add unit test for _compute_ft_entropy function within `entropy/vibrational.py`

3ed442c

ensure unit tests cover all cases within _reduce_force_and_torque

0c38b55

remove legacy test_CodeEntropy tests

514b5a8

update results/reporter.py to have enriched data output:

cb22762

- Arguments are added to the `output_file.json`
- Provenance added to the `output_file.json`
- Tidied output logging in the `output_file.json`

update entropy/workflow.py to use the updated save_dataframes_as_json

226b37f

docs(badges): update CI badge set for new workflow structure:

305be6a

- Add badges for PR checks, daily tests, weekly regression, and weekly docs.
- Remove obsolete workflow badges and align README with current CI setup.

ci(pre-commit): migrate linting to Ruff and update hooks:

58c084a

- Replace black, flake8, and isort with Ruff for linting and formatting.
- Update pre-commit configuration and dependencies, add Ruff config to pyproject.toml, and apply automatic fixes across the codebase.

fix(cli,runtime): prevent duplicate traceback logging and improve error handling:

8355b3e

- Avoid double logging of exceptions by centralising traceback reporting in the CLI.
- Runtime now raises clean errors while preserving original exception chaining.

remove logger.info from files to ensure consistent output is maintained

756a17d

docs: tidy comments and standardise to Google-style docstrings

fff6315

harryswift01 added this to the v1.1.0 milestone Feb 25, 2026

harryswift01 self-assigned this Feb 25, 2026

harryswift01 added the feature request label Feb 25, 2026

harryswift01 added 2 commits February 25, 2026 17:03

ci: simplify job naming and limit regression tests to latest Python on Ubuntu

b1a3ea1

test: add regression baselines and whitelist them in .gitignore

ef7c342

harryswift01 requested a review from jimboid February 25, 2026 17:10

harryswift01 wants to merge 110 commits into main from 173-refactor-levels
