Uellenberg/esu sampling by uellenberg · Pull Request #25 · hanpham32/NetMotif

uellenberg · 2026-03-02T22:26:04Z

No description provided.

…the labels

… label insted of all generated subgraphs

…on-canonical subgraphs

…directed

…e labelg path based on the platform

Improve ESU execution time

This adds tests for exampleGraph and random graph generation. I modified random graph generation slightly to make it easier to test - we might be better off removing the "mimic graph" parameter entirely and just specifying the exact properties of the graph we want.

Add snapshot tests

format all files with black

Using ./NetMotif for all paths required always running the server from outside its root directory, and required that it always lived inside a directory called "NetMotif". Both requirements make development and deployment a bit awkward, but we can fix it by always taking paths relative to one of our source files (which should be constant).

This was causing a warning from streamlit, where mixed types ("NA" and floats) worked, but weren't meant to be used. I also noticed the four-decimal formatting was done by streamlit and not written down anywhere, so I added it here explicitly.

Streamlit was giving a warning about the blank label (and there really should be one for accessibility anyway).

This creates a labelg process and worker thread, which receive (non canonical label, extra data), send it to labelg in batches, and run a callback function when the canonical label is available. This is opposed to the previous method, which required all data to be available upfront and ran labelg at the end. This will help the SubgraphProfile and SubgraphCollection implementations. In particular, if we end up doing a streaming implementation, we'll need labelg to be streaming as well, otherwise we'd need to keep all of the node data in memory until ESU is done, then run labelg in one pass. Technically, this allows labelg to run in parallel with ESU, although labelg is so fast that this doesn't improve performance. I ran a few tests and don't see any difference in performance between this and the old version. I also added a few controls (batch size and max batches), but I haven't tuned them.

Uellenberg/current version parity

allow 0 random graphs

remove the unused ESU progress param

… the temporarly created files

Implement SubgraphProfile and SubgraphCollection dwnload

This is based on FANMOD's sampling mode, and allows us to get good results at a fraction of the time a full ESU takes. It works by probabilistically determining whether to explore certain branches of the graph. I didn't add weighting because it always cancels out in our case. If it needs to be added in the future, the weight is 1/(product of all probabilities). For the probabilities themselves, the formula I used is a bit arbitrary, it's just one that scales with the depth (we don't want to take out too many of the early paths) and has nice probabilities. There's probably a more optimal formula to use, or we may allow users to input their own probabilities by hand.

Radu and others added 30 commits February 17, 2026 19:01

add performance timing info for ESU and labeling methods

337884c

calculate the d6 labels in parallel and run labelg only once for all …

8ef3817

…the labels

return ESU subgraphs using a generator and keep aggragated counts per…

2794ffd

… label insted of all generated subgraphs

avoid node iteration during esu

1be687b

remove the node_visited set used for all root nodes

477ac09

hash the g6 label by bytes instead of string and remove the list of n…

f33ccaa

…on-canonical subgraphs

fix the directed graph motif frequency by considering neighbors as un…

9290576

…directed

run ESU in parallel for the randomized graphs

b0c2027

generate html as a string instead of writing a temp file and determin…

527037c

…e labelg path based on the platform

add linux labelg binary

f50aa85

Merge pull request #1 from raduba/feature/measure_perf_time

0a11fef

Improve ESU execution time

Add snapshot tests

ba2c379

This adds tests for exampleGraph and random graph generation. I modified random graph generation slightly to make it easier to test - we might be better off removing the "mimic graph" parameter entirely and just specifying the exact properties of the graph we want.

Merge pull request #2 from raduba/uellenberg/unit-tests-2

aa81bd4

Add snapshot tests

format all files with black

20aec8e

Merge pull request #3 from raduba/code-format

0aa5101

format all files with black

Run formatter on pages/

1c0fc30

Explicitly convert statistics to strings

904f7ae

This was causing a warning from streamlit, where mixed types ("NA" and floats) worked, but weren't meant to be used. I also noticed the four-decimal formatting was done by streamlit and not written down anywhere, so I added it here explicitly.

Fix uploaded file label warning

90c1921

Streamlit was giving a warning about the blank label (and there really should be one for accessibility anyway).

Merge pull request #4 from raduba/uellenberg/current-version-parity

34bea09

Uellenberg/current version parity

allow 0 random graphs

c11edc2

Merge pull request #5 from raduba/feature/no-random-graphs

704a867

allow 0 random graphs

remove the unused ESU progress param

df815d4

Merge pull request #6 from raduba/feature/cleanup-esu-params

8bedad8

remove the unused ESU progress param

implement subgraph profile and subgraph collection download and clean…

9fc7dc7

… the temporarly created files

compress download files as gzip

69223cc

Merge pull request #7 from raduba/feature/profile-and-subgraph

ad66096

Implement SubgraphProfile and SubgraphCollection dwnload

allow 1000 random graphs

7c210bb

uellenberg closed this Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uellenberg/esu sampling#25

Uellenberg/esu sampling#25
uellenberg wants to merge 30 commits intohanpham32:mainfrom
raduba:uellenberg/esu-sampling

uellenberg commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

uellenberg commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants