Monitor and plot RSS memory and CPU usage during `qlever index` by tanmay-9 · Pull Request #277 · qlever-dev/qlever-control

tanmay-9 · 2026-04-06T17:18:42Z

Until now, the `qlever index` command gave no insight into how much memory an index build actually needs or which phase of the build is responsible for the peak.

With this change, every index build records RSS memory and CPU usage over time, writes `<name>.usage-log.tsv`, and renders `<name>.usage-log.png` once the build finishes. The plot shades each index build phase (parsing, vocabulary merge, conversion, each permutation, and the text index) as a separate band and annotates the memory peak, so resource usage can be attributed to a specific phase. For comparison across runs and settings, the plot is captioned with the git hash of the index binary, the `STXXL_MEMORY` setting, and the batch size. This works whether the index is built natively or in a container (Docker or Podman).
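To illustrate how a memory peak can be attributed to a build phase, here is a minimal sketch. It is not the code from this PR: the `Phase` type, the `peak_phase` function, and the sample values are hypothetical, and it assumes that samples and phase start times are both expressed in seconds since the start of the build.

```python
from bisect import bisect_right
from typing import NamedTuple


class Phase(NamedTuple):
    """Hypothetical phase record: a name and its start time in seconds."""
    name: str
    start_s: float


def peak_phase(samples, phases):
    """Return (phase_name, time_s, rss_gb) for the sample with the highest RSS.

    samples: list of (time_s, rss_gb) tuples, sorted by time.
    phases:  list of Phase sorted by start_s; a phase lasts until the next one starts.
    """
    if not samples:
        raise ValueError("no samples recorded")
    # The peak is the sample with the largest RSS value (first one on ties).
    time_s, rss_gb = max(samples, key=lambda s: s[1])
    # Find the last phase that started at or before the peak time.
    starts = [p.start_s for p in phases]
    idx = bisect_right(starts, time_s) - 1
    name = phases[idx].name if idx >= 0 else "(before first phase)"
    return name, time_s, rss_gb


# Made-up example data, for illustration only.
samples = [(0.0, 0.5), (10.0, 2.1), (20.0, 6.4), (30.0, 3.0), (40.0, 1.2)]
phases = [
    Phase("parsing", 0.0),
    Phase("vocabulary merge", 15.0),
    Phase("conversion", 25.0),
]
print(peak_phase(samples, phases))  # ('vocabulary merge', 20.0, 6.4)
```

The same lookup gives the band boundaries for shading: each phase spans from its own start time to the start time of the next phase.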

The sampling interval can be set with `--resource-monitor-interval`. For long builds, `--resource-monitor-plot-max-points` limits how many points are drawn; it affects only the plot, not the sampling itself. There is also a `--replot-resource-usage` option that re-renders the plot from an existing `<name>.usage-log.tsv` without re-running the index build, which is useful for tweaking plot settings.
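One way to limit the number of plotted points without hiding the memory peak is to split the samples into buckets and keep the largest value from each bucket. The sketch below shows that idea only; it is an assumption about a reasonable approach, not necessarily the downsampling method used in this PR, and the function name `downsample_keep_peaks` is hypothetical.

```python
def downsample_keep_peaks(points, max_points):
    """Reduce points to at most max_points, keeping the max-value point per bucket.

    points: list of (time_s, value) tuples, sorted by time.
    The input list is not modified; only the plotted subset shrinks.
    """
    if max_points < 1:
        raise ValueError("max_points must be at least 1")
    n = len(points)
    if n <= max_points:
        return list(points)
    reduced = []
    for b in range(max_points):
        # Integer bucket boundaries; every bucket is non-empty because n > max_points.
        lo = b * n // max_points
        hi = (b + 1) * n // max_points
        reduced.append(max(points[lo:hi], key=lambda p: p[1]))
    return reduced


# Made-up data: 1000 samples with one spike that must survive downsampling.
points = [(float(i), float(i % 10)) for i in range(1000)]
points[437] = (437.0, 99.0)
plotted = downsample_keep_peaks(points, 50)
print(len(plotted), max(v for _, v in plotted))
```

Because the full series stays in the TSV file, a different point limit can be tried later by re-rendering from the log instead of rebuilding the index.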


tanmay-9 and others added 3 commits April 2, 2026 11:50

Add first version of index memory monitoring

b764dcc

Fix memory monitor to correctly select native/container system

51ff865

Merge branch 'qlever-dev:main' into compute-index-mem-usage

556a54e

tanmay-9 changed the title ~~Compute the physical memory usage used by the qlever index command~~ Compute the physical memory used by the qlever index command Apr 7, 2026

tanmay-9 and others added 2 commits April 8, 2026 17:58

Use engine_name from qlever __init__ in memory monitor and fix docker/podman different memUsage parsing

85273cd

Merge branch 'qlever-dev:main' into compute-index-mem-usage

c7e02d6

tanmay-9 changed the title ~~Compute the physical memory used by the qlever index command~~ Track memory usage during qlever index Apr 8, 2026

tanmay-9 and others added 19 commits April 9, 2026 10:06

Merge branch 'qlever-dev:main' into compute-index-mem-usage

0f8544a

Add a way to specify different parent pid for memory monitoring

1cba7a6

Add pss and uss memory monitoring

802652b

Fix process finding logic for memory monitoring

f6d82ab

Add swap monitoring as well

d2a8b1d

Merge branch 'qlever-dev:main' into compute-index-mem-usage

37cef0d

Merge branch 'qlever-dev:main' into compute-index-mem-usage

91ed89e

Add tsv logging and matplotlib plotting with index phases to memory_monitor.py

a378e5f

Parse Qleverfile and logs to add git hash, stxxl memory and batch size to the plot. Also make gb use consistent

0de53d4

Change memory_monitor to resource_monitor, cpu percent per core to cpu cores used and add downsampling (max_points=500) for plot

126358f

Add plot_only option to index and make max_plot_points configurable

64083ca

Merge remote-tracking branch 'origin/main' into compute-index-mem-usage

318b5a8

Merge remote-tracking branch 'origin/main' into compute-index-mem-usage

29543e2

Merge remote-tracking branch 'origin/main' into compute-index-mem-usage

27640c3

Fix minor code issues for resource_monitor

bfa3898

Extract pure functions to module-level and make render_usage_plot call explicit in index.py

3416812

Separate plot rendering logic in its own file

e5cfc7d

Fix failing index tests as a result of ResourceMonitor usage

7469737

Add pure-function tests for resource_monitor and usage_plot

019d379

tanmay-9 changed the title ~~Track memory usage during qlever index~~ Monitor and plot RSS memory and CPU usage during qlever index May 28, 2026

tanmay-9 added 2 commits May 28, 2026 15:12

Improve docstrings and comments, and fix minor issues

1631db9

Fix formatting

da3b0fc

tanmay-9 wants to merge 26 commits into qlever-dev:main from tanmay-9:compute-index-mem-usage
