Conversation
TomAugspurger
left a comment
The output sample you shared looks good to me.
paul-aiyedun
left a comment
The overall idea makes sense to me. However, I have questions about some of the implementation details, the Slurm-specific logic, and the assumptions being made.
Also, please update the PR title to reflect this update.
coordinator=false
# Worker REST/HTTP port for internal and admin endpoints.
http-server.http.port=8080
http-server.http.port=10000
Is this an intentional update?
-m, --metrics    Collect detailed metrics from Presto REST API after each query.
                 Metrics are stored in query-specific directories.

ENVIRONMENT:
Can we use command line arguments instead of environment variables?
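For example, a rough sketch of what that could look like with `argparse`; the flag names here are just placeholders, not a proposal for the final interface:

```python
import argparse


def parse_args(argv):
    """Sketch: replace PRESTO_BENCHMARK_DEBUG with an explicit CLI flag."""
    parser = argparse.ArgumentParser(description="Presto benchmark runner")
    parser.add_argument(
        "--debug",
        action="store_true",
        help="Print debug logs for worker/engine detection "
             "(node URIs, reachability, metrics, Docker containers).",
    )
    parser.add_argument(
        "-m", "--metrics",
        action="store_true",
        help="Collect detailed metrics from the Presto REST API after each query.",
    )
    return parser.parse_args(argv)
```

This keeps all run options discoverable from `--help` instead of being spread across environment variables.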
PRESTO_BENCHMARK_DEBUG  Set to 1 to print debug logs for worker/engine detection
                        (e.g. node URIs, reachability, metrics, Docker containers).
                        Use when engine is misdetected or the run fails.
Docker                  In Docker setups, engine is inferred from running worker
Can you please expand on this?
CONTEXT_KEY = "context"
ITERATIONS_COUNT_KEY = "iterations_count"
SCHEMA_NAME_KEY = "schema_name"
SCALE_FACTOR_KEY = "scale_factor"
Can we copy over the metadata.json file from the dataset instead of duplicating some of the details here?
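Something along these lines, maybe; the directory layout and key names are assumptions about how the dataset is laid out:

```python
import json
import shutil
from pathlib import Path


def copy_dataset_metadata(dataset_dir: str, bench_output_dir: str) -> dict:
    """Sketch: copy the dataset's metadata.json into the benchmark output
    directory and return its contents, instead of re-declaring fields like
    schema_name and scale_factor in the script."""
    src = Path(dataset_dir) / "metadata.json"
    dst = Path(bench_output_dir) / "metadata.json"
    shutil.copy(src, dst)
    with open(dst) as fh:
        return json.load(fh)
```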
    "benchmark": benchmark_types[0] if len(benchmark_types) == 1 else benchmark_types,
    **run_config,
}
with open(f"{bench_output_dir}/benchmark_config.json", "w") as file:
The existing JSON file has a "context" field. Can we extend that instead of writing to a new file?
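Roughly something like this; the results file name and field layout are assumptions:

```python
import json


def extend_context(results_path: str, run_config: dict) -> None:
    """Sketch: merge the run configuration into the "context" field of the
    existing results JSON instead of writing a separate benchmark_config.json."""
    with open(results_path) as fh:
        results = json.load(fh)
    # Create "context" if missing, then fold the run config into it.
    results.setdefault("context", {}).update(run_config)
    with open(results_path, "w") as fh:
        json.dump(results, fh, indent=2)
```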
def get_gpu_name_from_slurm_logs() -> str | None:
    """
    When running under SLURM, workers run nvidia-smi -L and write to LOGS/worker_<id>.log.
If there is a need to log the output of nvidia-smi, then we should probably do this consistently (and not just for slurm).
def get_engine_from_slurm() -> str | None:
    """
    Infer engine when running under SLURM from nvidia-smi -L output in LOGS/worker_0.log.
Can we standardize how this is done instead of having a slurm specific solution?
def get_gpu_name() -> str | None:
    """
    Return GPU model name. Under SLURM, read from LOGS/worker_<id>.log if LOGS is set;
Same comment as above.
def get_worker_image() -> str | None:
    """Return worker image name from env (set by cluster/container setup)."""
    return os.environ.get("WORKER_IMAGE")
def _current_username() -> str:
We plan on adding the ability to run presto with existing images. I don't think we should make assumptions about image/tag names.
Added a benchmark_config.json output file to run_benchmark.sh. This file provides context for how the benchmark was run and will include information such as the following:
The scripts should auto-detect everything while running the benchmark, including the number of workers, engine, GPUs, etc. Querying the nodes for their engine type via the Presto API does not seem viable, so the type is currently determined from the container names (in Docker) and via an nvidia-smi call on SLURM clusters. The SLURM path needs additional testing, but we need the updated SLURM scripts to merge upstream before we can test them.