Benchmarks¶

TractographyTRX includes a benchmark suite that mirrors the trx-cpp real-data benchmarks and extends them with ITK-specific workloads. The suite focuses on:

Translate + stream write: read streamlines, apply a +1 mm translation, and stream the result to a new TRX file.
Spatial slab queries: issue 20 AABB slab queries to emulate interactive slicing.
Atlas parcellation: assign streamlines to TRX groups with TrxParcellationLabeler.
Group TDI mapping: produce a density image for a selected TRX group.

Data¶

The recommended reference dataset is the same HCP-derived TRX file used by trx-cpp: a 10-million-streamline tractogram with float16 positions and one DPS field (10milHCP_dps-sift2.trx). Pass its path via --reference.

Results¶

The following figures were produced on an Apple M3 Pro (36 GB RAM, macOS 15) using the 10-million-streamline HCP tractogram with float16 positions.

Translate+Write runtime comparison — Translate-and-write runtime across 100k-10M streamlines for ITK, ITK with reusable `vnl_matrix` buffer, and raw `trx-cpp`.¶

Translate+Write peak memory footprint — Peak resident-memory delta during translate-and-write under matched workloads, up to 10M streamlines.¶

AABB query total runtime by tractogram size — Total runtime for the 20-slab AABB workload across ITK and raw `trx-cpp`, with a query cap of 500 streamlines per slab.¶

Per-group storage overhead — Group-only storage overhead as a function of streamline count and dilation radius (0/1/2 voxels) for two atlases (Glasser and 4S456).¶

Build (standalone)¶

Configure and build with benchmarks enabled:

cmake -S . -B build-release \
  -DCMAKE_BUILD_TYPE=Release \
  -DTractographyTRX_BUILD_BENCHMARKS=ON
cmake --build build-release --target bench_trx_itk_realdata

If trx-cpp or Google Benchmark are installed elsewhere, make sure CMake can find them using trx-cpp_DIR and benchmark_DIR. Otherwise, trx-cpp is fetched automatically by default.

Run the benchmarks¶

The helper script writes JSON output for plots and accepts explicit fairness controls:

bash bench/run_benchmarks.sh \
  --reference /path/to/reference.trx \
  --bench-data-dir /path/to/bench-data \
  --query-cap 500 \
  --out-dir bench/results

The benchmark binary also supports --reference-trx directly:

./build-release/bench/bench_trx_itk_realdata \
  --reference-trx /path/to/reference.trx \
  --benchmark_out=bench/results/bench_trx_itk_realdata.json \
  --benchmark_out_format=json

Plot benchmark results¶

An R plotting helper is provided at bench/plot_bench.R. It reads benchmark JSON output and generates PNG plots for translate/write runtime + RSS, AABB query runtime + p50/p95, parcellation runtime + storage overhead, and Group TDI runtime + RSS (when rows are present).

Rscript bench/plot_bench.R \
  --bench-dir bench/results/publication \
  --out-dir bench/results/plots \
  --run-type iteration \
  --strict TRUE

Required R packages:

jsonlite
ggplot2
dplyr
tidyr
scales

Environment variables¶

bench_trx_itk_realdata reuses several trx-cpp environment variables:

TRX_BENCH_MAX_STREAMLINES: cap streamline counts (default 10,000,000)
TRX_BENCH_ONLY_STREAMLINES: run a single streamline count
TRX_BENCH_LOG_PROGRESS_EVERY: emit progress every N streamlines
TRX_BENCH_CHUNK_BYTES: buffer size for ITK streaming writes
TRX_BENCH_MAX_QUERY_STREAMLINES: cap AABB query results (publication runs should use a positive value shared across backends)

Notes on comparison¶

Publication runs should always use capped query results for both backends. The recommended setting is TRX_BENCH_MAX_QUERY_STREAMLINES=500 to match interactive viewer behavior.
run_benchmarks.sh now enforces a positive cap; zero/uncapped query runs are not used for manuscript figures.