Single GPU benchmark scripts by ChSonnabend · Pull Request #15514 · AliceO2Group/AliceO2

ChSonnabend · 2026-06-12T07:28:27Z

This PR brings two scripts that benchmark the single GPU performance

gen_single_gpu_rtc_benchmark.sh generates the workflow from dpl-workflow.sh by setting environment variables and using early stops to avoid processing failures
analyze_gpu_benchmarks.py then analyzes the resulting log file for processing times, records then, histograms them and fits a gaussian to the result to determine the mean processing time per timeslice

…ithout accesibility to numactl

davidrohr

Didn't check anything in detail, but the things that immediately came to my mind

davidrohr · 2026-06-12T07:51:22Z

+
+# ROCm library injection is only useful for HIP runs. Keep it off by default for CUDA/NVIDIA containers,
+# because mixed AMD/NVIDIA hosts can otherwise leak ROCm libraries into LD_LIBRARY_PATH.
+if [[ "${GPUTYPE:-}" == "HIP" && "0$BENCH_AUTO_ROCM_LIBS" == "01" ]]; then


With new bash you can just use $BENCH_AUTO_ROCM_LIBS == 1

davidrohr · 2026-06-12T07:52:22Z

+
+export DPL_REPORT_PROCESSING="${DPL_REPORT_PROCESSING:-1}"
+
+export FST_TMUX_NO_EPN="${FST_TMUX_NO_EPN:-1}"


not needed, since start_tmux.sh is not used

davidrohr · 2026-06-12T07:52:39Z

+# ----------------------------------------------------------------------------------------------------------------------
+# Locate original workflow script. Keep the original untouched.
+
+: "${GEN_TOPO_MYDIR:=$(dirname "$(realpath "$0")")}"


Why don't you simple use $O2_ROOT/dpl-workflow.sh?

davidrohr · 2026-06-12T07:53:17Z

+export WORKFLOW_PARAMETERS="${WORKFLOW_PARAMETERS:-GPU,CTF}"
+export GPUTYPE="${GPUTYPE:-CUDA}"
+export NGPUS=1
+export NUMAGPUIDS=1


NUMAGPUIDS and NUMAID should not be set, if not using NUMA pinning

davidrohr · 2026-06-12T07:54:47Z

+export EPNSYNCMODE="${EPNSYNCMODE:-0}"
+export SYNCMODE="${SYNCMODE:-1}"
+export SYNCRAWMODE="${SYNCRAWMODE:-0}"
+
+export TIMEFRAME_RATE_LIMIT="${TIMEFRAME_RATE_LIMIT:-5}"
+export GEN_TOPO_NO_TF_RATE_UPSCALING="${GEN_TOPO_NO_TF_RATE_UPSCALING:-1}"
+
+export DISABLE_ROOT_OUTPUT="${DISABLE_ROOT_OUTPUT:-1}"
+
+# Double pipeline requires zsraw input. Therefore default to raw TF input, not CTF.
+export CTFINPUT="${CTFINPUT:-0}"
+export RAWTFINPUT="${RAWTFINPUT:-1}"
+export DIGITINPUT="${DIGITINPUT:-0}"
+export EXTINPUT="${EXTINPUT:-0}"


Why do you redefine all the defaults that come from setenv.sh?
I would only set those settings, which you need.
That should be
SYNCMODE=1
TIMEFRAME_RATE_LIMIT=5
RAWTFINPUT=1

davidrohr · 2026-06-12T07:55:45Z

+  source "$PWD/local_env.sh"
+fi
+
+export ALICE_O2_FST="${ALICE_O2_FST:-1}"


This is a hack for running on MI100, I would not put it in this script

davidrohr · 2026-06-12T07:56:00Z

+
+export ALICE_O2_FST="${ALICE_O2_FST:-1}"
+
+if [[ -f "$GEN_TOPO_MYDIR/setenv.sh" ]]; then


dpl-workflow.sh will source setenv.sh, why do you source it here?

davidrohr · 2026-06-12T07:56:33Z

+# Let O2/core dumps land in the benchmark run directory, not in the original working directory.
+export CORE_DUMP_DIR="${CORE_DUMP_DIR:-$RUNDIR}"
+export O2_CORE_DUMP_DIR="${O2_CORE_DUMP_DIR:-$RUNDIR}"
+export FAIRMQ_SHM_MONITOR_CONFIG="${FAIRMQ_SHM_MONITOR_CONFIG:-}"


We do not run the SHM MONITOR, why do you need this?

ChSonnabend added 4 commits June 8, 2026 14:59

Avoiding numactl execution to avoid crashes of FST in container env w…

dd89cae

…ithout accesibility to numactl

Adding GPU benchmark scripts and python analysis script

95f3190

Merge branch 'AliceO2Group:dev' into devel_fst_numactl

48b88e3

Resetting start_tumx.sh to upstream/dev

478d76b

ChSonnabend requested a review from a team as a code owner June 12, 2026 07:28

davidrohr requested changes Jun 12, 2026

View reviewed changes

ChSonnabend added 2 commits June 12, 2026 13:11

Updating scripts

f87246f

Update env variables

43e244b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Single GPU benchmark scripts#15514

Single GPU benchmark scripts#15514
ChSonnabend wants to merge 6 commits into
AliceO2Group:devfrom
ChSonnabend:devel_fst_numactl

ChSonnabend commented Jun 12, 2026

Uh oh!

davidrohr left a comment

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

davidrohr Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants


		export DPL_REPORT_PROCESSING="${DPL_REPORT_PROCESSING:-1}"

		export FST_TMUX_NO_EPN="${FST_TMUX_NO_EPN:-1}"


		export ALICE_O2_FST="${ALICE_O2_FST:-1}"

		if [[ -f "$GEN_TOPO_MYDIR/setenv.sh" ]]; then

Conversation

ChSonnabend commented Jun 12, 2026

Uh oh!

davidrohr left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants