chore: refactor CI to have centralized SBT action by comphead · Pull Request #4643 · apache/datafusion-comet

comphead · 2026-06-12T17:55:20Z

Which issue does this PR close?

Closes #.

CI: split Spark SBT compile from test execution

Problem

The sql_core-* matrix entries are randomly SIGKILLed on Spark 4.1.2 runners, reporting only Killed with no JVM stack trace. That points to the kernel/container OOM killer, not -Xmx exhaustion. Each matrix entry runs sbt -mem 3072 testOnly *, so SBT pays the full Scala/Zinc compile heap (~2.5–3 GB peak) and then orchestrates the forked test JVM, on top of Comet's native off-heap pool, all on a 7 GB hosted runner. The budget is at the edge, and cumulative allocation patterns push it over non-deterministically.
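
As a minimal sketch of how this failure mode can be recognized (not something taken from this PR): a process terminated by SIGKILL reports exit status 128 + 9 = 137 to its parent shell and never gets a chance to print a Java OutOfMemoryError. The child below kills itself to stand in for a JVM reaped by the OOM killer; the variable names are illustrative.

```shell
#!/usr/bin/env bash
# Stand-in for a process reaped by the kernel OOM killer:
# the child sends SIGKILL to itself, so it cannot run any handlers.
status=0
bash -c 'kill -KILL $$' || status=$?

# Shells report "killed by signal N" as exit status 128 + N.
signal=$(( status - 128 ))
echo "exit status: $status (128 + signal $signal)"
# prints: exit status: 137 (128 + signal 9)
```

A JVM that exhausts -Xmx would instead exit on its own with a stack trace, which is why a bare Killed line suggests the container limit rather than the heap limit.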

Change

A new single build job per Spark version builds libcomet.so (cargo --profile ci) and runs sbt -mem 3072 catalyst/Test/compile sql/Test/compile hive/Test/compile once. It uploads two artifacts:
- native-lib-linux (libcomet.so, ~50 MB)
- jvm-compiled-spark-<full>-jdk<N> (apache-spark.tar.gz: sources + target/ + Zinc state, ~500 MB–1 GB, mtimes preserved via tar -czpf)

Each matrix spark-sql-test entry declares needs: build, downloads and extracts both artifacts, calls setup-spark-builder with skip-spark-clone: true (which only re-runs the mvn install of Comet's JAR), and then runs sbt -mem 1536 testOnly *. Zinc verifies that no source changed and skips the compile.

setup-spark-builder gains a skip-spark-clone input that gates the Spark checkout and diff apply, so both steps can be skipped when the sources are pre-staged from the artifact.
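
The artifact round trip can be sketched as follows. This is an illustration under assumptions, not the workflow's actual steps: the directory layout and the Foo.class file are hypothetical stand-ins for the Spark checkout, and touch -d and stat -c rely on GNU coreutils as found on Linux runners.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical scratch layout standing in for the compiled Spark tree.
work="$(mktemp -d)"
mkdir -p "$work/build/apache-spark/target"
echo "compiled" > "$work/build/apache-spark/target/Foo.class"
# Backdate the file so a reset mtime would be easy to spot.
touch -d '2020-01-01 00:00:00 UTC' "$work/build/apache-spark/target/Foo.class"

# Build side: pack the tree; tar records each entry's mtime and mode.
tar -czpf "$work/apache-spark.tar.gz" -C "$work/build" apache-spark

# Test side: unpack into a fresh directory, as a matrix entry would.
mkdir -p "$work/test"
tar -xzpf "$work/apache-spark.tar.gz" -C "$work/test"

# Compare modification times (seconds since epoch) before and after.
before="$(stat -c %Y "$work/build/apache-spark/target/Foo.class")"
after="$(stat -c %Y "$work/test/apache-spark/target/Foo.class")"
if [ "$before" = "$after" ]; then
  echo "mtime preserved"
else
  echo "mtime changed"
fi

rm -rf "$work"
```

With GNU tar this should report that the mtime is preserved, since extraction restores stored modification times unless -m (--touch) is passed.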

Effect

SBT heap per matrix runner drops from 3072 MB to 1536 MB, freeing ~1.5 GB of runner headroom, the budget gap that was producing the OOM kills.
Spark compile runs once per Spark version instead of once in each of the seven matrix entries.
Job count per Spark version is unchanged: before, build-native + 7×(compile+test) = 8; now, build (native + compile) + 7×test = 8. The heavy compile is amortized across the matrix instead of being repeated in every entry.
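
The arithmetic behind these figures, written out as a small sketch using only the numbers stated above (variable names are illustrative):

```shell
#!/usr/bin/env bash
# Figures from the description above.
old_heap_mb=3072      # sbt -mem before the change
new_heap_mb=1536      # sbt -mem in each test entry after the change
matrix_entries=7      # spark-sql-test entries per Spark version

freed_mb=$(( old_heap_mb - new_heap_mb ))

# One build job plus one job per matrix entry, both before and after.
jobs_before=$(( 1 + matrix_entries ))
jobs_after=$(( 1 + matrix_entries ))

# Before: every matrix entry compiled Spark. After: only the build job does.
compiles_before=$matrix_entries
compiles_after=1

echo "heap freed per test runner: ${freed_mb} MB"
echo "jobs per Spark version: ${jobs_before} -> ${jobs_after}"
echo "Spark compiles per Spark version: ${compiles_before} -> ${compiles_after}"
```

The job count stays at 8, so the gain comes from the number of compiles and the per-runner heap rather than from fewer runners.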

comphead marked this pull request as draft June 12, 2026 20:50

comphead added 8 commits June 12, 2026 17:07

chore: set default value for spark.comet.memoryOverhead in tests

28206b1

chore: set default value for spark.comet.memoryOverhead in tests

164e51f

chore: set default value for spark.comet.memoryOverhead in tests

657c22d

chore: set default value for spark.comet.memoryOverhead in tests

c48549c

chore: set default value for spark.comet.memoryOverhead in tests

d410576

chore: set default value for spark.comet.memoryOverhead in tests

0b3750e

chore: set default value for spark.comet.memoryOverhead in tests

0c7c763

chore: set default value for spark.comet.memoryOverhead in tests

49b31b3

comphead force-pushed the chore branch from 6bef91f to 49b31b3 Compare June 13, 2026 00:10

comphead and others added 5 commits June 12, 2026 20:30

chore: set default value for spark.comet.memoryOverhead in tests

1795122

chore: set default value for spark.comet.memoryOverhead in tests

805134a

Merge branch 'main' into chore

ca496d2

chore: set default value for spark.comet.memoryOverhead in tests

554f875

Merge remote-tracking branch 'origin/chore' into chore

282f299

comphead changed the title ~~chore: set default value for spark.comet.memoryOverhead in tests~~ chore: refactor CI to have centralized SBT action Jun 13, 2026

comphead marked this pull request as ready for review June 13, 2026 15:52

andygrove approved these changes Jun 13, 2026

comphead merged commit d926e21 into apache:main Jun 13, 2026
33 checks passed

comphead merged 13 commits into apache:main from comphead:chore
