Fix PyArrow stack-overflow segfault in PyIceberg's multi-column upsert match filter by steven-winfield-quantohm · Pull Request #3509 · apache/iceberg-python

steven-winfield-quantohm · 2026-06-15T13:43:52Z

Rationale for this change

The problem

When upserting into an Iceberg table, PyIceberg first scans the target table to
find which existing rows match the source rows' key columns. It builds that
"matching" predicate in pyiceberg.table.upsert_util.create_match_filter:

For a single join column it emits one flat In(col, [v1, v2, ...]).
PyArrow lowers this to a single is_in compute node, no matter how many
values it contains — so single-column upserts of huge tables are fine.

For a multi-column key it instead emits one disjunct per distinct key
tuple::

Or(And(c1 == v1, c2 == w1),
   And(c1 == v2, c2 == w2),
   ...)                          # ONE disjunct PER ROW

PyIceberg builds that Or as a balanced tree, so the Python side copes.
But when the expression is handed to PyArrow's dataset scanner as a filter, the
C++ expression engine canonicalises it: Dataset::GetFragments calls
SimplifyWithGuarantee → Canonicalize, which flattens the associative
or_kleene chain and then recurses over it. With tens of thousands of
disjuncts that recursion overflows the C++ call stack and the process
segfaults (SIGSEGV) — typically after several minutes of work, with a
backtrace full of arrow::compute::Canonicalize / ModifyExpression
frames.

Reference: #3272

Note that #3448 addresses a different upsert segfault (a
per-batch Acero re-filter in _task_to_record_batches, mostly observed on
Apple Silicon). It does not touch the GetFragments canonicalisation path
exercised here, so it does not help with this crash.

The fix

Produce a predicate that matches exactly the same rows, but with far fewer
disjuncts. Group the key tuples and emit a single In over whichever column
collapses to the fewest distinct "prefix" combinations (choosing that column
makes the result independent of the caller's column ordering)::

Or(And(c1 == v1, c2 IN [w, x, y]),
   And(c1 == v2, c2 IN [z]),
   ...)                            # one disjunct per distinct PREFIX

The disjunct count drops from "number of rows" to "number of distinct prefix
values". In the synthetic data below there are 50 000 unique ids spread over
just 50 group values, so the predicate shrinks from 50 000 disjuncts to 50 —
shallow enough that PyArrow's canonicaliser no longer overflows.

Caveat

This helps whenever at least one key column is low-cardinality (or, equivalently,
one column is near-unique and can be folded into the In). A genuinely
high-cardinality composite key — where every column is near-unique and all of
them are needed to identify a row — still produces roughly one disjunct per row
even after grouping, and can still overflow. For that pathological case the
only robust option is to upsert in smaller batches.

Are these changes tested?

Yes - one test changed to expect And(op1, op2) or And(op2, op1) where previously
the operand order mattered.

Are there any user-facing changes?

No

…le_condition

rambleraptor

Can we get some additional tests? I want to make sure that we're not breaking anything. Here's a couple test cases I can think of, but I'd love to get your thoughts:

Multiple columns, multiple prefix groups
Multiple columns, single prefix group

steven-winfield-quantohm added 3 commits June 15, 2026 14:09

More efficient (and less segfaulty) create_match_filter

f5865d4

Missing import

0362a91

Allow And(op1, op2) or And(op2, op1) in test_create_match_filter_sing…

269dcd2

…le_condition

rambleraptor reviewed Jun 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix PyArrow stack-overflow segfault in PyIceberg's multi-column upsert match filter#3509

Fix PyArrow stack-overflow segfault in PyIceberg's multi-column upsert match filter#3509
steven-winfield-quantohm wants to merge 3 commits into
apache:mainfrom
steven-winfield-quantohm:main

steven-winfield-quantohm commented Jun 15, 2026

Uh oh!

rambleraptor left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

steven-winfield-quantohm commented Jun 15, 2026

Rationale for this change

The problem

The fix

Caveat

Are these changes tested?

Are there any user-facing changes?

Uh oh!

rambleraptor left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants