docs: adopt issue #4419 terminology in Scala/Java UDF guide by andygrove · Pull Request #4651 · apache/datafusion-comet

andygrove · 2026-06-13T14:28:34Z

Which issue does this PR close?

Part of #4419.

Rationale for this change

Issue #4419 proposes a terminology framework to remove the overloaded, bare use of "native" from Comet's docs. The "Scala UDF and Java UDF Support" page describes how UDFs participate in Comet execution and used "native Comet path" / "native path" / "native execution" loosely. This page is a good follow-up to the Understanding Comet Plans cleanup because Scala/Java UDFs are JVM-implemented (codegen'd) yet still run inside the Comet pipeline, so precise wording matters here.

This PR keeps the change scoped to that one page.

What changes are included in this PR?

In docs/source/user-guide/latest/scala_java_udfs.md:

Use "Comet pipeline" for pipeline membership in place of "native Comet path" / "native path".
Describe the surrounding accelerated operators as "Rust-implemented" rather than "native".
Replace "for native execution" / "runs as one native unit" with "to keep execution in the Comet pipeline" / "runs as one unit in the Comet pipeline".

How are these changes tested?

Documentation-only change. Verified with prettier --check.

Replace bare uses of "native" with terms from the apache#4419 framework: "Comet pipeline" for pipeline membership, "Rust-implemented" for the surrounding operators, and "run in the Comet pipeline" / "keep execution in the Comet pipeline" in place of "native path" / "native execution".

comphead · 2026-06-13T22:39:40Z

-| `spark.comet.exec.scalaUDF.codegen.enabled` | `true`  | When `true`, eligible `ScalaUDF`s run on the Comet path. When `false`, the enclosing operator falls back to Spark. |
+| Key                                         | Default | Description                                                                                                            |
+| ------------------------------------------- | ------- | ---------------------------------------------------------------------------------------------------------------------- |
+| `spark.comet.exec.scalaUDF.codegen.enabled` | `true`  | When `true`, eligible `ScalaUDF`s run in the Comet pipeline. When `false`, the enclosing operator falls back to Spark. |


perhaps we need to highlight this param contols a fallback for user defined functions codegen, having another config for builtin Spark functions codegen?

comphead

Thanks @andygrove overall it is a good improvement

andygrove added this to the 0.17.0 milestone Jun 13, 2026

comphead reviewed Jun 13, 2026

View reviewed changes

comphead approved these changes Jun 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: adopt issue #4419 terminology in Scala/Java UDF guide#4651

docs: adopt issue #4419 terminology in Scala/Java UDF guide#4651
andygrove wants to merge 1 commit into
apache:mainfrom
andygrove:docs-terminology-udf-4419

andygrove commented Jun 13, 2026

Uh oh!

comphead Jun 13, 2026

Uh oh!

comphead left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

andygrove commented Jun 13, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

Uh oh!

comphead Jun 13, 2026

Choose a reason for hiding this comment

Uh oh!

comphead left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants