build: Enable Spark SQL tests for Spark 4.2 [will not merge until 4.2 is released] #4208

Draft
andygrove wants to merge 2 commits into apache:main from andygrove:spark-4.2.0-preview4-diff

@andygrove
Member

Which issue does this PR close?

Part of #4142.

Rationale for this change

The spark-4.2 Maven profile (added in #4119) only compiled main sources; the SQL test diff and CI matrix entry were not yet in place. This PR turns on the Spark SQL test workflow against Spark 4.2.0-preview4 so we can start iterating on the failures tracked in #4142.

What changes are included in this PR?

  • Add dev/diffs/4.2.0-preview4.diff, seeded from 4.1.1.diff and reconciled against v4.2.0-preview4. Notable manual reconciliations versus 4.1.1:
    • KeyGroupedPartitioningSuite: 4.2 already declares collectAllShuffles / collectShuffles as protected and returning Seq[ShuffleExchangeLike], so only the Comet case arms and imports are added.
    • WholeStageCodegenSuite: BroadcastHashJoinExec gained a 9th constructor argument; the CometColumnarToRowExec match was rewritten to the new arity.
    • SQLTestUtils, PartitionedTablePerfStatsSuite, WriteDistributionAndOrderingSuite, QueryExecutionSuite, JoinHintSuite, ParquetRowIndexSuite, ParquetTypeWideningSuite, DisableUnnecessaryBucketedScanSuite: import-only or class-extension reconciliations.
  • Add {spark-short: '4.2', spark-full: '4.2.0-preview4', java: 17, scan-impl: 'auto'} to the spark_sql_test.yml matrix.
  • Drop the WIP comment from the spark-4.2 profile in pom.xml.

How are these changes tested?

  • Verified dev/diffs/4.2.0-preview4.diff re-applies cleanly to a fresh v4.2.0-preview4 checkout with no rejects.
  • A local ./mvnw -Prelease -Pspark-4.2 -DskipTests -Dmaven.test.skip=true install build completes cleanly.
  • The new matrix row will exercise the diff in CI.
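The re-apply check in the first bullet can be sketched roughly as follows. This is a minimal sketch, not necessarily the procedure used for this PR: `check_diff_applies` is a hypothetical helper name, and it assumes the Spark checkout is a git working tree. It relies only on `git apply --check`, which validates a patch without modifying any files.

```shell
# Hypothetical helper (not part of the repo): succeeds only if the diff
# would apply to the given checkout with no rejects.
# $1 = path to the Spark checkout, $2 = ABSOLUTE path to the diff file.
check_diff_applies() {
  # --check validates the patch against the working tree without changing it.
  # -C switches into the checkout first, which is why $2 must be absolute.
  git -C "$1" apply --check "$2"
}

# Example usage (assumed layout; adjust paths to your environment):
#   git clone --depth 1 --branch v4.2.0-preview4 https://github.com/apache/spark.git apache-spark
#   check_diff_applies apache-spark "$PWD/dev/diffs/4.2.0-preview4.diff" && echo "applies cleanly"
```

A non-zero exit status means at least one hunk would be rejected; git prints the offending file and hunk to stderr.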

Adds dev/diffs/4.2.0-preview4.diff so the Spark SQL test suite can run
against Apache Spark 4.2.0-preview4 with Comet enabled, and wires the
4.2 profile into the spark_sql_test workflow matrix. The diff was
seeded from 4.1.1.diff and reconciled against v4.2.0-preview4.
@andygrove andygrove marked this pull request as draft May 4, 2026 15:20
Inherited from 4.1.1.diff during reconciliation. The Spark build was
trying to resolve comet-spark-spark4.1_2.13 instead of the 4.2 artifact,
causing the sql_hive jobs to fail before any tests ran.
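One way to catch this kind of leftover when seeding a diff from an older one is to scan it for artifact IDs that name a different Spark version. The sketch below is illustrative only: `find_stale_artifact_refs` is a hypothetical helper, and the pattern assumes artifact IDs follow the `comet-spark-spark<version>_<scala>` shape seen in the failing build.

```shell
# Hypothetical helper (not part of the repo): print lines of a diff that
# reference a comet-spark artifact for a Spark version other than the expected one.
# $1 = diff file, $2 = expected Spark short version (for example 4.2).
find_stale_artifact_refs() {
  # First grep: every line naming a comet-spark-spark<major>.<minor>_ artifact,
  # prefixed with its line number in the diff.
  # Second grep: drop the lines that already name the expected version.
  grep -nE 'comet-spark-spark[0-9]+\.[0-9]+_' "$1" \
    | grep -vF "comet-spark-spark${2}_"
}

# Example usage:
#   find_stale_artifact_refs dev/diffs/4.2.0-preview4.diff 4.2
```

The function prints nothing and returns a non-zero status when no stale references are found, so an empty result is the desired outcome.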
@andygrove andygrove changed the title from "build: Enable Spark SQL tests for Spark 4.2.0-preview4" to "build: Enable Spark SQL tests for Spark 4.2.0-preview4 [will not merge until 4.2 is released]" May 4, 2026
@andygrove andygrove changed the title from "build: Enable Spark SQL tests for Spark 4.2.0-preview4 [will not merge until 4.2 is released]" to "build: Enable Spark SQL tests for Spark 4.2 [will not merge until 4.2 is released]" May 4, 2026