fix: handle inf/-inf in ShimSparkErrorConverter cast overflow by manuzhang · Pull Request #3768 · apache/datafusion-comet

manuzhang · 2026-03-23T10:11:51Z

Which issue does this PR close?

Closes #3767.

Rationale for this change

Fixes incorrect exception translation for overflow cases involving infinity literals and aligns Comet behavior with Spark expectations in ANSI mode.

What changes are included in this PR?

Normalize inf literals for float/double cast overflow conversion across Spark 3.4/3.5/4.0 and add unit tests in SparkErrorConverterSuite.

How are these changes tested?

Add new UT SparkErrorConverterSuite.

manuzhang · 2026-03-23T10:12:17Z

@parthchandra Please take a look when you find time.

andygrove · 2026-03-24T00:09:17Z

spark/src/main/spark-3.4/org/apache/spark/sql/comet/shims/ShimSparkErrorConverter.scala


+  private def parseFloatLiteral(value: String): Float = {
+    value.toLowerCase match {
+      case "inf" | "+inf" | "infinity" | "+infinity" => Float.PositiveInfinity


I know this issue is focused on inf but do we need to do anything with nan as well?

Covered nan as well.

Normalize inf/nan literals for float/double cast overflow conversion across Spark 3.4/3.5/4.0 and add unit tests in SparkErrorConverterSuite for float/double inf/-inf/nan. Co-authored-by: Codex <[email protected]>

parthchandra · 2026-03-24T20:12:36Z

spark/src/main/spark-3.4/org/apache/spark/sql/comet/shims/ShimSparkErrorConverter.scala

+  }
+
+  private def parseDoubleLiteral(value: String): Double = {
+    value.toLowerCase match {


In conversion_funcs/numeric.rs:spark_cast_nonintegral_numeric_to_integral the calls to cast_float_to_int16_down and cast_float_to_int32_up explicitly format the string with "{:e}D" (a suffix D).
I think inf and nan will get this D suffix and the resultant string infD or nanD would not match.
The unit tests below will not catch this either.

manuzhang force-pushed the fix-shim-cast-overflow-inf branch from ad224ca to f38fd91 Compare March 23, 2026 12:09

andygrove reviewed Mar 24, 2026

View reviewed changes

fix: handle inf/-inf/nan in ShimSparkErrorConverter cast overflow

5068336

Normalize inf/nan literals for float/double cast overflow conversion across Spark 3.4/3.5/4.0 and add unit tests in SparkErrorConverterSuite for float/double inf/-inf/nan. Co-authored-by: Codex <[email protected]>

manuzhang force-pushed the fix-shim-cast-overflow-inf branch from f38fd91 to 5068336 Compare March 24, 2026 02:55

parthchandra reviewed Mar 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: handle inf/-inf in ShimSparkErrorConverter cast overflow#3768

fix: handle inf/-inf in ShimSparkErrorConverter cast overflow#3768
manuzhang wants to merge 1 commit intoapache:mainfrom
manuzhang:fix-shim-cast-overflow-inf

manuzhang commented Mar 23, 2026

Uh oh!

manuzhang commented Mar 23, 2026

Uh oh!

andygrove Mar 24, 2026

Uh oh!

manuzhang Mar 24, 2026

Uh oh!

parthchandra Mar 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

manuzhang commented Mar 23, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How are these changes tested?

Uh oh!

manuzhang commented Mar 23, 2026

Uh oh!

andygrove Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

manuzhang Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

parthchandra Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants