Skip to content

Support higher-order array functions via JVM UDF bridge #4224

@andygrove

Description

@andygrove

What is the problem the feature request solves?

Higher-order functions (exists, forall, transform, filter, aggregate) fall back to Spark because they involve lambda expressions. This causes C2R/R2C transitions even when the rest of the pipeline is native.

Describe the potential solution

Use the JVM UDF bridge (#4170) to evaluate lambdas on the JVM while keeping data in columnar Arrow format. Prototype for exists in #4223.

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions