Skip to content

[GLUTEN-11599][VL] Support approx_count_distinct_for_intervals function#11729

Open
malinjawi wants to merge 2 commits intoapache:mainfrom
malinjawi:gluten-11599-approx-count-distinct-for-intervals
Open

[GLUTEN-11599][VL] Support approx_count_distinct_for_intervals function#11729
malinjawi wants to merge 2 commits intoapache:mainfrom
malinjawi:gluten-11599-approx-count-distinct-for-intervals

Conversation

@malinjawi
Copy link
Contributor

Summary

  • Map Spark approx_count_distinct_for_intervals to the Velox backend aggregate

Context

Testing

  • Not run (mapping‑only change)

@github-actions github-actions bot added CORE works for Gluten Core DOCS labels Mar 9, 2026
@github-actions
Copy link

github-actions bot commented Mar 9, 2026

Run Gluten Clickhouse CI on x86

@github-actions
Copy link

Run Gluten Clickhouse CI on x86

@jinchengchenghh
Copy link
Contributor

Please update here to verify you Velox PR, https://github.com/apache/incubator-gluten/blob/main/ep/build-velox/src/get-velox.sh#L28, and make sure your native implementation is really called. Provide an example test in CI, e.g. warning message disappeared.

[main] WARN org.apache.spark.sql.execution.GlutenFallbackReporter - Validation failed for plan: ObjectHashAggregate[QueryId=113], due to: 
 - Validation failed with exception from: RegularHashAggregateExecTransformer, reason: Could not find a valid substrait mapping name for approx_count_distinct_for_intervals(s_store_sk#87088L, org.apache.spark.sql.catalyst.expressions.UnsafeArrayData@3915186b, 0.05, 0, 0).

@jinchengchenghh jinchengchenghh changed the title [GLUTEN-11599][VL] Map approx_count_distinct_for_intervals [GLUTEN-11599][VL] Support approx_count_distinct_for_intervals function Mar 10, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CORE works for Gluten Core DOCS

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[VL] Add support for approx_count_distinct_for_intervals

2 participants