feat(planner): logical plan for rcte #16680

xzhseh · 2024-05-10T04:46:09Z

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Refers to #16483.

Checklist

I have written necessary rustdoc comments
I have added necessary unit tests and integration tests
I have added test labels as necessary. See details.
I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
All checks passed in ./risedev check (or alias, ./risedev c)
My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)

My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

xzhseh · 2024-05-10T04:47:01Z

cc @TennyZhuang @chenzl25 @xiangjinwu @wangrunji0408.

gitguardian · 2024-05-10T04:47:11Z

️✅ There are no secrets present in this pull request anymore.

If these secrets were true positive and are still valid, we highly recommend you to revoke them.
Once a secret has been leaked into a git repository, you should consider it compromised, even if it was deleted immediately.
Find here more information about risks.

🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.

chenzl25 · 2024-05-10T06:59:46Z

src/frontend/src/optimizer/plan_node/logical_recursive_union.rs

+        _predicate: Condition,
+        _ctx: &mut PredicatePushdownContext,
+    ) -> PlanRef {
+        self.clone().into()


We should handle predicate pushdown as well.

Yep, this should be handled once the optimization is enabled. Since there is no actual implementation of predicate pushdown at present, it's alright.

If we don't handle it right now, I think we need to change it to unimplemented!(): the current implementation ignores the pushed-down predicate, so it will generate a wrong plan.

it will generate a wrong plan

I don't quite get this: why would directly cloning the current plan, without predicate pushdown, lead to a wrong plan? 👀

If you don't want to push down the predicate, you should implement it like this. The predicate passed to this function can't be thrown away; at the very least we need to generate a filter for it.

    fn predicate_pushdown(
        &self,
        predicate: Condition,
        ctx: &mut PredicatePushdownContext,
    ) -> PlanRef {
        // No pushdown.
        gen_filter_and_pushdown(self, predicate, Condition::true_cond(), ctx)
    }

Updated the implementation of predicate_pushdown. I think it's reasonable to push the predicate down to both the base and the recursive plan.
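To make the concern in this thread concrete, here is a minimal, self-contained sketch of why a node's pushdown method cannot simply discard the predicate it receives. Everything in it (`Plan`, `pushdown_dropping`, `pushdown_keeping`, `execute`, and the "value > threshold" predicate) is a hypothetical toy model for illustration, not RisingWave's planner types or the code in this PR.

```rust
// Toy model only: these types and functions are hypothetical and are
// NOT RisingWave's `PlanRef`, `Condition`, or `gen_filter_and_pushdown`.
#[derive(Debug, Clone, PartialEq)]
enum Plan {
    /// Stand-in for a node that does not know how to absorb predicates.
    RecursiveUnion,
    /// Keeps only rows whose value is strictly greater than `min_exclusive`.
    Filter { min_exclusive: i64, input: Box<Plan> },
}

/// Mirrors the original shape of the method: the predicate is ignored,
/// so the caller's filter silently disappears from the plan.
fn pushdown_dropping(plan: &Plan, _predicate: Option<i64>) -> Plan {
    plan.clone()
}

/// "No pushdown" fallback: the node is left untouched, but the predicate
/// is preserved as a filter sitting directly above it.
fn pushdown_keeping(plan: &Plan, predicate: Option<i64>) -> Plan {
    match predicate {
        Some(min_exclusive) => Plan::Filter {
            min_exclusive,
            input: Box::new(plan.clone()),
        },
        None => plan.clone(),
    }
}

/// Evaluates a toy plan over some input rows.
fn execute(plan: &Plan, rows: &[i64]) -> Vec<i64> {
    match plan {
        Plan::RecursiveUnion => rows.to_vec(),
        Plan::Filter { min_exclusive, input } => execute(input, rows)
            .into_iter()
            .filter(|v| v > min_exclusive)
            .collect(),
    }
}
```

In this toy model, pushing the predicate "value > 4" through `pushdown_dropping` and executing over the rows 1, 5, 10 returns all three rows, while `pushdown_keeping` returns only 5 and 10. The first result is the "wrong plan" case: the optimizer handed the predicate to the node, and the node lost it.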

chenzl25 · 2024-05-10T07:06:02Z

src/frontend/src/optimizer/plan_node/logical_recursive_union.rs

+    fn prune_col(&self, required_cols: &[usize], ctx: &mut ColumnPruningContext) -> PlanRef {
+        let new_inputs = self
+            .inputs()
+            .iter()
+            .map(|input| input.prune_col(required_cols, ctx))
+            .collect_vec();
+        let new_plan = self.clone_with_inputs(&new_inputs);
+        self.ctx()
+            .insert_rcte_cache_plan(self.core.id, new_plan.clone());
+        new_plan
+    }


Here we reuse the ShareId but insert a new_plan whose schema might differ from the old plan's. However, cte_ref has a base field that represents the plan at the beginning of planning, so it would be inconsistent with the rcte_cache_plan updated here. That's why I think we should never use the base of cte_ref after the initial planning.

but cte_ref has a base field which represents a plan at the beginning of planning, so it would be inconsistent with the rcte_cache_plan updated here.

Agreed. The only essential use of base is to get the current OptimizerContext. For the GenericPlanNode impl methods, I think we can just follow the current stream_key logic; that way everything is consistent with the on-the-fly plan cache stored in the context.

The implementation of the current stream_key (the consistent one we should stick to) is here: https://github.com/risingwavelabs/risingwave/pull/16680/files#diff-1d6ad1cdae116ec20167aafcc080d20606c2fb3da492cea9c6a398412456d96bR63.
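The inconsistency discussed in this thread can be illustrated with a small sketch. It assumes a cache keyed by a share id and a reference node that also holds a snapshot taken at initial planning. The names `Plan`, `Context`, `CteRef`, `rcte_cache`, and this `prune_col` are hypothetical stand-ins, not the actual RisingWave types or signatures.

```rust
use std::collections::HashMap;

// Toy model only: hypothetical types, not RisingWave's planner API.
#[derive(Debug, Clone, PartialEq)]
struct Plan {
    columns: Vec<String>,
}

/// Stand-in for an optimizer context holding the on-the-fly plan cache.
#[derive(Default)]
struct Context {
    rcte_cache: HashMap<u64, Plan>,
}

/// Stand-in for a CTE reference node.
struct CteRef {
    share_id: u64,
    /// Snapshot from initial planning; it is never updated afterwards,
    /// so it can go stale once the cached plan is rewritten.
    base: Plan,
}

impl CteRef {
    /// Derives the schema from the cache rather than from `base`, so it
    /// always reflects the most recently rewritten plan.
    fn schema<'a>(&self, ctx: &'a Context) -> Option<&'a [String]> {
        ctx.rcte_cache
            .get(&self.share_id)
            .map(|plan| plan.columns.as_slice())
    }
}

/// Keeps only the required columns and refreshes the cache entry under
/// the same share id, analogous to the cache update in the diff above.
fn prune_col(plan: &Plan, required: &[usize], share_id: u64, ctx: &mut Context) -> Plan {
    let new_plan = Plan {
        columns: required.iter().map(|&i| plan.columns[i].clone()).collect(),
    };
    ctx.rcte_cache.insert(share_id, new_plan.clone());
    new_plan
}
```

After pruning a three-column plan down to two columns in this sketch, `CteRef::schema` reports two columns while `base` still holds three. That gap is the reason for reading derived properties from the cache and not from `base` once initial planning is done.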

chenzl25

LGTM, thanks!

xzhseh · 2024-05-14T23:31:02Z

@chenzl25 could you help me merge this pr, thanks!

chenzl25 · 2024-05-15T08:11:33Z

@chenzl25 could you help me merge this pr, thanks!

Sure

xzhseh added 17 commits April 24, 2024 21:48

add generic nodes and logical nodes

c9a8d63

add planner related functions; update fmt

7a43527

update comment; tiny refactor

52148e8

add rcte_cache in OptimizerContext

e18d11a

add base to BoundBackCteRef

c51f77a

logical plan rcte

1580c26

update fmt

6740299

Merge branch 'main' into xzhseh/plan-rcte

09c2231

change schema of recursive union in generic plan node

799bd43

remove redundant debugging stmt

86545f8

update planner tests; pretty fields for LogicalRecursiveUnion

61fe892

update planner tests for cte ref

4ae584f

add case with explicit column to planner test

6a2fe60

fix check

35b6d61

Merge branch 'risingwavelabs:main' into xzhseh/plan_rcte

4d5d790

update rcte cache plan for prune_col

defe677

Merge branch 'main' into xzhseh/plan-rcte

27ed103

update fmt

bb43608

chenzl25 reviewed May 10, 2024


xzhseh added 2 commits May 12, 2024 17:46

update methods for consistency purpose, conforming to stream_key

d258495

update unimplemented msg for LogicalRecursiveUnion & LogicalCteRef

ce16c83

chenzl25 approved these changes May 14, 2024


update comment for the example rcte in BindingCteState and bind_with

995aee7

xzhseh force-pushed the xzhseh/plan-rcte branch from 9406b81 to 995aee7 Compare May 14, 2024 19:34

xzhseh added 3 commits May 14, 2024 13:10

implement predicate_pushdown for LogicalRecursiveUnion & LogicalCteRef

5460f17

update fmt

d54d95d

update expect test

23875f3

chenzl25 added this pull request to the merge queue May 15, 2024

Merged via the queue into risingwavelabs:main with commit a63fa8c May 15, 2024
27 of 28 checks passed
