-
Notifications
You must be signed in to change notification settings - Fork 3.7k
[feat](olap) Support lazy reading mode for pruned complex columns #59263
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
5ff30aa to
b39fda3
Compare
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
FE UT Coverage ReportIncrement line coverage |
FE Regression Coverage ReportIncrement line coverage |
b39fda3 to
c734a83
Compare
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
c734a83 to
ca5a57c
Compare
|
run buildall |
ca5a57c to
1734d3f
Compare
|
run buildall |
Cloud UT Coverage ReportIncrement line coverage Increment coverage report
|
FE UT Coverage ReportIncrement line coverage |
TPC-H: Total hot run time: 34393 ms |
TPC-DS: Total hot run time: 174201 ms |
ClickBench: Total hot run time: 28 s |
FE Regression Coverage ReportIncrement line coverage |
BE UT Coverage ReportIncrement line coverage Increment coverage report
|
BE Regression && UT Coverage ReportIncrement line coverage Increment coverage report
|
What problem does this PR solve?
The subcolumns of a pruned complex-type column can fall into two categories:
1. Predicate columns — columns required to evaluate filter predicates, which need to be read upfront.
2. Non-predicate columns — columns that are not needed when evaluating filter predicates.
For non-predicate columns, we can defer reading them until after predicate evaluation, which may significantly reduce the amount of data read.
This PR also removes references to
olap/rowset/segment_v2/column_reader.hfrom many header files, avoiding large-scale recompilation of source files caused by changes toColumnReader/ColumnIterator.Related PR: #xxx
Problem Summary:
Release note
For non-predicate columns, we can defer reading them until after predicate evaluation, which may significantly reduce the amount of data read.
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)