[Feature Request] [Spark] Optionally sort within partitions when Z-ordering #4000
Which Delta project/connector is this regarding?

Spark
Overview
Z-ordering a table does not sort data within partitions (files), and consequently data skipping at the Parquet level, which relies on row-group metadata, is inefficient.
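For context, the row-group statistics that this kind of skipping relies on can be inspected directly from the Parquet footers. A minimal sketch using the parquet-hadoop API (available on the Spark classpath); the file path is a placeholder:

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path
import org.apache.parquet.hadoop.ParquetFileReader
import org.apache.parquet.hadoop.util.HadoopInputFile

// Dump per-row-group min/max statistics for one file of the table.
// Wide min/max ranges in every row group mean the reader cannot skip any of them.
val inputFile = HadoopInputFile.fromPath(
  new Path("/path/to/table/part-00000.snappy.parquet"), // placeholder path
  new Configuration())
val reader = ParquetFileReader.open(inputFile)
try {
  var rowGroup = 0
  reader.getFooter.getBlocks.forEach { block =>
    block.getColumns.forEach { col =>
      val stats = col.getStatistics
      println(s"row group $rowGroup, column ${col.getPath}: " +
        s"min=${stats.genericGetMin}, max=${stats.genericGetMax}")
    }
    rowGroup += 1
  }
} finally {
  reader.close()
}
```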
Motivation
To increase read efficiency by leveraging multi-dimensional clustering (MDC) at the row-group level. A global sort is considered in the design details but was deemed too slow. Sorting within partitions, on the other hand, is relatively fast because it does not introduce a shuffle. It can optionally be applied after the current `repartitionByRange` step. To the best of my knowledge, this approach has not been considered.
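A minimal sketch of the idea in plain Spark terms; this is not the actual optimize/Z-order code path, and the column name, file count, and output path are illustrative:

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

val spark = SparkSession.builder().master("local[*]").appName("sortWithinPartitionsSketch").getOrCreate()

// Toy stand-in for the interleaved Z-order key produced by the clustering expression.
val df = spark.range(10000000L).withColumn("zorderKey", col("id") % 1000)

val numFiles = 8
val clustered = df
  .repartitionByRange(numFiles, col("zorderKey")) // existing step: one shuffle into range partitions
  .sortWithinPartitions(col("zorderKey"))         // proposed step: local sort only, no extra shuffle

clustered.write.mode("overwrite").parquet("/tmp/sortWithinPartitionsSketch") // placeholder output path
```

`sortWithinPartitions` plans a sort with no additional exchange, which is why the extra cost stays small compared to a global `orderBy`.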
Further details
I originally discussed this problem in the Slack channel with @Kimahriman, who suggested I raise an issue here.
I've implemented the feature by adding a configuration property, `spark.databricks.io.skipping.mdc.sortWithinPartitions`, defaulting to `false`. When the property is enabled, the partitions are sorted on `repartitionKeyColName` after `repartitionByRange`.
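With the change applied, usage would look roughly like this (the property only exists with the PR; the table path and Z-order columns are just examples, assuming an active SparkSession `spark`):

```scala
// Enable the proposed flag, then Z-order the table as usual.
spark.conf.set("spark.databricks.io.skipping.mdc.sortWithinPartitions", "true")

// SQL form:
spark.sql("OPTIMIZE delta.`/tmp/my_table` ZORDER BY (id1, id2)")

// Or the equivalent DeltaTable API:
// io.delta.tables.DeltaTable.forPath(spark, "/tmp/my_table").optimize().executeZOrderBy("id1", "id2")
```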
I ran a comparison based on the Delta Lake Z Order blog post and notebook by @MrPowers. I don't have enough local disk for the large data set (`G1_1e9_1e2_0_0.csv`), so I used a medium-sized one instead (`G1_1e8_1e8_100_0.csv`) and timed `query_c` on four table versions, among them one Z-ordered on `id1` and `id2`, and one Z-ordered on `id1` and `id2` and additionally sorted within partitions.
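For reference, a rough sketch of how such a timing comparison can be run; the exact `query_c` is defined in the notebook linked above, so the columns, id values, and helper name below are only illustrative (again assuming an active SparkSession `spark`):

```scala
import org.apache.spark.sql.functions.{col, sum}

// Time one run of a selective filter-plus-aggregate query against a table version.
// This only mirrors the shape of query_c; it is not the exact query from the notebook.
def timeQueryMillis(path: String): Long = {
  val start = System.nanoTime()
  spark.read.format("delta").load(path)
    .filter(col("id1") === "id016" && col("id2") === "id016") // placeholder id values
    .agg(sum(col("v1")))
    .collect()
  (System.nanoTime() - start) / 1000000
}
```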
On a 2021 MacBook Pro with 16 GB of RAM, the results were:
The `id` column values queried are different because the original combination did not exist in my data set.

Update: I ran the experiment on the larger data set (`G1_1e9_1e9_100_0.csv`) using cloud storage and the results are:

Willingness to contribute
The Delta Lake Community encourages new feature contributions. Would you or another member of your organization be willing to contribute an implementation of this feature?
I have opened PR #4006.