
Conversation


@Alex-Monahan Alex-Monahan commented Dec 22, 2025

Hi folks!

This is a new PR meant to solve the same use case as #593, but addressing the PR feedback! Thank you for the guidance - it was super helpful. I am still open to any changes you recommend!

This adds 2 new tables to the DuckLake spec: ducklake_sort_info and ducklake_sort_expression.

ducklake_sort_info keeps a version history of the sort settings for tables over time, with one row for each time a new sort setting is applied to a table. (The prior PR used an option for this, but that has been removed based on the feedback.)

ducklake_sort_expression tracks the details of that sort. For each time a new sort setting is applied, this table includes one row per expression in the ORDER BY. (If I order by column3 ASC, column42 DESC, column64 ASC, then there will be 3 rows.)
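As a rough illustration of how the two tables relate (the column names below are illustrative guesses, not the actual spec columns), the one-to-many join between a sort setting and its expressions can be sketched with SQLite:

```python
import sqlite3

# Hypothetical schemas: the real DuckLake spec columns may differ.
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE ducklake_sort_info (
    sort_info_id   INTEGER PRIMARY KEY,
    table_id       INTEGER,
    begin_snapshot INTEGER  -- snapshot at which this sort setting took effect
);
CREATE TABLE ducklake_sort_expression (
    sort_info_id     INTEGER,  -- joins back to ducklake_sort_info
    expression_index INTEGER,  -- position within the ORDER BY list
    expression       TEXT,     -- currently limited to plain column names
    ascending        INTEGER   -- 1 = ASC, 0 = DESC
);
""")

# ORDER BY column3 ASC, column42 DESC, column64 ASC -> three expression rows
con.execute("INSERT INTO ducklake_sort_info VALUES (1, 10, 100)")
con.executemany(
    "INSERT INTO ducklake_sort_expression VALUES (?, ?, ?, ?)",
    [(1, 0, "column3", 1), (1, 1, "column42", 0), (1, 2, "column64", 1)],
)

rows = con.execute("""
    SELECT e.expression, e.ascending
    FROM ducklake_sort_info i
    JOIN ducklake_sort_expression e USING (sort_info_id)
    WHERE i.table_id = 10
    ORDER BY e.expression_index
""").fetchall()
print(rows)  # [('column3', 1), ('column42', 0), ('column64', 1)]
```

The point of the split is that the version history lives in one row per setting change, while the per-expression details stay normalized in the second table.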

There are still a few limitations with this PR:

  1. This does not order during insert (only during compaction and inline flush).
    • I would love to try to do a follow up PR to add this!
  2. Only explicit column names can be used in the sorting, not expressions.
    • I tried to implement this but could not, so I could use some help with this piece!
    • There is a friendly error message (and tests) to document this limitation.
    • The spec has an expression column, so the intention was to make the spec itself forward-compatible with expression-oriented sorting.
  3. Files are still selected for compaction based on insertion order. It could be better to sort the list of files by min/max metadata before selecting files for compaction.
    • Let me know if this is desirable and I can work on it after the "order during insert" in number 1!
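The min/max idea in limitation 3 can be as simple as sorting candidate files by their zone-map stats before picking a batch, so compaction merges files with overlapping key ranges. A hypothetical sketch (the file-stats shape is invented for illustration):

```python
# Hypothetical: select compaction candidates by min/max sort-key stats
# instead of insertion order (the current behavior in this PR).
def pick_compaction_candidates(files, limit):
    """files: list of dicts with 'min' and 'max' sort-key stats per file."""
    # Sorting by the min stat groups files that are close in sort order,
    # so compacting them produces better-clustered output files.
    return sorted(files, key=lambda f: (f["min"], f["max"]))[:limit]

files = [{"min": 5, "max": 9}, {"min": 1, "max": 3}, {"min": 2, "max": 6}]
selected = pick_compaction_candidates(files, 2)
print(selected)  # [{'min': 1, 'max': 3}, {'min': 2, 'max': 6}]
```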

I believe that I made this fully compatible with the batching code, but I was testing locally on a DuckDB catalog and not on Postgres. Any extra eyes on that side would be great!

If this looks good, I can also do any docs PRs that you recommend - happy to help there.

Thanks folks! CC @philippmd as well as an FYI

@Alex-Monahan (Author)

Welcome back @Mytherin , hope you had an excellent time away! I'd love your feedback - happy to address changes big or small. Sorry to ping, I just know that DuckLake 1.0 is not far away, and it takes me a bit of time to make C++ edits! :-)

Many thanks!!

@Mytherin (Contributor) left a comment

Thanks! I've left a comment on the spec side of things - haven't looked at the code

@Alex-Monahan Alex-Monahan marked this pull request as draft January 8, 2026 17:04
@Alex-Monahan Alex-Monahan marked this pull request as ready for review January 8, 2026 18:17
@Alex-Monahan (Author)

I've added in that separate table! Existing tests are passing with the join added - let me know if there are other things worth testing now that there are 2 tables involved!

Mind kicking off CI for me? I appreciate it!

@Alex-Monahan (Author)

I realized that I was allowing the SET SORTED BY command to complete even when the columns were not valid (a legacy from the old option approach that was hard to validate up front). I've now restricted it so that it will fail if the column is incorrect and added some tests!
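That up-front validation can be sketched roughly as follows (the function and parameter names here are mine, not the PR's, and the case-insensitive comparison is an assumption):

```python
# Hypothetical sketch of validating SET SORTED BY columns up front,
# failing fast instead of accepting unknown column names.
def validate_sort_columns(table_columns, sort_columns):
    """Raise if any requested sort column is not a column of the table."""
    # Compare case-insensitively, since catalog lookups may be case-insensitive.
    known = {c.lower() for c in table_columns}
    missing = [c for c in sort_columns if c.lower() not in known]
    if missing:
        raise ValueError("Column(s) not found in table: " + ", ".join(missing))
```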

@pdet (Collaborator) left a comment

Hey Alex, the PR is looking super solid, thanks again for all the effort you put in!

I'm wondering if you could also add the following tests:

  1. Dropping a table with an existing sort order, then creating a table with the same name but different columns.
  2. Getting the changes information from snapshot().
  3. Verifying that the alter statement won't interfere with compaction (see #671).
  4. Verifying that the new Parquet files are written correctly (by reading them directly, without accessing them through the table), and that different files have the correct order when different sort orders apply across multiple snapshots.
  5. More transaction tests: for example, what happens if we begin a transaction and, in the same transaction, add a new sorting order and alter a column name?
  6. What happens with a batch insertion into a table that has a set column order (not inlined)?

The test files you wrote are pretty detailed, but my bio LLM context window is small - do you mind also moving them to a sorted_table folder and modularizing them a bit more?

Also, I guess in general we can use StringUtil::Lower() for catalog access whenever the lookup can be case-insensitive.

Comment on lines +258 to +260:

    if (pre_bound_order.dialect != "duckdb") {
        continue;
    }
Collaborator:

I'm wondering if we should error if it is an unsupported dialect?

Author:

Ah, I will look at the macros as an example. Would you prefer that I filter them out in SQL when I pull from the catalog?

Collaborator:

This is more of a question, I'm open to suggestions. On one hand, we can be more permissive with anything dialect related, but I fear this might cause confusion, while an error removes any kind of ambiguity of the system's state.

Author:

Looking back at this, the reason I implemented it this way was in case we wanted to deliberately have 2 sort orders active at once: one for DuckDB, and one for another engine like Spark for example. So if there were 2 separate sets of sorts, skipping the non-DuckDB ones (but not throwing an error) should allow them to co-exist.

Unfortunately, this case is very difficult to test in this repo! I think as catalog editing gets added on the Spark side, we can do some testing in that repo.

So, do you mind if we leave it as-is?

@Alex-Monahan Alex-Monahan marked this pull request as draft January 12, 2026 20:26
@Alex-Monahan (Author)

> (quoting @pdet's test suggestions above)

Thanks as always for the feedback! These tests make sense and I'm working on them! I had a few follow up questions.

On 3, would you like me to validate that an ALTER TABLE statement on a different table does not interfere? (Ahead of time, or in the same transaction?) Based on #684, I was assuming that any ALTER TABLE on the same table would be a hard line that prevents compaction - please let me know if I am wrong!

On 4, let me try to repeat back to you what I think you are looking for! I think you want to test the situation where I insert into the same table 3 times, each time with a different sort order setting, and then run 1 compaction on that table. My goal was to only consider the most recent ducklake_sort_expression setting, regardless of the value of the setting at insert time, so this test would ensure that! I could also test multiple compactions and make sure that the final compaction re-sorts using the latest sort setting.
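That "latest setting wins" rule can be modeled as follows (the shapes here are hypothetical, purely for illustration):

```python
# Hypothetical: compaction ignores insert-time settings and always uses
# the most recent sort setting recorded for the table.
def sort_setting_for_compaction(history):
    """history: list of (begin_snapshot, expressions) tuples for one table."""
    if not history:
        return None  # table has never had a sort setting applied
    # The entry with the highest begin_snapshot is the current setting.
    return max(history, key=lambda entry: entry[0])[1]

history = [(100, ["a ASC"]), (150, ["c ASC"]), (200, ["b DESC"])]
print(sort_setting_for_compaction(history))  # ['b DESC']
```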

On 6, I'm not sure exactly how to proceed - I could use your help! This PR does not include the capability to do the sorting during an insert, so I'm not sure how the features would intersect just yet. Does adjusting the column order during insert result in the parquet file having the columns stored in a different order? I'm not sure what behavior I should be exercising with the test. Just let me know!


pdet commented Jan 13, 2026

> (quoting the test suggestions above)

> Thanks as always for the feedback! These tests make sense and I'm working on them! I had a few follow up questions.

> On 3, would you like me to validate that an alter table statement on a different table does not interfere? (Ahead of time or in the same transaction?) Based on #684, I was assuming that any alter table on the same table would be a hard line that prevents compaction - please let me know if I am wrong!

Basically, we should check that the alter statement does not increase the schema counter of a table. Since the alter statement here does not change the actual schema, and the sorting is somewhat "optional", I believe that files should still be compactable before/after the alter.

> On 4, let me try to repeat back to you what I think you are looking for! I think you want to test the situation where I insert into the same table 3 times, each time with a different sort order setting, and then I run 1 compaction on that table. My goal was to only consider the most recent ducklake_sort_expression setting, regardless of the value of the setting at insert time, so this test would ensure that! I could also test multiple compactions and make sure that the final compaction re-sorts using the latest sort setting.

I wanted to see that files would always be generated with the latest sorting (e.g., by flushing), and also compacted with the latest sorting, but now that I think about it again, maybe it is overkill because you already have similar tests, so let's skip it.

> On 6, I'm not sure exactly how to proceed - I could use your help! This PR does not include the capability to do the sorting during an insert, so I'm not sure how the features would intersect just yet. Does adjusting the column order during insert result in the parquet file having the columns stored in a different order? I'm not sure what behavior I should be exercising with the test. Just let me know!

Let's skip this.

@Alex-Monahan (Author)

When investigating 3, I found that you are correct that I am currently incrementing the schema_version when altering the sort order. Dang! I think there are a few other cases where we probably want the same "alter table, but don't change the schema_version" behavior. I'm thinking:

  • SET_SORT_KEY
  • SET_COMMENT
  • SET_COLUMN_COMMENT

I'll start trying to implement that behavior, but let me know if I need to course correct! From my first prototype, it requires some additional complexity in a few spots (I think we will need a different construct than new_tables for storing those kinds of alters), so I'm open to feedback and ideas!


pdet commented Jan 15, 2026

> (quoting the previous comment)

Yes, I think none of those should impact the schema ranges for compaction. The main thing you have to do is ensure that these statements won't write a new entry into ducklake_schema_versions, as this table keeps track of schema changes per DuckLake table.
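A minimal sketch of that rule, assuming the alter-type names from the list above (the surrounding logic and the non-metadata alter type in the example are invented):

```python
# Hypothetical: metadata-only alters (sort key, comments) should not bump
# the schema version, so compaction ranges stay intact across them.
METADATA_ONLY_ALTERS = {"SET_SORT_KEY", "SET_COMMENT", "SET_COLUMN_COMMENT"}

def next_schema_version(current, alter_type):
    """Return the schema version a table should have after an alter."""
    # Only alters that change the actual schema advance the version
    # (and would write a new row into ducklake_schema_versions).
    return current if alter_type in METADATA_ONLY_ALTERS else current + 1
```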

@Alex-Monahan (Author)

Yup, I will prevent that! Unfortunately, I'm also finding that I need to update the local schema cache (since usually a new schema version would be handled differently). I'm making progress!
