
all tables in lake storage configured cluster should use lake's bucket assigner #362

Merged
merged 2 commits into alibaba:main from use-lake-bucket-assigner
Feb 22, 2025

Conversation

luoyuxia
Collaborator

@luoyuxia luoyuxia commented Feb 8, 2025


Purpose

Linked issue: close #343

Make all tables in a lake-storage-configured cluster use the lake's bucket assignment strategy.

  • If a table is created in a lake-storage-configured cluster, set table.datalake.format to the corresponding datalake format.
  • If a table has table.datalake.format set, use the lake's bucket assigner to assign buckets for rows.
  • Split the current bucket assigner into StaticBucketAssigner and DynamicBucketAssigner. For tables with bucket keys,
    use StaticBucketAssigner to calculate the bucket id while converting a row into a WriteRecord (see the sketch after this list).
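
A minimal sketch of the intended split; the interface shapes below are illustrative assumptions, not the exact signatures from this PR:

```java
// Illustrative sketch only: names and signatures are assumptions based on
// the description above, not the exact code in this PR.
interface StaticBucketAssigner {
    // Tables with bucket keys: the bucket id is a pure function of the
    // encoded bucket key, so it can be computed eagerly while converting
    // a row into a WriteRecord.
    int assignBucket(byte[] bucketKey);
}

interface DynamicBucketAssigner {
    // Tables without bucket keys: the bucket is chosen at runtime,
    // e.g. round-robin over the currently available buckets.
    int assignBucket(int numBuckets);
}
```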

Tests

LakeTableManagerITCase to verify that table.datalake.format will be set in a lake-storage-configured cluster.
FlussLakeTableITCase to verify that write/lookup will use the lake's bucket assigner.

API and Format

Documentation

@luoyuxia luoyuxia force-pushed the use-lake-bucket-assigner branch 5 times, most recently from 84ab0a0 to b69bc6b on February 10, 2025 09:49
@luoyuxia luoyuxia marked this pull request as ready for review February 10, 2025 10:10
@luoyuxia luoyuxia force-pushed the use-lake-bucket-assigner branch from b69bc6b to 12e2d38 on February 11, 2025 08:44
Member

@wuchong wuchong left a comment

Some high-level design suggestions:

  1. The cluster datalake configuration should align with the table configuration. How about:

```
datalake.format = paimon/iceberg
datalake.paimon.catalog = hive
datalake.paimon.metastore = thrift://localhost:9083
datalake.paimon.warehouse = hdfs://localhost:9000/user/hive/warehouse
```
  2. Besides table.datalake.format, we can also add the catalog and metastore information into the table config, so that the table is self-contained and connectors don't need to call describeLakeStorage (describeLakeStorage can then be removed). If any auth/security tokens need to be retrieved, we can add another RPC similar to getFileSystemSecurityToken, but that should be optional.

```
table.datalake.enabled = true/false
table.datalake.format = paimon/iceberg

// logical properties
table.datalake.paimon.catalog = hive
table.datalake.paimon.metastore = thrift://localhost:9083
table.datalake.paimon.warehouse = hdfs://localhost:9000/user/hive/warehouse
```
Considering the metastore location may change over time, it should be a logical property added during getTableInfo rather than a physical property stored in ZooKeeper. Thus, the metastore properties shouldn't be set or altered.

  3. It's confusing that the key encoding differs between bucketing and storing. It's meaningless to keep our own key encoding when it is a datalake-enabled table. We also discussed that bucket rescaling can be supported even if we use Paimon/Iceberg key encoding/bucketing. So we can introduce two separate interfaces for encoding and bucketing:

```java
interface KeyEncoder {
    byte[] encodeKey(InternalRow row);
}

interface BucketAssigner {
    int assignBucket(@Nullable byte[] bucketKey, Cluster cluster);
}
```
  4. We should add tests to verify that the encoding and bucketing are the same as Paimon's.
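
For illustration, here is one way the two proposed interfaces could compose in the write path; the RowBucketer wrapper and its names are hypothetical, not code from this PR:

```java
// Hypothetical composition of the two proposed interfaces; only KeyEncoder
// and BucketAssigner come from the suggestion above, the wrapper is invented.
final class RowBucketer {
    private final KeyEncoder keyEncoder;         // e.g. a Paimon-compatible encoder
    private final BucketAssigner bucketAssigner; // e.g. a Paimon-compatible assigner

    RowBucketer(KeyEncoder keyEncoder, BucketAssigner bucketAssigner) {
        this.keyEncoder = keyEncoder;
        this.bucketAssigner = bucketAssigner;
    }

    int bucketFor(InternalRow row, Cluster cluster) {
        byte[] bucketKey = keyEncoder.encodeKey(row);
        return bucketAssigner.assignBucket(bucketKey, cluster);
    }
}
```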

@luoyuxia luoyuxia force-pushed the use-lake-bucket-assigner branch 7 times, most recently from 021edb8 to 840980a on February 17, 2025 08:14
@luoyuxia luoyuxia mentioned this pull request Feb 17, 2025
@luoyuxia luoyuxia force-pushed the use-lake-bucket-assigner branch from 840980a to 4a8108f on February 17, 2025 08:37
@luoyuxia
Collaborator Author

luoyuxia commented Feb 17, 2025

@wuchong Thanks for the review.
Comments addressed. Let's introduce Paimon's bucket & key encoding in #408 and remove describeLakeStorage in #411.

@@ -124,4 +124,14 @@ public static Map<String, String> extractConfigStartWith(
}
return extractedConfig;
}

private static Map<String, String> normalizeToPaimonConfigs(
Member

Use datalake.paimon.metastore to replace datalake.paimon.catalog.

datalakeFormat.toString(), normalizeToPaimonConfigs(datalakeConfig));
}

private static Map<String, String> normalizeToPaimonConfigs(
Member

Use datalake.paimon.metastore to replace datalake.paimon.catalog.

@@ -186,6 +194,14 @@ private TableDescriptor applySystemDefaults(TableDescriptor tableDescriptor) {
newDescriptor = newDescriptor.withReplicationFactor(defaultReplicationFactor);
}

// if lake storage is not null, we need to add the datalake type
// to the property of the table
if (dataLakeFormat != null) {
Member

We should throw an exception if the table is datalake-enabled but the cluster is not.
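
A hedged sketch of the suggested validation; the property key is taken from this thread, while the surrounding variable names and the exception type are assumptions:

```java
// Illustrative only: assumes tableDescriptor and dataLakeFormat are in scope
// as in applySystemDefaults; the exception type is a placeholder.
boolean lakeEnabled = Boolean.parseBoolean(
        tableDescriptor.getProperties()
                .getOrDefault("table.datalake.enabled", "false"));
if (lakeEnabled && dataLakeFormat == null) {
    throw new IllegalArgumentException(
            "Table is datalake enabled, but the cluster has no datalake format configured.");
}
```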

-        this.lakeTableBucketAssigner = null;
         this.bucketKeyEncoder =
                 CompactedKeyEncoder.createKeyEncoder(lookupRowType, tableInfo.getBucketKeys());
+        this.lakeBucketAssigner = null;
Member

Constructing the bucketKeyEncoder is still very complex. However, we can unify the default and lake assigners into one util method.

         } else {
-            return lakeTableBucketAssigner.assignBucket(
-                    keyBytes, key, metadataUpdater.getCluster());
+            return lakeBucketAssigner.assignBucket(bucketKeyBytes);
Member

Assigning buckets is still not unified and remains complex. We can introduce a BucketingFunction#bucketing(byte[] key, int numBuckets) to unify bucket assignment, instead of using a util method and passing an optional lake bucket assigner. We should hold only one non-null bucket assigner; that will be easier to use.
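
For illustration, a minimal sketch of such a BucketingFunction; the hash used here is a stand-in, not the actual Fluss or Paimon bucketing algorithm:

```java
// Sketch only: the interface shape follows the suggestion above, but the
// hashing scheme is a placeholder, not Fluss's or Paimon's actual one.
interface BucketingFunction {
    int bucketing(byte[] key, int numBuckets);
}

class HashBucketingFunction implements BucketingFunction {
    @Override
    public int bucketing(byte[] key, int numBuckets) {
        int hash = java.util.Arrays.hashCode(key);
        // clear the sign bit so the modulo result is non-negative
        return (hash & Integer.MAX_VALUE) % numBuckets;
    }
}
```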

@wuchong wuchong force-pushed the use-lake-bucket-assigner branch from 4a8108f to fffb61c on February 21, 2025 14:48
@wuchong
Member

wuchong commented Feb 21, 2025

Please review the changes I just pushed, @luoyuxia.

@wuchong wuchong force-pushed the use-lake-bucket-assigner branch from fffb61c to e5f1628 on February 21, 2025 16:29
Collaborator Author

@luoyuxia luoyuxia left a comment

@wuchong Thanks for the commit unifying encoding and bucket assignment. It looks much better after unifying, and it is simpler to use. LGTM

@wuchong wuchong merged commit 3bf243d into alibaba:main Feb 22, 2025
2 checks passed
wuchong pushed a commit that referenced this pull request Feb 22, 2025