Skip to content

Conversation

shujingyang-db
Copy link
Contributor

What changes were proposed in this pull request?

This PR implements the repartitionById method for PySpark DataFrames

Why are the changes needed?

Support Direct Passthrough Partitioning in the PySpark

Does this PR introduce any user-facing change?

Yes

How was this patch tested?

New unit tests.

Was this patch authored or co-authored using generative AI tooling?

@HyukjinKwon HyukjinKwon changed the title [SPARK-53429] Support Direct Passthrough Partitioning in the PySpark Dataframe API [SPARK-53429][PYTHON] Support Direct Passthrough Partitioning in the PySpark Dataframe API Sep 10, 2025
@shujingyang-db
Copy link
Contributor Author

@HyukjinKwon @zhengruifeng Can you please help review the PR? Thanks a lot

@zhengruifeng
Copy link
Contributor

LGTM pending CI

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants