mapleFU
Member

@mapleFU mapleFU commented Sep 1, 2025

Which issue does this PR close?

Previously, v2 used "delta" to encode values; however, it's too expensive for non-delta data. Can we switch it to delta length byte array?

Rationale for this change

Change the parquet-v2 default encoding to delta-length-byte-array

What changes are included in this PR?

Change the parquet-v2 default encoding to delta-length-byte-array

Are these changes tested?

Covered by existing tests.

Are there any user-facing changes?

No

@github-actions github-actions bot added the parquet Changes to the parquet crate label Sep 1, 2025
@etseidl
Contributor

etseidl commented Sep 4, 2025

Maybe we should discuss this one in an issue. I know the spec says delta_length_byte_array is the default fallback, but I think delta_byte_array is used by parquet-java (I'll admit to not having looked at this for several years). I'm mildly in favor of this change, but others might have different opinions.
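
For readers weighing the two encodings, here is a minimal std-only sketch of how they lay out the same values. It is a simplified model, not the parquet crate's implementation: the function names `delta_length_layout` and `delta_byte_array_layout` are hypothetical, and the real encodings additionally delta-bit-pack the integer streams, which is omitted here. It illustrates why prefix-based encoding only pays off when adjacent values share prefixes.

```rust
/// Simplified model of DELTA_LENGTH_BYTE_ARRAY: one stream of lengths,
/// followed by all value bytes concatenated.
/// (Hypothetical helper; the real encoding also bit-packs the lengths.)
fn delta_length_layout(values: &[&[u8]]) -> (Vec<usize>, Vec<u8>) {
    let lengths: Vec<usize> = values.iter().map(|v| v.len()).collect();
    let data: Vec<u8> = values.iter().flat_map(|v| v.iter().copied()).collect();
    (lengths, data)
}

/// Simplified model of DELTA_BYTE_ARRAY: for each value, the length of the
/// prefix shared with the previous value, plus the remaining suffix.
/// (Hypothetical helper; the real encoding also bit-packs the prefix lengths.)
fn delta_byte_array_layout(values: &[&[u8]]) -> (Vec<usize>, Vec<Vec<u8>>) {
    let mut prefix_lengths = Vec::with_capacity(values.len());
    let mut suffixes = Vec::with_capacity(values.len());
    let mut previous: &[u8] = &[];
    for &value in values {
        // Count leading bytes shared with the previous value.
        let shared = previous
            .iter()
            .zip(value.iter())
            .take_while(|(a, b)| a == b)
            .count();
        prefix_lengths.push(shared);
        suffixes.push(value[shared..].to_vec());
        previous = value;
    }
    (prefix_lengths, suffixes)
}

fn main() {
    // Sorted-like data: adjacent values share long prefixes.
    let shared: [&[u8]; 3] = [b"apple", b"applet", b"apply"];
    // Unrelated data: no shared prefixes, so the prefix scan buys nothing.
    let unrelated: [&[u8]; 3] = [b"zebra", b"apple", b"mango"];

    for values in [&shared, &unrelated] {
        let (lengths, data) = delta_length_layout(values);
        let (prefixes, suffixes) = delta_byte_array_layout(values);
        let suffix_bytes: usize = suffixes.iter().map(|s| s.len()).sum();
        println!(
            "lengths={:?} data_bytes={} | prefixes={:?} suffix_bytes={}",
            lengths,
            data.len(),
            prefixes,
            suffix_bytes
        );
    }
}
```

In this model, values with no common prefixes produce all-zero prefix lengths and suffixes equal to the full values, so the prefix comparison and the extra integer stream are pure overhead relative to the length-only layout.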
