[HUDI-8680] Update doc description for metadata index partition enable #13214

voonhous
Member

@voonhous voonhous commented Apr 23, 2025

Change Logs

ENABLE_METADATA_INDEX_PARTITION_STATS is effectively enabled by default. This behaviour was introduced in the PR below, which sets its default value to the value of ENABLE_METADATA_INDEX_COLUMN_STATS:

https://github.com/apache/hudi/pull/12707/files#diff-11e9ef6bd53ef1001b669a1dc68dde2aba9b33c9eb72cc1e4198750336d79772R893

However, the default value declared on the config itself is false. This PR changes the config value to remove this discrepancy. Automated config docs will also be updated as a result to reduce confusion.

In this PR, we are updating the doc description to better reflect the default value of this config.

Impact

None

Risk level (write none, low medium or high below)

None

Documentation Update

Default value of ENABLE_METADATA_INDEX_PARTITION_STATS / hoodie.metadata.index.partition.stats.enable will be changed in the config docs.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@voonhous voonhous requested a review from codope April 23, 2025 13:39
@voonhous
Member Author

Actually, let me reuse the JIRA of PR https://github.com/apache/hudi/pull/12707

@voonhous voonhous changed the title [MINOR] Fix default value of metadata partition stats enable [HUDI-8680] Fix default value of metadata partition stats enable Apr 23, 2025
@voonhous voonhous force-pushed the fix-redudant-partition-stats-config-discrepancy branch from 05353ce to 6083ec0 Compare April 23, 2025 13:41
@github-actions github-actions bot added the size:XS PR with lines of changes in <= 10 label Apr 23, 2025
@@ -381,9 +381,10 @@ public final class HoodieMetadataConfig extends HoodieConfig {

   public static final ConfigProperty<Boolean> ENABLE_METADATA_INDEX_PARTITION_STATS = ConfigProperty
       .key(METADATA_PREFIX + ".index.partition.stats.enable")
-      .defaultValue(false)
+      .defaultValue(true)
Contributor

We do this at a per-engine level; that's why we did not flip this value.

If you check HoodieMetadataConfig, we have getDefaultColStatsEnable(), which chooses the right col stats value for different engines, and the default value for partition stats enable goes along with the derived value for col stats.
If col stats is enabled, PSI (partition stats index) will also be enabled; if not, it will not.

We just need to update our release docs to call this out.
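
The derivation described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not Hudi's actual code: the class `DerivedDefaultSketch`, the `Engine` enum, the choice of which engine enables column stats by default, and the rule that an explicitly set value overrides the derived one are all assumptions made for the example.

```java
import java.util.Optional;

public class DerivedDefaultSketch {

  /** Hypothetical engine list, for illustration only. */
  public enum Engine { SPARK, FLINK, JAVA }

  /**
   * Hypothetical engine-specific default for column stats.
   * Which engines enable it by default is an assumption of this sketch.
   */
  public static boolean defaultColStatsEnable(Engine engine) {
    return engine == Engine.SPARK;
  }

  /** Assumed rule: an explicit setting wins, otherwise the engine default applies. */
  public static boolean isColStatsEnabled(Engine engine, Optional<Boolean> explicitColStats) {
    return explicitColStats.orElseGet(() -> defaultColStatsEnable(engine));
  }

  /**
   * Partition stats follows the resolved column stats value unless it is set
   * explicitly, so the "false" declared on the config is never the effective default.
   */
  public static boolean isPartitionStatsEnabled(Engine engine,
                                                Optional<Boolean> explicitColStats,
                                                Optional<Boolean> explicitPartitionStats) {
    return explicitPartitionStats.orElseGet(() -> isColStatsEnabled(engine, explicitColStats));
  }

  public static void main(String[] args) {
    // Print the effective defaults when the user sets neither config.
    for (Engine engine : Engine.values()) {
      boolean colStats = isColStatsEnabled(engine, Optional.empty());
      boolean partitionStats = isPartitionStatsEnabled(engine, Optional.empty(), Optional.empty());
      System.out.println(engine + ": colStats=" + colStats + ", partitionStats=" + partitionStats);
    }
  }
}
```

Under these assumptions, the effective partition stats default differs per engine even though a single static default is declared on the config, which is why the description needs to call the behaviour out.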

Member Author

I have updated the description, PTAL.

Contributor

@nsivabalan nsivabalan left a comment

not sure apart from doc discrepancy, what else we are trying to solve here.

@voonhous
Member Author

not sure apart from doc discrepancy, what else we are trying to solve here.

No other issues aside from the doc discrepancy. I'll change the doc description to call this behaviour out and keep the default value as false.

@voonhous voonhous changed the title [HUDI-8680] Fix default value of metadata partition stats enable [HUDI-8680] Update doc description for metadata index partition enable Apr 23, 2025
@voonhous voonhous requested a review from nsivabalan April 23, 2025 17:29
@voonhous voonhous force-pushed the fix-redudant-partition-stats-config-discrepancy branch from faba473 to 53eb921 Compare April 23, 2025 17:32
@voonhous voonhous force-pushed the fix-redudant-partition-stats-config-discrepancy branch from 53eb921 to eac3675 Compare April 23, 2025 17:46
@github-actions github-actions bot added size:S PR with lines of changes in (10, 100] and removed size:XS PR with lines of changes in <= 10 labels Apr 23, 2025
@nsivabalan nsivabalan merged commit abae99c into apache:master Apr 24, 2025
57 of 58 checks passed
voonhous added a commit to voonhous/hudi that referenced this pull request Apr 25, 2025
Labels
release-1.0.2 size:S PR with lines of changes in (10, 100]