suxiaogang223 opened a new pull request, #63809:
URL: https://github.com/apache/doris/pull/63809
### What problem does this PR solve?
Issue Number: close #xxx
Related PR: #xxx
Problem Summary:
External catalog meta cache statistics exposed cumulative eviction count,
but did not provide a direct replacement frequency metric for judging whether
cache capacity is too small. This PR adds `EVICTION_RATE` to
`information_schema.catalog_meta_cache_statistics`, calculated as
`eviction_count / request_count` and returned as `0` when there are no requests.
Hive partition metadata cache defaults were also too small for common
external catalog workloads, causing frequent evictions without explicit tuning.
This PR increases the default Hive single-partition cache capacity from 10,000
to 100,000 and the Hive partitioned-table values cache capacity from 1,000 to
10,000. While checking similar cache entries, MaxCompute `partition_values` was
found to cache table-level partition value structures but use the Hive
single-partition capacity; it now follows the table-level partition values
capacity.
### Release note
Add `EVICTION_RATE` to `information_schema.catalog_meta_cache_statistics`,
increase default Hive partition meta cache capacities, and make MaxCompute
`partition_values` use the table-level partition values capacity.
### Check List (For Author)
- Test <!-- At least one of them must be included. -->
- [ ] Regression test
- [x] Unit Test
- `./run-fe-ut.sh --run
org.apache.doris.datasource.metacache.MetaCacheEntryTest`
- `./run-fe-ut.sh --run
org.apache.doris.datasource.hive.HiveMetaStoreCacheTest,org.apache.doris.datasource.maxcompute.MaxComputeExternalMetaCacheTest`
- [ ] Manual test (add detailed scripts or steps below)
- [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
- [ ] Previous test can cover this change.
- [ ] No code files have been changed.
- [ ] Other reason <!-- Add your reason? -->
- Behavior changed:
- [ ] No.
- [x] Yes. `catalog_meta_cache_statistics` includes `EVICTION_RATE`;
default Hive partition meta cache capacities are larger; MaxCompute
`partition_values` uses the table-level partition values capacity.
- Does this need documentation?
- [x] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->
### Check List (For Reviewer who merge this PR)
- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR should
merge into -->
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]