lnbest0707-uber opened a new pull request, #12779:
URL: https://github.com/apache/pinot/pull/12779
tags: `feature`, `refactor`, `release-notes`
This adds SchemaConformingTransformerV2, an evolved version of the existing
transformer, with the following new features:
1. Refactored code with better readability and extensibility
2. Support overlapping schema fields, so that schema columns "a" and "a.b"
can coexist. Only primitive-type fields are allowed as values.
3. Extract flattened key-value pairs as mergedTextIndex for better text
search.
4. Add shingle index tokenization for extremely large text fields.
5. Add the flexibility to map JSON-extracted field names to meaningful
user-specified column names.
6. Improve serialization logic to handle nested JSON fields.
7. Handle extracted String-type columns gracefully: a collection or array is
converted to a String if the column type is singleField.
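
To illustrate features 3 and 4, here is a minimal sketch of flattening a nested record into dotted key-value pairs and splitting a long value into overlapping chunks. It is not the code in this PR: the class `FlattenAndShingleSketch`, its method names, the `key:value` document format, and the character-based shingle window with `maxLength` and `overlap` parameters are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; NOT the SchemaConformingTransformerV2 implementation.
final class FlattenAndShingleSketch {

  // Recursively flattens nested maps into dotted keys,
  // e.g. {"a": {"b": 1}} becomes "a.b" -> "1".
  static Map<String, String> flatten(Map<String, Object> record) {
    Map<String, String> out = new LinkedHashMap<>();
    flattenInto("", record, out);
    return out;
  }

  private static void flattenInto(String prefix, Map<String, Object> node,
      Map<String, String> out) {
    for (Map.Entry<String, Object> entry : node.entrySet()) {
      String key = prefix.isEmpty() ? entry.getKey() : prefix + "." + entry.getKey();
      Object value = entry.getValue();
      if (value instanceof Map) {
        @SuppressWarnings("unchecked")
        Map<String, Object> child = (Map<String, Object>) value;
        flattenInto(key, child, out);
      } else {
        // Leaf values (including null) are stored as their string form.
        out.put(key, String.valueOf(value));
      }
    }
  }

  // Renders each flattened pair as one "key:value" text document
  // (the separator is an assumption of this sketch).
  static List<String> toKeyValueDocs(Map<String, String> flat) {
    List<String> docs = new ArrayList<>();
    for (Map.Entry<String, String> entry : flat.entrySet()) {
      docs.add(entry.getKey() + ":" + entry.getValue());
    }
    return docs;
  }

  // Splits text into windows of at most maxLength characters, where
  // consecutive windows share `overlap` characters.
  static List<String> shingle(String text, int maxLength, int overlap) {
    if (maxLength <= 0 || overlap < 0 || overlap >= maxLength) {
      throw new IllegalArgumentException("require 0 <= overlap < maxLength");
    }
    List<String> shingles = new ArrayList<>();
    if (text.length() <= maxLength) {
      shingles.add(text);
      return shingles;
    }
    int step = maxLength - overlap;
    for (int start = 0; start < text.length(); start += step) {
      int end = Math.min(start + maxLength, text.length());
      shingles.add(text.substring(start, end));
      if (end == text.length()) {
        break; // the last window already reaches the end of the text
      }
    }
    return shingles;
  }

  public static void main(String[] args) {
    Map<String, Object> user = new LinkedHashMap<>();
    user.put("id", 42);
    Map<String, Object> ctx = new LinkedHashMap<>();
    ctx.put("user", user);
    Map<String, Object> record = new LinkedHashMap<>();
    record.put("level", "INFO");
    record.put("ctx", ctx);

    System.out.println(toKeyValueDocs(flatten(record)));
    System.out.println(shingle("abcdefghij", 4, 2));
  }
}
```

The overlap means a search term that straddles a chunk boundary still appears whole in at least one chunk, provided the term is no longer than the overlap plus one character.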
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]