1. Would it be possible for users of Kylin 2-4 to upgrade their metadata to
Kylin 5 easily?


I talked with some early users of the new codebase. They told me that they
plan to upgrade from Kylin 3 to the new codebase (Kylin 5), and that they
plan to develop and contribute metadata upgrade tools. So I think this issue
will be solved soon.




2. Would the structures (URL, request and response) of the RESTful APIs in
Kylin 2-4 be kept?


It is a good question, but I have to say that most REST APIs have been
rewritten, so they are called in a new way. I think the new REST doc will
help to solve this partially.




3. Has any benchmark test been done?


I think I will run a benchmark next month.


4. Are features like Realtime Cubing and Cube Planner in Kylin 3 included in
Kylin 5?


Kafka streaming, Realtime cubing and the JDBC source are implemented in a new
way, so the previous code no longer exists.
For Cube Planner, Liu Kun is working hard to implement it in the new codebase
(see https://github.com/apache/kylin/pull/2089 ).





--

Best wishes to you ! 
From :Xiaoxiang Yu





At 2023-02-15 21:03:40, "George Ni" <n...@apache.org> wrote:
>Hi,
>
>Overall, I'd like to give +1 to this proposal, for Kylin 5 has implemented
>so many significant breakthroughs. Below are some of my questions:
>
>1. Would it be possible for users of Kylin 2-4 to upgrade their metadata to
>Kylin 5 easily?
>2. Would the structures (URL, request and response) of the RESTful APIs in
>Kylin 2-4 be kept?
>3. Has any benchmark test been done?
>4. Are features like Realtime Cubing and Cube Planner in Kylin 3 included
>in Kylin 5?
>
>Li Yang <liy...@apache.org> wrote on Wed, Feb 15, 2023 at 17:23:
>
>> As Xiaoxiang mentioned, the code donation has a lot of improvements
>> compared to the current Kylin 4. Many have long been wanted, such as:
>>
>>    - The flexible model can greatly improve the smoothness of adding new
>>    dimensions in a production environment.
>>    - The computed column can bridge the gap of last-mile data transformation.
>>    - The new model metadata design is more friendly to dynamic
>>    indexing.
>>    - Support of 63+ dimensions.
>>
>> Accepting this code base is a good thing for the whole Kylin community.
>>
>> Cheers
>> Yang
>>
>>
>> On Tue, Feb 14, 2023 at 10:46 PM ShaoFeng Shi <shaofeng...@apache.org>
>> wrote:
>>
>> > The current limitations are very difficult to solve in normal ways. For
>> > example, the Cuboid ID is represented by a Long number, which is 64
>> > bits, and the sequence of each dimension is fixed. The Cuboid ID appears
>> > in every part of Kylin's source code. This design couldn't be refactored
>> > easily. So I agree that a whole new design is necessary; in the long
>> > term it can help a lot.
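
To illustrate the Cuboid ID limitation described above, here is a minimal sketch, assuming each dimension occupies one bit of a signed 64-bit long and the bit position follows a fixed dimension order. The function name `cuboid_id`, the constant `MAX_BITS` and the sample dimension names are hypothetical and are not taken from Kylin's source code.

```python
# Hypothetical sketch: names below are illustrative, not Kylin's actual API.
MAX_BITS = 63  # usable bits in a signed 64-bit long (sign bit excluded)


def cuboid_id(selected, dimension_order):
    """Encode a set of dimensions as a bitmask over a fixed dimension order."""
    if len(dimension_order) > MAX_BITS:
        raise ValueError("more than 63 dimensions do not fit in a signed 64-bit long")
    n = len(dimension_order)
    mask = 0
    for position, name in enumerate(dimension_order):
        if name in selected:
            # The first dimension takes the highest bit, so reordering
            # dimensions changes the IDs of cuboids that contain them.
            mask |= 1 << (n - 1 - position)
    return mask


order = ["year", "country", "category"]
base = cuboid_id(set(order), order)    # all three dimensions selected
by_year = cuboid_id({"year"}, order)   # only the first dimension selected
```

Under these assumptions, moving `year` to the second slot changes its cuboid ID from 4 to 2, and a 64th dimension has no bit left to occupy, which is one way to picture why the fixed ordering and the dimension cap are hard to lift without a redesign.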
>> >
>> > Best regards,
>> >
>> > Shaofeng Shi 史少锋
>> > Apache Kylin PMC,
>> > Apache Incubator PMC,
>> > Email: shaofeng...@apache.org
>> >
>> > Apache Kylin FAQ: https://kylin.apache.org/docs/gettingstarted/faq.html
>> > Join Kylin user mail group: user-subscr...@kylin.apache.org
>> > Join Kylin dev mail group: dev-subscr...@kylin.apache.org
>> >
>> >
>> >
>> >
>> > Xiaoxiang Yu <x...@apache.org> wrote on Tue, Feb 14, 2023 at 14:22:
>> >
>> > > A formatted version of the discussion with the same content:
>> > >
>> > > ## Background ##
>> > >
>> > > As we discussed in the mailing list[2] last year, Kylin 4.0 has achieved
>> > > its goal in new storage (columnar file) and new query engine (Spark
>> > > based), and gained some adoption from the community. But due to the old
>> > > design from the early versions, Kylin 4.0 still keeps some limitations
>> > > from previous versions, such as the cap of 63 dimensions and a cube
>> > > structure that cannot be modified once built. We think the only way to
>> > > solve those limitations is to do a whole redesign, especially in the
>> > > metadata.
>> > >
>> > > The good news is that Kyligence started doing that years ago, and its
>> > > commercial version has been verified by many customers in terms of its
>> > > functionality, performance and stability. Last year, Kyligence open
>> > > sourced its core under Apache License v2.0, and signed a CCLA with the
>> > > Apache Software Foundation. We staged it in a separate branch of the
>> > > GitHub repository for review[1]. Engineers from other teams such as eBay
>> > > also reviewed the codebase, and put forward many new ideas. We think
>> > > that, based on this codebase, Kylin will gain not only a flexible
>> > > metadata design and a faster computing engine, but also richer user
>> > > scenarios.
>> > >
>> > > The new codebase has the following features compared with the latest
>> > > release (Kylin 4.0.3):
>> > >
>> > > - More flexible and enhanced data model
>> > >     * Allow adding new dimensions and measures to the existing data
>> > >       model
>> > >     * The model adapts to table schema changes while retaining the
>> > >       existing index on a best-effort basis
>> > >     * Support last-mile data transformation using Computed Column
>> > >     * Support raw query (non-aggregation query) using Table Index
>> > >     * Support changing dimension table (SCD2)
>> > > - Simplified metadata design
>> > >     * Merge DataModel and CubeDesc into new DataModel
>> > >     * Add DataFlow for more generic data sequence, e.g. streaming-like
>> > >       data flow
>> > >     * New metadata AuditLog for better cache synchronization
>> > > - More flexible index management
>> > >     * Add IndexPlan to support flexible index management
>> > >     * Add IndexEntity to support different index types
>> > >     * Add LayoutEntity to support different storage layouts of the same
>> > >       Index
>> > > - Toward a native and vectorized query engine
>> > >     * Experiment: Integrate with a native execution engine, leveraging
>> > >       Gluten
>> > >     * Support async query
>> > >     * Enhance cost-based index optimizer
>> > > - More
>> > >     * Build engine refactoring and performance optimization
>> > >     * New web UI based on Vue.js, a brand new front-end framework, to
>> > >       replace AngularJS
>> > >     * Smooth modeling process on one canvas
>> > >
>> > >
>> > >
>> > >
>> > > ## Proposal ##
>> > > So, I'd like to propose adopting the new codebase from Kyligence as
>> > > Kylin's future code base, i.e. Kylin 5. If accepted, we will request an
>> > > IP clearance in Apache Incubator for it as the next step.
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > ## Reference ##
>> > > [1] https://github.com/apache/kylin/tree/kylin5
>> > > [2] https://lists.apache.org/thread/4fkhyw1fyf0jg5cb18v7vxyqbn6vm3zv
>> > >
>> > >
>> > > --
>> > >
>> > > Best wishes to you !
>> > > From :Xiaoxiang Yu
>> > >
>> > >
>> > >
>> > >
>> > >
>> > > At 2023-02-14 14:09:31, "Xiaoxiang Yu" <x...@apache.org> wrote:
>> > > >Background
>> > > >
>> > > >
>> > > >As we discussed in the mailing list[2] last year, Kylin 4.0 has
>> > > >achieved its goal in new storage (columnar file) and new query engine
>> > > >(Spark based), and gained some adoption from the community. But due to
>> > > >the old design from the early versions, Kylin 4.0 still keeps some
>> > > >limitations from previous versions, such as the cap of 63 dimensions
>> > > >and a cube structure that cannot be modified once built. We think the
>> > > >only way to solve those limitations is to do a whole redesign,
>> > > >especially in the metadata.
>> > > >
>> > > >
>> > > >The good news is that Kyligence started doing that years ago, and its
>> > > >commercial version has been verified by many customers in terms of its
>> > > >functionality, performance and stability. Last year, Kyligence open
>> > > >sourced its core under Apache License v2.0, and signed a CCLA with the
>> > > >Apache Software Foundation. We staged it in a separate branch of the
>> > > >GitHub repository for review[1]. Engineers from other teams such as
>> > > >eBay also reviewed the codebase, and put forward many new ideas. We
>> > > >think that, based on this codebase, Kylin will gain not only a flexible
>> > > >metadata design and a faster computing engine, but also richer user
>> > > >scenarios.
>> > > >
>> > > >
>> > > >The new codebase has the following features compared with the latest
>> > > >release (Kylin 4.0.3):
>> > > >- More flexible and enhanced data model
>> > > >    * Allow adding new dimensions and measures to the existing data
>> > > >      model
>> > > >    * The model adapts to table schema changes while retaining the
>> > > >      existing index at the best effort
>> > > >    * Support last-mile data transformation using Computed Column
>> > > >    * Support raw query (non-aggregation query) using Table Index
>> > > >    * Support changing dimension table (SCD2)
>> > > >- Simplified metadata design
>> > > >    * Merge DataModel and CubeDesc into new DataModel
>> > > >    * Add DataFlow for more generic data sequence, e.g. streaming like
>> > > >      data flow
>> > > >    * New metadata AuditLog for better cache synchronization
>> > > >- More flexible index management
>> > > >    * Add IndexPlan to support flexible index management
>> > > >    * Add IndexEntity to support different index type
>> > > >    * Add LayoutEntity to support different storage layouts of the same
>> > > >      Index
>> > > >- Toward a native and vectorized query engine
>> > > >    * Experiment: Integrate with a native execution engine, leveraging
>> > > >      Gluten
>> > > >    * Support async query
>> > > >    * Enhance cost-based index optimizer
>> > > >- More
>> > > >    * Build engine refactoring and performance optimization
>> > > >    * New WEB UI based on Vue.js, a brand new front-end framework, to
>> > > >      replace AngularJS
>> > > >    * Smooth modeling process on one canvas
>> > > >Proposal
>> > > >So, I'd like to propose adopting the new codebase from Kyligence as
>> > > >Kylin's future code base, i.e. Kylin 5. If accepted, we will request an
>> > > >IP clearance in Apache Incubator for it as the next step.
>> > > >Reference
>> > > >https://github.com/apache/kylin/tree/kylin5
>> > > >https://lists.apache.org/thread/4fkhyw1fyf0jg5cb18v7vxyqbn6vm3zv
>> > > >https://kylin.apache.org/5.0/blog/introduction_of_metastore_cn
>> > > >
>> > > >--
>> > > >
>> > > >Best wishes to you !
>> > > >From :Xiaoxiang Yu
>> >
>>
>
>
>-- 
>
>---------------------
>
>Best regards,
>
>
>
>Ni Chunen / George
