Thank you for the information! This is a very interesting topic.
At last year’s Flink Forward conference, there was an interesting talk about 
Hologres:
 
https://www.flink-forward.org/sf-2020/conference-program#data-warehouse--data-lakes--what-s-next-
 

What do you think?

> On Nov 4, 2021, at 10:16 PM, Caizhi Weng <tsreape...@gmail.com> wrote:
> 
> Hi!
> 
> Flink is a distributed, stateful streaming dataflow engine (with 
> optimizations for batch and OLAP jobs too), and it currently does not ship 
> with any storage system. It needs to be used along with external storage or 
> computation systems such as HDFS, Hive, Kafka, Iceberg, etc. to build a data 
> warehouse [1] or data lake [2].
> 
> Most use cases [3] of Flink include continuous streaming or long-running 
> batch analytical jobs (which appear at all levels of a data warehouse, 
> including ETL jobs), so I can't say Flink is specialized for ETL or OLAP. 
> As for OLAP, a few companies are currently using Flink as their OLAP 
> execution engine. If you're interested, you can keep an eye on Flink Forward 
> Asia this year, where two talks are about using Flink as an OLAP execution 
> engine in production (search for "olap" in [4] for more detail).
> 
> [1] https://www.alibabacloud.com/blog/flink-is-attempting-to-build-a-data-warehouse-simply-by-using-a-set-of-sql-statements_596346
> [2] https://www.alibabacloud.com/blog/building-an-enterprise-level-real-time-data-lake-based-on-flink-and-iceberg_597755
> [3] https://flink.apache.org/usecases.html
> [4] https://flink-forward.org.cn/#agenda
> 
> Ww J <junww2...@gmail.com> wrote on Fri, Nov 5, 2021 at 12:49 PM:
> Thanks. Can Flink replace popular OLAP databases such as AWS 
> Redshift?
> It seems to me that Flink is generally used as the ETL layer for OLAP.
> 
>> On Nov 4, 2021, at 9:33 PM, Caizhi Weng <tsreape...@gmail.com> wrote:
>> 
>> Hi!
>> 
>> Yes, you can. Note that it is recommended to run Flink in session cluster 
>> mode (instead of per-job mode) to minimize the distribution and scheduling 
>> time for each OLAP query.
>> 
>> Ww J <junww2...@gmail.com> wrote on Fri, Nov 5, 2021 at 12:30 PM:
>> Hi,
>> 
>> Can Flink be used for OLAP queries?
>> 
>> Thanks,
>> 
>> Jack
> 
