Re: [DISCUSS] Hybrid Source Connector

Timo Walther Mon, 12 Dec 2022 02:44:37 -0800

Hi Ran,

Thanks for proposing a FLIP. Btw according to the process, the subjectof this email should be `[DISCUSS] FLIP-278: Hybrid Source Connector` sothat people can identify this discussion as a FLIP discussion.

Supporting the hybrid source for SQL was a long-standing issue on ourroadmap. Happy to give feedback here:


1) Options

Coming up with stable long-term options should be a shared effort.Having an index as a key could cause unintended side effects if theindex is not correctly chosen, I would suggest we use IDs instead.


What do you think about the following structure?

CREATE TABLE ... WITH (
  'sources'='historical;realtime',   -- Config option of type string list
  'historical.connector' = 'filesystem',
  'historical.path' = '/tmp/a.csv',
  'historcal.format' = 'csv',
  'realtime.path' = '/tmp/b.csv',
  'realtime.format' = 'csv'"
)

I would limit the IDs to simple [a-z0-9_] identifiers. Once we supportmetadata columns, we can also propagate these IDs easily.


2) Schema field mappings

The FLIP mentions `schema-field-mappings` could you elaborate on this inthe document?


3) Start position strategies

Have you thought about how we can represent start position strategies.The FLIP is very minimal but it would be nice to at least hear someopinions on this topic. Maybe we can come up with some general strategythat makes the most common use case possible in the near future.


Thanks,
Timo



On 08.12.22 10:26, Ran Tao wrote:

Hi guys. HybridSource is a good feature, but now released version did not
support table & sql api for a long time.

I notice that there is a related ticket here:
https://issues.apache.org/jira/browse/FLINK-22793
but the progress is slow, i wonder can we push forward this function.

I have wrote a discussed FLIP,  look forward to your comments.
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=235836225

Re: [DISCUSS] Hybrid Source Connector

Reply via email to