[ 
https://issues.apache.org/jira/browse/FLINK-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17119635#comment-17119635
 ] 

Etienne Chauchot commented on FLINK-17961:
------------------------------------------

Thanks Aljoscha for commenting. ES has data streams features but only for time 
series data; the aim of this source is to read all kind of data. Apart from 
data streams it behaves like a database. You read the content of an index 
(similar to a table) corresponding to the given query (similar to SQL). So, 
regarding streaming changes, if there are changes between 2 read requests, at 
the second the whole index (containing the change) will be read another time. 
Regarding failover: I guess exactly once semantics cannot be guaranteed only at 
least once. Indeed there is no ack mechanism on already read data. Under those 
circumstances, I guess an ES source cannot get into ES. So what should a user 
do to read from ES? Should he send ES requests manually from a Map ?

> Create an Elasticsearch source
> ------------------------------
>
>                 Key: FLINK-17961
>                 URL: https://issues.apache.org/jira/browse/FLINK-17961
>             Project: Flink
>          Issue Type: New Feature
>          Components: Connectors / ElasticSearch
>            Reporter: Etienne Chauchot
>            Priority: Minor
>
> There is only an Elasticsearch sink available. There are opensource github 
> repos such as [this 
> one|[https://github.com/mnubo/flink-elasticsearch-source-connector]]. Also 
> the apache bahir project does not provide an Elasticsearch source connector 
> for flink either. IMHO I think the project would benefit from having an 
> bundled source connector for ES alongside with the available sink connector.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to