GitHub user pdu-mn1 opened a pull request: https://github.com/apache/samza/pull/432
    Initial implementation of remote table provider

    Please see commit messages for detailed descriptions.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/pdu-mn1/samza remote-table-0222

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/samza/pull/432.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #432

----
commit 5f9c71ef374055acbf82ea1b828631ebc6168ea9
Author: Peng Du <pdu@...>
Date:   2018-02-22T19:49:09Z

    Make Table initiable with config and task context

    Samza table implementations might need to access global config or
    data structures in the task context, e.g. to register metrics with
    MetricsRegistry. This change adds an init() method to ReadableTable,
    similar to other initable entities in Samza. TableManager will invoke
    init() before handing out Table instances to the operators.

commit 71b3cf33d3c4fc4ab962f4216c590272e47208c6
Author: Peng Du <pdu@...>
Date:   2018-02-22T19:51:53Z

    Initial RemoteTable implementation

    RemoteTable is a generic table representation for plugging remote
    stores with random access by key, e.g. certain REST services or
    key-value databases, into the Samza API. The actual IO functions are
    abstracted by the RecordReader and RecordWriter interfaces.
    Implementations of record readers/writers for commonly used remote
    store technologies will be added later on to ease application
    development with remote tables.

----

---
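For readers skimming the two commits, the design they describe can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in the pull request: the method signatures of RecordReader, RecordWriter, and RemoteTable are guesses based on the commit messages, a plain Map stands in for Samza's config and task context, and an in-memory HashMap stands in for the remote store.

```java
import java.util.HashMap;
import java.util.Map;

public class RemoteTableSketch {

    /** Hypothetical stand-in: reads one record by key from a remote store. */
    public interface RecordReader<K, V> {
        V read(K key);
    }

    /** Hypothetical stand-in: writes one record by key to a remote store. */
    public interface RecordWriter<K, V> {
        void write(K key, V value);
    }

    /**
     * Hypothetical table that delegates all IO to a reader and a writer and
     * must be initialized before use, mirroring the init() step the first
     * commit describes.
     */
    public static class RemoteTable<K, V> {
        private final RecordReader<K, V> reader;
        private final RecordWriter<K, V> writer;
        private boolean initialized = false;

        public RemoteTable(RecordReader<K, V> reader, RecordWriter<K, V> writer) {
            this.reader = reader;
            this.writer = writer;
        }

        /**
         * Assumption: the real init() receives config and task context;
         * this sketch takes a plain map and only records that it was called.
         */
        public void init(Map<String, String> config) {
            this.initialized = true;
        }

        public V get(K key) {
            requireInitialized();
            return reader.read(key);
        }

        public void put(K key, V value) {
            requireInitialized();
            writer.write(key, value);
        }

        private void requireInitialized() {
            if (!initialized) {
                throw new IllegalStateException("init() must be called before use");
            }
        }
    }

    public static void main(String[] args) {
        // In-memory map standing in for a remote key-value store.
        Map<String, String> store = new HashMap<>();
        RemoteTable<String, String> table = new RemoteTable<>(store::get, store::put);

        // The owner of the table (TableManager in the commit message) calls
        // init() before handing the table to operators.
        table.init(new HashMap<>());

        table.put("user-1", "Alice");
        System.out.println(table.get("user-1"));
    }
}
```

Keeping the IO behind two small interfaces means a REST client or a database client can be plugged in without changing the table itself, which appears to be the intent stated in the second commit.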