Re: Samza for text processing

2019-04-28 Thread Rob Martin
Please let us know should you have further questions!
> > On Sun, Apr 28, 2019 at 7:09 AM Rob Martin wrote:
> > > I'm looking at creating a distributed streaming pipeline for processing text documents (e.g. cleaning, NER and machine learning). Documents will generally

Samza for text processing

2019-04-28 Thread Rob Martin
I'm looking at creating a distributed streaming pipeline for processing text documents (e.g. cleaning, NER and machine learning). Documents will generally be under 1 MB and processing will be stateless. I was aiming to feed documents from various sources, plus additional data, into Kafka to be streamed to th

Fwd: Access Kafka headers

2019-04-27 Thread Rob Martin
How do you access Kafka headers from an implementation of StreamApplication? I can only access the message content from a MessageStream. Thanks in advance