Hi,

I am new to Samza, I am evaluating Samza as the backbone for my streaming
CEP requirement.  I have:

1. Multiple data enrichment and ETL jobs
2. Multiple domain specific CEP rulesets
3. Common/shared system services like consuming topics/streams and
persisting the messages in ElasticSearch and HDFS.

My questions are:

1. Can I, or is it recommended to, package multiple jobs as 1 deployment
with 1 properties file or keep each app separated?  Based on the
documentation, it appears to support 1 app/job within a single
configuration as there is no mechanism to assign multiple app classes and
given each a name unless I am mistaken.
2. If only 1 app per config+deployment, what is the best way to handle
requirement #3 - common/shared system services as there is no app or job
per say, I just need to specify the streams and output system (ie
org.apache.samza.system.hdfs.writer.
BinarySequenceFileHdfsWriter or
org.apache.samza.system.elasticsearch.indexrequest.DefaultIndexRequestFactory).
Given it's a common shared system service not tied to specific jobs, can it
be deployed without an app?

Thank you in advance for your help, looking forward to learning more about
Samza and developing this critical feature using Samza!

Regards,

Eric

Reply via email to