Anyone any thoughts on this? On Sat, Jan 4, 2020 at 5:16 PM Debraj Manna <subharaj.ma...@gmail.com> wrote:
> I am using samza on yarn with Kafka. I need to reduce the number of > partitions in kafka. I am ok with some data loss. Can someone suggest what > should be the recommended way of doing this? > > Samza Job Config looks like this - > > job.factory.class = org.apache.samza.job.yarn.YarnJobFactory > task.class = com.vnera.grid.task.GenericStreamTask > task.window.ms = 100 > systems.kafka.samza.factory = > org.apache.samza.system.kafka.KafkaSystemFactory > systems.kafka.consumer.zookeeper.connect = localhost:2181 > systems.kafka.consumer.auto.offset.reset = largest > systems.kafka.producer.metadata.broker.list = localhost:9092 > systems.kafka.producer.producer.type = sync > systems.kafka.producer.batch.num.messages = 1 > systems.kafka.samza.key.serde = string > serializers.registry.string.class = > org.apache.samza.serializers.StringSerdeFactory > yarn.package.path = > file://${basedir}/target/${project.artifactId}-${pom.version}-dist.tar.gz > yarn.container.count = ${container.count} > yarn.am.container.memory.mb = ${samzajobs.memory.mb} > job.name = job4 > task.inputs = kafka.Topic3 >