ssage about how the container is requesting more
resources than it can allocate. With 1 core, everything is fine. Is
there another Samza option I need to set?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
d be sufficient. We haven't seen this
> issue before. What Samza/YARN versions are you using? Can you also include
> the logs from where you get the error and your yarn configuration?
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 2:33 AM Malcolm McFarland
> wrote:
>
this? Looking at the Samza source on Github,
it appears to be information that's passed back to the AM when it
starts up.
Cheers,
Malcolm
On Mon, Apr 1, 2019 at 10:44 AM Malcolm McFarland
wrote:
>
> Hi Prateek,
>
> Sorry, meant to include these versions with my email; I'm
etected this and decided to default to 1? Can you
> try setting maximum-allocation-vcores lower?
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 11:59 AM Malcolm McFarland
> wrote:
>
> > One other detail: I'm running YARN on ECS in AWS. Has anybody seen
> > issues with
yScheduler.html>
> and DominantResourceCalculator to account for vcore allocations in
> scheduling.
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 3:00 PM Malcolm McFarland
> wrote:
>
> > Hi Prateek,
> >
> > This still seems to be manifesting with the same problem. Sinc
One more thing -- fwiw, I actually also came across the possibility that I
would need to use the DominantResourceCalculator, but as you point out,
this doesn't seem to be available in Hadoop 2.6.
On Mon, Apr 1, 2019 at 5:27 PM Malcolm McFarland
wrote:
> That's quite helpfu
=org.apache.hadoop.yarn.util.resource.DominantResourceCalculator
And on the Samza side, I'm setting:
cluster-manager.container.cpu.cores=2
However, YARN is still telling me that the running task has 1 vcore
assigned. Do you have any other suggestions for options to tweak?
Cheers,
Malcolm
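For readers following the calculator discussion: YARN's DefaultResourceCalculator compares resources by memory only, while DominantResourceCalculator also takes vcores into account. The sketch below is a simplified illustration of that difference and is not YARN's implementation; the `Resource` class, the function names, and the cluster and container sizes are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resource:
    memory_mb: int
    vcores: int


def memory_only_share(request: Resource, cluster: Resource) -> float:
    """Share as seen by a memory-only calculator: vcores are ignored."""
    return request.memory_mb / cluster.memory_mb


def dominant_share(request: Resource, cluster: Resource) -> float:
    """Share of the most constrained dimension (memory or vcores)."""
    return max(request.memory_mb / cluster.memory_mb,
               request.vcores / cluster.vcores)


# Hypothetical sizes, chosen only for illustration.
cluster = Resource(memory_mb=16384, vcores=4)
container = Resource(memory_mb=2048, vcores=2)

print(memory_only_share(container, cluster))  # 0.125
print(dominant_share(container, cluster))     # 0.5
```

With a memory-only view the request looks small, whereas the dominant-resource view shows that CPU is the tighter constraint. This is one reason a vcore request may appear to have no effect on scheduling under the default calculator.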
On Mon, Apr 1, 2019 at 5:28 PM Malcolm McFarland
wrote:
esponse. The YARN RM supports container requests with max-mem:
> > 14336, max-cpu: 1"
> >
> > On Tue, Apr 2, 2019 at 12:09 AM Malcolm McFarland
> > wrote:
> >
> >> Hey Prateek,
> >>
> >> The upgrade to Hadoop 2.7.6 went fine; everything seems
tion shown on-demand, as
opposed to preemptive?
Cheers,
Malcolm
On Tue, Apr 2, 2019 at 12:54 PM Malcolm McFarland
wrote:
> Hi Prateek,
>
> I'm not getting an error now, just an unyielding vcore allotment of 1.
> I just verified that we
from the v2.6.1 docs (which I was initially using because of
its inclusion in the hello-samza project) to mean that this was a
per-container setting.
Thanks again for the help, and for the tip on upgrading to YARN 2.7.6!
Cheers,
Malcolm
On Tue, Apr 2, 2019 at 1:47 PM Malcolm McFarland
wrote
situation here:
> >
> >
> https://stackoverflow.com/questions/55737123/samza-1-1-0-run-app-sh-does-not-work-during-deployment-of-hello-samza
> >
> > Can someone on your team please help?
> >
> > Many thanks,
> >
> > Majd
> >
> >
>
container to try and
deduce what's happening?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
This correspondence is from HealthPlanCRM, LLC, d/b/a Cavulus. Any
unauthorized or improper disclosure, copying, distribution, or use of
the contents of this message is prohibited. The information con
e, May 7, 2019 at 3:22 PM Malcolm McFarland
wrote:
> Hey folks,
>
> We're having some trouble running Samza under YARN. The YARN
> containers are launching fully into the RUNNING state, and I can see
> in the node manager logs that the containers are running, but my logs
> are sh
n-version?
>
> Can you roll-back to a known-good version to better isolate the issue?
>
> Best,
> Jagadish
>
> On Tue, May 7, 2019 at 3:54 PM Malcolm McFarland
> wrote:
>
> > As a followup to this, here's what I see when the Samza app tries to
> start;
https://confluence.atlassian.com/doc/generating-a-thread-dump-externally-182158040.html#GeneratingaThreadDumpExternally-GeneratingthreaddumpsonLinux
> java.net.ConnectException
Since this is a ConnectException, can you rule out network
process on a YARN cluster, from Accepted status, to
localization, to the application master startup, to the actual
application's startup?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
astructure is
configured. I think I need to find which logs to look at in order to trace
the startup/localization steps. If anybody has advice on this, I'd
appreciate it!
Cheers,
Malcolm
On Mon, May 20, 2019 at 9:49 PM Malcolm McFarland
wrote:
>
> Hey Jagadish,
>
> Thanks for the ti
ething you were looking for?
>
> Also, by "don't fully start up" do you mean that
> applications are missing some containers (but the ApplicationMaster is
> running)?
> Or is the application missing entirely?
>
> --
> thanks
> rayman
> [image: Samza Job La
Hey folks,
Are there any guidelines for how to provision an Application Master in
relation to the number of StreamTask instances it will be managing? I.e., are
there different memory, CPU, and thread-count figures for 100s of StreamTasks
vs 1000 vs 1?
Cheers,
Malcolm McFarland
Cavulus
nd the container (ie,
yarn.am.container.memory.mb=1536, yarn.am.opts=-Xmx1024m); does that sound
reasonable?
Cheers,
Malcolm McFarland
Cavulus
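As a rough aid for judging the two values quoted above: the container limit has to cover the JVM heap plus non-heap memory (metaspace, thread stacks, direct buffers), so the gap between the two settings is the number to watch. The sketch below only does that arithmetic; `heap_mb_from_opts` is a hypothetical helper, and whether the resulting headroom is enough depends on the workload.

```python
import re


def heap_mb_from_opts(jvm_opts: str) -> int:
    """Extract the -Xmx value in MB from a JVM options string (m or g suffix)."""
    match = re.search(r"-Xmx(\d+)([mMgG])", jvm_opts)
    if match is None:
        raise ValueError("no -Xmx setting found")
    value, unit = int(match.group(1)), match.group(2).lower()
    return value * 1024 if unit == "g" else value


container_mb = 1536                        # yarn.am.container.memory.mb from the message
heap_mb = heap_mb_from_opts("-Xmx1024m")   # yarn.am.opts from the message

headroom_mb = container_mb - heap_mb
print(headroom_mb)                         # 512
print(round(heap_mb / container_mb, 2))    # 0.67
```

Here the heap takes about two thirds of the container, leaving 512 MB for everything outside the heap.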
at's interfering with this, or maybe
just isn't white-listing a port correctly, and if I could identify where
the application is stalling, it'd probably help to narrow down the
possibilities.
Cheers,
Malcolm McFarland
Cavulus
other than
what's returned from /bin/hostname (I'm guessing it uses gethostname() on
Ubuntu, could be wrong).
Anybody have ideas?
Cheers,
Malcolm McFarland
Cavulus
Hey folks,
Is there any way to specify which ports the tasks communicate with the job
coordinator on? Right now it looks like it's 3+. Is there any way to
narrow this?
Cheers,
Malcolm McFarland
Cavulus
te only allows 10GB of storage (this can be extended a small amount
via an ephemeral mounted volume but seemingly not enough to satisfy YARN's
VM requirements).
Hth, and thanks for everybody's patience,
Malcolm McFarland
Cavulus
No problem -- I'm happy that we finally figured this out and could share
our results. ECS could actually be a good choice for Node Managers; it's
easy in ECS to scale node counts up and down and to cycle out unhealthy
servers.
Malcolm McFarland
Cavulus
re if the bundle is built locally or not?
Cheers,
Malcolm McFarland
Cavulus
* the task from affect the box that *runs* the task?
Cheers,
Malcolm McFarland
Cavulus
as anybody attempted this sort of combination?
Cheers,
Malcolm McFarland
Cavulus
[0]
https://kafka.apache.org/0110/javadoc/org/apache/kafka/clients/producer/ProducerInterceptor.html
[1]
https://cwiki.apache.org/confluence/display/KAFKA/KIP-42%3A+Add+Producer+and+Consumer+Interceptors
* topics to be removed.
I have two questions:
1) Where within these queues is the grouper configuration stored?
2) Would a Kafka topic cleanup.policy of "compact" cause trouble here?
Cheers,
Malcolm McFarland
Cavulus
when Samza picks up the
message for processing. Are there any ideas out there about how to do this?
Cheers,
Malcolm McFarland
Cavulus
io silence on a YARN
deploy (we've had no other code-related issues transitioning to YARN).
I'll take a look through the metrics and see if any of those could fill
this role. Right now we're looking at per-partition consumption, and maybe
"process-calls" will help with th
mechanism. Are there any best practices or gotchas
surrounding restarting Samza applications on YARN that could help here?
Cheers,
Malcolm McFarland
Cavulus
Hey folks,
We have cleanup.policy=compact set on our checkpoint topics. Even with
this, we have almost 3 billion messages in some of these topics, and this
is causing huge startup times. Are there any other settings we should set
to optimize our startup times?
Cheers,
Malcolm McFarland
Cavulus
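To make the expectation behind this question concrete: log compaction is meant to retain only the latest record for each key, so a fully compacted checkpoint topic would hold about one record per distinct key. The sketch below models that ideal end state only; the `compact` function and the sample records are hypothetical and do not reflect Samza's actual checkpoint key format.

```python
from typing import Dict, Iterable, List, Tuple


def compact(log: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Keep only the most recent value per key, ordered by last-write position."""
    latest: Dict[str, Tuple[int, str]] = {}
    for offset, (key, value) in enumerate(log):
        latest[key] = (offset, value)
    ordered = sorted(latest.items(), key=lambda item: item[1][0])
    return [(key, value) for key, (_, value) in ordered]


# Hypothetical checkpoint records: (task name, checkpointed offset).
log = [("task-0", "100"), ("task-1", "40"), ("task-0", "250"),
       ("task-1", "90"), ("task-0", "300")]

print(compact(log))  # [('task-1', '90'), ('task-0', '300')]
```

Real brokers compact asynchronously and never compact the active segment, so a topic can hold far more records than this ideal. A record count in the billions suggests compaction may not be keeping up, or may not be running, on those topics.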
s and
yarn.nodemanager.process-kill-wait.ms YARN values. Would this give Samza
more time to shut down, perhaps allowing unpersisted checkpoints to be
written out?
Cheers,
Malcolm McFarland
Cavulus
=compact?
Cheers,
Malcolm McFarland
Cavulus
Actually, do you have an example of some appropriate settings for Kafka to
ensure that compaction is behaving correctly for the Samza checkpoint
topics?
Cheers,
Malcolm McFarland
Cavulus
Also, is there a way to produce this error, ie if we added extra messages
to the __checkpoint topics?
Malcolm McFarland
Cavulus
so, do you recommend scaling up the processing capability of each YARN node,
or scaling out the YARN node count?
Thanks,
Malcolm McFarland
Cavulus
What would the effect be on a container that was only allowed one CPU core?
Would it be ok to trade that off for more containers?
Cheers,
Malcolm McFarland
Cavulus
about good
rule-of-thumb values for each of these parameters.
Cheers,
Malcolm McFarland
Cavulus
ver?
We're running Samza 0.14.1 and using AWS MSK, which is running Kafka version
2.2.1.
Thanks so much,
Malcolm McFarland
Cavulus
ation. Fwiw, we're using Kafka 0.11.0.2 with Samza 0.14.1; my
understanding is that there should be version compatibility between Kafka
0.11.0.x-2.x.
If you have any other ideas, I'd be interested in hearing them.
Cheers,
Malcolm McFarland
Cavulus
On Fri, Apr 10, 2020 at 8:36 PM Yi Pa
in
pid=a3e86ddf-8d18-40c9-8063-1efd588cec56. Stopping this processor. New
JobModel: JobModel [..]
At this point the ThreadJob shuts down cleanly. Afaict, the legacy
configuration is set up correctly, and mirrors our functional build under
0.14.1. Any thoughts?
Cheers,
Malcolm McFarland
Cavulus
t down for session: 0x18fae2900a0
2020-12-17 17:04:32.309 [Samza Debounce
Thread-81ac6a4e-3d5e-479c-9a6c-2f2d9b4372d3] StreamProcessor [INFO]
Shutting down the executor service of the stream processor:
81ac6a4e-3d5e-479c-9a6c-2f2d9b4372d3.
Does this help?
Cheers,
Malcolm McFarland
Cav
ones using coordination service) one must use
org.apache.samza.container.grouper.task.GroupByContainerIdsFactory". I
added that in and everything is starting up smoothly.
Cheers,
Malcolm McFarland
Cavulus
ress) correctly:
HADOOP_YARN_HOME
YARN_HOME
HADOOP_COMMON_HOME
HADOOP_HOME
HADOOP_PREFIX
HADOOP_CONF_DIR
My yarn-site.xml is at $YARN_HOME/etc/hadoop/yarn-site.xml, and contains
the following configuration:
<property>
  <name>yarn.resourcemanager.address</name>
  <value>${RM_IP_ADDRESS}</value>
</property>
This worked fine in 0.14.1.
Cheers,
Ma
e the local yarn-site.xml
accurately. When it starts on yarn, though, it seems to be resolving the
resource manager to localhost. The final exception information is at the
end of this email. Any ideas?
Cheers,
Malcolm McFarland
Cavulus
Failed to connect to server: localhost/127.0.0.1:8030: retries get f
Hey all,
Has Samza 1.7.0 been officially released? I've been following the
discussion here, and it seems like it was cleared in March, but I haven't
seen any announcements or updates to the docs. Not trying to be pushy here,
just curious about the status of release 1.7.0.
Cheer
this PR
and see if we can integrate it into a local Java 11-based build. Thanks
Jamie!
Cheers,
Malcolm McFarland
Cavulus
On Wed, Aug 24, 2022 at 1:47 PM James DeMichele
wrote:
> Hi, I'm not sure if my previous email went through so thought that I would
> try again.
>
> I know
/simple-legacy-task-1-2.0-coordinationData/jobModelGeneration/jobModelVersion
At which point the application silently exits.
Thanks in advance for any advice, ideas, things to check, etc.
Cheers,
Malcolm McFarland
Cavulus
.0 (
https://hadoop.apache.org/docs/r3.3.0/index.html). Are there any unit tests
in Samza that verify compatibility against a YARN cluster? If so, that
could be a place to validate YARN v2.10/v3.3 cross-compatibility.
Just throwing my 2 cents out there,
Malcolm McFarland
Cavulus
On Fri, Sep 2, 2022
nybody know a) if it's possible to define a custom key-to-partition
hashing algorithm in Samza, or b) if there is a reliable general-purpose
algorithm that can create the same results as Samza's algorithm?
Cheers,
Malcolm McFarland
Cavulus
[0]
https://github.com/apache/samz
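One way to approach question (b) is to reproduce Java's hashing semantics outside the JVM. The sketch below assumes the partition key is a String and that the partition is derived as `abs(key.hashCode()) % numPartitions`; neither assumption is confirmed in this thread, so check it against the linked producer source before relying on it. The function names are hypothetical. Note that Kafka's own default partitioner instead hashes the serialized key bytes with murmur2, which gives different results.

```python
def java_string_hash(s: str) -> int:
    """Replicate java.lang.String.hashCode() over UTF-16 code units (signed 32-bit)."""
    h = 0
    data = s.encode("utf-16-be")
    for i in range(0, len(data), 2):
        unit = (data[i] << 8) | data[i + 1]
        h = (31 * h + unit) & 0xFFFFFFFF
    return h - 0x100000000 if h >= 0x80000000 else h


def java_abs(n: int) -> int:
    """Replicate Math.abs(int): Integer.MIN_VALUE maps to itself."""
    return n if n == -0x80000000 else abs(n)


def partition_for(key: str, num_partitions: int) -> int:
    """Assumed scheme: abs(key.hashCode()) % numPartitions with Java semantics."""
    h = java_abs(java_string_hash(key))
    # Java's % truncates toward zero, so a negative dividend gives a non-positive result.
    return h % num_partitions if h >= 0 else -((-h) % num_partitions)


print(java_string_hash("abc"))    # 96354
print(partition_for("abc", 8))    # 2
```

The `java_abs` edge case matters: a key whose hash is `Integer.MIN_VALUE` stays negative after `abs`, so any port of this scheme needs to mirror that behavior to match the JVM's result exactly.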
pecific about my
reasoning!
Cheers,
Malcolm McFarland
Cavulus
[0]
https://github.com/apache/samza/blob/1.7.0/samza-kafka/src/main/scala/org/apache/samza/system/kafka/KafkaSystemProducer.scala#L97-L101
[1]
https://github.com/apache/kafka/blob/2.4/clients/src/main/java/org/apache/kafka/clients/produce
Hey all,
I just noticed that, in the blog post for Apache Samza 1.8.0, there is mention of
following instructions for the 1.7.0 upgrade. However, there is no blog
post about 1.7.0, nor is there a version of the documentation for 1.7.0.
Did that get accidentally dropped?
Cheers,
Malcolm McFarland
Just found this under the "Releases" tab. A little confusing, but that
works!
Cheers,
Malcolm McFarland
Cavulus
On Thu, Apr 27, 2023 at 4:51 PM Malcolm McFarland
wrote:
> Hey all,
>
> I just noticed that, in the blog post for Apache Samza 1.8.0, there is mention
> of followin
what is the preferred way to run a
single instance of a streamtask locally? I'm using Samza 1.6.0, Kafka
2.2.2, and ZooKeeper 3.4.14.
Thanks in advance for the help!
Cheers,
Malcolm McFarland
Cavulus
Here's a sample of the last few log messages, log level set to TRACE:
[INFO] Metad
an autoscaling module
(removed in version 1.4.0), but no actual documentation or examples.
Cheers,
Malcolm McFarland
Cavulus