pdsh is available on the head node only, but when I tried to run
*start-cluster* from the head node (note that the JobManager node is not the
head node) it didn't work, which is why I modified the scripts.
Yes, exactly, this is what I was trying to do. My research area has been on
these NUMA-related issues and binding
Greg,
Where did you see the OOM log shown in that mail thread? In my case neither
the TaskManagers nor the JobManager report an error like this.
On Sun, Jul 10, 2016 at 8:45 PM, Greg Hogan wrote:
> These symptoms sound similar to what I was experiencing in the following
> thread. Flink can have
These symptoms sound similar to what I was experiencing in the following
thread. Flink can have some unexpected memory usage which can result in an
OOM kill by the kernel, and this becomes more pronounced as the cluster
size grows.
https://www.mail-archive.com/dev@flink.apache.org/msg06346.html
Hi Saliya,
Would you happen to have pdsh (parallel distributed shell) installed? If so
the TaskManager startup in start-cluster.sh will run in parallel.
As to running 24 TaskManagers together, are these running across multiple
NUMA nodes? I had filed FLINK-3163 (
https://issues.apache.org/jira/br
Thank you. Yes, the previous format is still supported. The change only
kicks in if a number is specified after the hostname.
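
To make the backward-compatible format concrete, here is a rough sketch of how such host entries could be read. It is only an illustration, not the actual script change: the function name parse_host_entries is made up, and it assumes the optional number is separated from the hostname by whitespace and means "how many TaskManagers to start on that host" (defaulting to 1 when absent).

```python
def parse_host_entries(lines):
    """Parse host entries of the form 'hostname' or 'hostname N'.

    Assumption (not confirmed by the thread): N is the number of
    TaskManagers to start on that host. A bare hostname keeps the
    previous behaviour, i.e. a count of 1.
    """
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            # Skip blank lines.
            continue
        parts = line.split()
        host = parts[0]
        # The optional second field only takes effect when present.
        count = int(parts[1]) if len(parts) > 1 else 1
        entries.append((host, count))
    return entries


if __name__ == "__main__":
    # Hypothetical host names, mixing the old and the new format.
    sample = ["node1", "node2 4", "", "node3 2"]
    for host, count in parse_host_entries(sample):
        print(host, count)
```

With this reading, a file that contains only bare hostnames behaves exactly as before, which is the compatibility property described above.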
On Sun, Jul 10, 2016 at 5:42 PM, Gyula Fóra wrote:
> Hi,
>
> I think this would be a nice addition especially for Flink clusters
> running on big machines wh
Hi,
I think this would be a nice addition especially for Flink clusters running
on big machines where you might want to run multiple task managers just to
split the memory between multiple java processes.
In any case the previous config format should also be supported as the
default.
I am curiou
Hi David!
Have you had a look at the docs for Event Time and Watermark Generation?
There are some examples for some typical cases:
Event Time / Watermark Overview:
https://ci.apache.org/projects/flink/flink-docs-master/apis/streaming/event_time.html
Typical Watermark Generators:
https://ci.apac
I cannot join or coGroup two Tuple2 datasets of the same type. The error is
java.lang.IllegalArgumentException: The two inputs have different execution
contexts.:-(
After changing to env.setStreamTimeCharacteristic(TimeCharacteristic.ProcessingTime)
and removing assignTimestampsAndWatermarks(new MyTimestampExtractor),
the code now executes.
One more question. I read the Javadoc [1]; it seems a watermark is a mark
telling operators that no more elements will arriv