Re: why BlobServer use ServerSocket instead of Netty's ServerBootstrap?

2018-06-11 Thread makeyang
thanks -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

why BlobServer use ServerSocket instead of Netty's ServerBootstrap?

2018-06-10 Thread makeyang
After checking the code, I found that BlobServer uses ServerSocket instead of Netty's ServerBootstrap. I wonder why, and whether it is OK to migrate to ServerBootstrap.

Re: is there a config to ask taskmanager to keep retrying connect to jobmanager after Disassociated?

2018-06-05 Thread makeyang
Can anybody share any thoughts or insights about this issue?

Re: some default config questions

2018-06-04 Thread makeyang
thanks

Re: why does flink release package preferred uber jar than small jar?

2018-06-04 Thread makeyang
Thanks rongrong, but it seems irrelevant.

why does flink release package preferred uber jar than small jar?

2018-06-04 Thread makeyang
What benefit does an uber jar have that small jars don't?

Re: is there a config to ask taskmanager to keep retrying connect to jobmanager after Disassociated?

2018-06-04 Thread makeyang
So is there a way, or a config option, to make the taskmanager keep trying to connect to the jobmanager?

Re: is there a config to ask taskmanager to keep retrying connect to jobmanager after Disassociated?

2018-06-04 Thread makeyang
When I debug the jobmanager, the error log below appears in the task manager: 2018-06-04 17:16:33,295 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor - The heartbeat of ResourceManager with id 35df0455efc2fb6fa3f2467f7f5d2ba1 timed out. 2018-06-04 17:16:33,297 DEBUG org.apache.flink.r

is there a config to ask taskmanager to keep retrying connect to jobmanager after Disassociated?

2018-06-03 Thread makeyang
I deployed a standalone cluster with Flink 1.5, and when I try to restart the only jobmanager, the task manager prints the log below: 2018-06-04 12:06:35,882 WARN akka.remote.ReliableDeliverySupervisor - Association with remote system [akka.tcp://flink@ipaddress:6123] ha

Re: is it OK to PR just for code comment errors?

2018-06-01 Thread makeyang
got it. thanks guys

some default config questions

2018-05-31 Thread makeyang
Why is the default value of jobmanager.execution.failover-strategy "full" rather than "individual"? Why is the default value of taskmanager.jvm-exit-on-oom "false" rather than "true"? The code is Flink 1.5.
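For context, both options named in the question can be overridden in flink-conf.yaml. This is a minimal sketch, assuming the non-default values the question has in mind; only the two keys and their 1.5 defaults come from the post, and whether these overrides are advisable is exactly what the thread asks.

```yaml
# flink-conf.yaml (illustrative override; per the post, the Flink 1.5 defaults are "full" and "false")
jobmanager.execution.failover-strategy: individual
taskmanager.jvm-exit-on-oom: true
```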

is it OK to PR just for code comment errors?

2018-05-31 Thread makeyang
I am not sure whether it is OK to open a PR just for code comment errors; I found one while reading the code.

Re: Task did not exit gracefully and lost TaskManager

2018-05-30 Thread makeyang
I met the same problem in 1.4: when I cancel a job, one of the taskmanagers keeps logging the exception.

Re: latency critical job

2018-05-28 Thread makeyang
Rong Rong: my Flink version is 1.4.2. Since we are using a Docker environment that shares disk I/O, our observation is that a disk I/O spike caused by another process on the same physical machine can lead to long operator processing times.

Re: latency critical job

2018-05-28 Thread makeyang
Timo: thanks for your suggestion

does Flink call FullGC to reclaim direct memory mainly occupied by RocksDB

2018-05-25 Thread makeyang
Each time a job is cancelled, does Flink call a full GC to reclaim the direct memory mainly occupied by RocksDB? If so, where does this happen? If not, why not?

latency critical job

2018-05-25 Thread makeyang
Some jobs are latency critical, meaning they can't accept latency above a certain threshold. Will Flink provide a timeout operator in the near future, so that when one operator times out, the jobmanager schedules a new operator which starts from the previous state of the operator and keeps dealing with new ev

chained operator with different parallelism question

2018-05-18 Thread makeyang
someStream.filter(...).map(...).map(...); these operators are supposed to be chained. But what if they are given different parallelism, like below: someStream.filter(...).setParallelism(X).map(...).setParallelism(Y).map(...).setParallelism(Z); with X != Y != Z. What will happen?

Re: Fwd: Decrease initial source read speed

2018-05-18 Thread makeyang
Andrei Shumanski: which source are you using?

are there any ways to test the performance of rocksdb state backend?

2018-05-17 Thread makeyang
I'd like to integrate a newer version of RocksDB with Flink. I'd like to know whether there are existing tools or ways to benchmark the RocksDB state backend, to see whether performance improves or drops. MaKeyang TIG.JD.COM

in order to use epoll (taskmanager.network.netty.transport: epoll), is it required that linux version is 4.0.16 or newer or not

2018-04-20 Thread makeyang
My Flink version is 1.4.2, my JDK version is 1.8.0.20, and my Linux version is 3.10.0. I tried to use epoll with the setting taskmanager.network.netty.transport: epoll, but it throws an exception, which led me here: https://github.com/apache/flink-shaded/issues/30 I followed the instructions and the exception is still thrown.
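The setting under discussion, written out as a flink-conf.yaml fragment. This is a sketch only: the key is assumed to be Flink's taskmanager.network.netty.transport option, and the fragment says nothing about whether epoll actually loads on the Linux 3.10.0 setup described in the post.

```yaml
# flink-conf.yaml: select the Netty transport used by the taskmanager network stack
# accepted values are "nio" (the default) and "epoll"
taskmanager.network.netty.transport: epoll
```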

gonna need more logs when task manager is shutting down

2018-04-19 Thread makeyang
One of my task managers dropped out of the cluster. I checked its log and found the following: 2018-04-19 22:34:47,441 INFO org.apache.flink.runtime.taskmanager.Task - Attempting to fail task externally Process (115/120) (19d0b0ce1ef3b8023b37bdfda643ef44). 2018-04-19 22:34:47,441 I

Re: Slow flink checkpoint

2018-04-16 Thread makeyang
Since Flink Forward SF is over, could you take a few minutes to look at this issue and share some thoughts on it? Could you help review or comment on my design, or give us a design so that I can help implement it? Thanks a lot.

heartbeat.timeout in 1.4 document

2018-04-13 Thread makeyang
In the Flink 1.4 code: HeartbeatManagerOptions HEARTBEAT_TIMEOUT = key("heartbeat.timeout").defaultValue(5L); but this config is not found in https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/config.html

Re: Slow flink checkpoint

2018-04-04 Thread makeyang
The test is very promising: the sync part drops from a couple of seconds to a couple of milliseconds, a 1000x reduction (the overall time is not reduced, since the work just moves from sync to async). Are you interested in this change?

Re: Slow flink checkpoint

2018-04-01 Thread makeyang
I have put a lot of effort into this issue and tried to resolve it: 1. Let me describe the current timers' snapshot path first: a) for each key group, invoke InternalTimeServiceManager.snapshotStateForKeyGroup b) InternalTimeServiceManager creates an InternalTimerServiceSerializationProxy to write sn

Re: Restart hook and checkpoint

2018-03-19 Thread makeyang
Currently there is only a time-based way to trigger a checkpoint. Based on this discussion, I think Flink needs to introduce an event-based way to trigger checkpoints; for example, restarting a task manager should count as an event.