I found out that there is a limitaion in a number of schedulers in 
SchedulerFactory.java[1]

"executor = ExecutorFactory.singleton().createOrGet("SchedulerFactory", 100);"

It can be tested by:
Set a small number for SchedulerFactory, for example 16.
Run notes with interpreters in an isolated mode per user and per note.
See pending paragraphs when a dozen of interpreter processes will start.

There is no limitation in total number of started interpreter processes, but 
there is a limitation in schedulers.
Scheduler born inside interpreter. If we need a limitation it's to be good to 
limit a number of interpreter processes.

Is this limitation in schedulers useful?


1. 
https://github.com/apache/zeppelin/blob/master/zeppelin-interpreter/src/main/java/org/apache/zeppelin/scheduler/SchedulerFactory.java


Maksim Belousov

From: Belousov Maksim Eduardovich [mailto:m.belou...@tinkoff.ru]
Sent: Tuesday, October 03, 2017 10:37 AM
To: users@zeppelin.apache.org
Subject: RE: Is any limitation of maximum interpreter processes?

> Which interpreter is pending ?
There comes a time when any paragraph with any interpreter doesn't run and 
remains in 'Pending' state.
We use local spark instances in spark interpretator.

Logs don't contain errors.


Максим Белоусов
Архитектор
Отдел отчетности и витрин данных
Управление хранилищ данных и отчетности
Тел.: +7 495 648-10-00, доб. 2271

From: Jianfeng (Jeff) Zhang [mailto:jzh...@hortonworks.com]
Sent: Tuesday, October 03, 2017 2:01 AM
To: users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>
Subject: Re: Is any limitation of maximum interpreter processes?


Which interpreter is pending ? It is possible that spark interpreter pending 
due to yarn resource capacity if you run it in yarn client mode

If it is pending, you can check the log first.



Best Regard,
Jeff Zhang


From: Belousov Maksim Eduardovich 
<m.belou...@tinkoff.ru<mailto:m.belou...@tinkoff.ru>>
Reply-To: "users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>" 
<users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>>
Date: Monday, October 2, 2017 at 9:26 PM
To: "users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>" 
<users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>>
Subject: Is any limitation of maximum interpreter processes?

Hello, users!

Our analysts run notes with such interpreters: markdown, one or two jdbc and 
pyspark. The interpreters are instantiated Per User in isolated process and Per 
Note in isolated process.

And the analysts complain that sometimes paragraphs aren't processed and stay 
in status 'Pending'.
We noticed that it happen when number of started interpreter processes is about 
90-100.
If admin restarts one of the popular interpreter (that is killing some 
interpreter processes), the paragraphs become 'Running'.

We can't see any workload on zeppelin server when paragraphs are pended. RAM is 
sufficiently, iowait ~ 0
Also we can't find out any parameters about maximum interpreter processes.


Has anyone of you faced the same problem? How can this problem be solved?


Thanks,

Maksim Belousov


Reply via email to