Re: [DISCUSS] Update Roadmap

Shabeel Syed Mon, 29 Feb 2016 08:22:04 -0800

Hi Moon,

       Here are some of my requirements.


   1. Can we achieve better memory management for notebooks? I'm also
   facing an OOM issue similar to the one Dafeng mentioned in another
   discussion. I'm using the iframe view of a paragraph; can we load that
   code and its results into memory only when requested? I think this is
   one area to focus on.
   2. In the table/graph view, can we include the features below along with
   pagination?

                a) Search, similar to
https://docs.angularjs.org/api/ng/filter/filter
                b) Sorting of columns, possibly with custom sorting algorithms?
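
To make item 2 concrete, here is a minimal sketch of search, column sorting with an optional custom key, and pagination over table rows. It is plain Python for illustration only, not Zeppelin code; the names `search`, `sort_rows`, and `paginate` are hypothetical, and the case-insensitive substring match is only loosely modelled on the AngularJS filter linked above.

```python
from typing import Any, Callable, Dict, List, Optional

Row = Dict[str, Any]


def search(rows: List[Row], term: str) -> List[Row]:
    """Keep rows where any cell contains `term` (case-insensitive substring)."""
    needle = term.lower()
    return [r for r in rows if any(needle in str(v).lower() for v in r.values())]


def sort_rows(
    rows: List[Row],
    column: str,
    key: Optional[Callable[[Any], Any]] = None,
    descending: bool = False,
) -> List[Row]:
    """Sort rows by one column; `key` lets the caller plug in a custom ordering."""
    convert = key if key is not None else (lambda value: value)
    return sorted(rows, key=lambda r: convert(r[column]), reverse=descending)


def paginate(rows: List[Row], page: int, page_size: int) -> List[Row]:
    """Return the rows of a 1-based page."""
    start = (page - 1) * page_size
    return rows[start:start + page_size]


if __name__ == "__main__":
    table = [
        {"name": "alice", "size": "10"},
        {"name": "bob", "size": "9"},
        {"name": "carol", "size": "100"},
    ]
    # Custom sort: compare the "size" column as integers, not as strings.
    print(paginate(sort_rows(table, "size", key=int), page=1, page_size=2))
```

The custom `key` is what I mean by "custom sorting algorithms": the same column can be ordered as text, as numbers, or by any other rule the user supplies.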

    Also, any idea on GA for these suggested improvements?


Regards
Shabeel

On Mon, Feb 29, 2016 at 1:51 PM, Vinayak Agrawal <[email protected]> wrote:

> Moon,
> The new roadmap looks very promising. I am very happy to see security in
> the list.
> I have some suggestions regarding Enterprise Ready features:
>
> 1. Job Scheduler - Can this be improved?
> Currently the scheduler can be used with a cron expression or a pre-set
> time. But in an enterprise solution, a notebook might be one piece of a
> workflow. Can we look at scheduling notebooks based on other notebooks
> finishing their jobs successfully?
> This requirement would arise in any ETL workflow, where all the downstream
> users wait for the ETL notebook to finish successfully. Only after that
> can other business-oriented notebooks be executed.
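
The dependency-based scheduling requested above could look roughly like the sketch below: notebooks run in dependency order, and a downstream notebook is skipped unless all of its upstream notebooks succeeded. This is illustrative Python, not an existing Zeppelin feature; `run_workflow`, the `run` callback, and the notebook names are all hypothetical.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+
from typing import Callable, Dict, Set


def run_workflow(
    dependencies: Dict[str, Set[str]],
    run: Callable[[str], bool],
) -> Dict[str, str]:
    """Run notebooks so that each one starts only after its upstream notebooks.

    `dependencies` maps a notebook to the set of notebooks it waits for.
    `run` executes one notebook and returns True on success.
    """
    status: Dict[str, str] = {}
    # static_order() yields every notebook after all of its predecessors.
    for notebook in TopologicalSorter(dependencies).static_order():
        upstream = dependencies.get(notebook, set())
        if all(status[u] == "success" for u in upstream):
            status[notebook] = "success" if run(notebook) else "failed"
        else:
            # An upstream notebook failed or was skipped, so do not run this one.
            status[notebook] = "skipped"
    return status


if __name__ == "__main__":
    deps = {"etl": set(), "sales_report": {"etl"}, "dashboard": {"sales_report"}}
    print(run_workflow(deps, run=lambda name: True))
```

A cron expression would then only need to trigger the workflow as a whole, while the ordering between notebooks comes from the dependency map.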
>
> 2. Importing a notebook - Is there a current requirement or future plan to
> implement a feature that allows import-notebook-from-github? This would
> allow users to share notebooks seamlessly.
>
> Thanks
> Vinayak
>
> On Sun, Feb 28, 2016 at 11:22 PM, moon soo Lee <[email protected]> wrote:
>
>> Zhong Wang,
>> Right, folder support would be quite useful. Thanks for the opinion.
>> I hope I can finish the work in pr-190
>> <https://github.com/apache/incubator-zeppelin/pull/190>.
>>
>> Sourav,
>> Regarding concurrent running, Zeppelin itself doesn't prevent
>> paragraphs/queries from running concurrently. Each interpreter can implement
>> its own scheduling policy. For example, the SparkSQL interpreter and
>> ShellInterpreter can already run paragraphs/queries concurrently.
>>
>> SparkInterpreter is implemented with a FIFO scheduler, considering the nature
>> of the Scala compiler. That's why users cannot run multiple paragraphs
>> concurrently when they work with SparkInterpreter.
>> But as Zhong Wang mentioned, pr-703 gives each notebook a separate Scala
>> compiler, so paragraphs can run concurrently as long as they're in
>> different notebooks.
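
The scheduling policy described above (FIFO within a notebook, concurrent across notebooks) can be sketched as one single-worker queue per notebook. This is an illustrative Python sketch under that assumption, not Zeppelin's actual scheduler code or the implementation in pr-703; `PerNotebookFifoScheduler` and its methods are hypothetical names.

```python
import threading
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Any, Callable, Dict


class PerNotebookFifoScheduler:
    """Runs paragraphs of one notebook in submission order, notebooks in parallel."""

    def __init__(self) -> None:
        self._executors: Dict[str, ThreadPoolExecutor] = {}
        self._lock = threading.Lock()

    def submit(self, notebook_id: str, paragraph: Callable[[], Any]) -> Future:
        # A single-worker executor per notebook acts as that notebook's FIFO queue.
        with self._lock:
            executor = self._executors.get(notebook_id)
            if executor is None:
                executor = ThreadPoolExecutor(max_workers=1)
                self._executors[notebook_id] = executor
        return executor.submit(paragraph)

    def shutdown(self) -> None:
        with self._lock:
            executors = list(self._executors.values())
        for executor in executors:
            executor.shutdown(wait=True)


if __name__ == "__main__":
    scheduler = PerNotebookFifoScheduler()
    first = scheduler.submit("note-a", lambda: "a1")
    second = scheduler.submit("note-b", lambda: "b1")
    print(first.result(), second.result())
    scheduler.shutdown()
```

With a single shared FIFO queue, a long paragraph in one notebook would delay every other notebook; here it only delays later paragraphs of the same notebook.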
>> Thanks for the feedback!
>>
>> Best,
>> moon
>>
>> On Sat, Feb 27, 2016 at 8:59 PM Zhong Wang <[email protected]>
>> wrote:
>>
>>> Sourav: I think this newly merged PR can help you
>>> https://github.com/apache/incubator-zeppelin/pull/703#issuecomment-185582537
>>>
>>> On Sat, Feb 27, 2016 at 1:46 PM, Sourav Mazumder <
>>> [email protected]> wrote:
>>>
>>>> Hi Moon,
>>>>
>>>> This looks great.
>>>>
>>>> My only suggestion would be to include a PR/feature - Support for
>>>> Running Concurrent paragraphs/queries in Zeppelin.
>>>>
>>>> Right now if more than one user tries to run paragraphs in multiple
>>>> notebooks concurrently through a single Zeppelin instance (and single
>>>> interpreter instance) the performance is very slow. It is obvious that the
>>>> queue gets built up within the zeppelin process and interpreter process in
>>>> that scenario as the time taken to move the status from start to pending
>>>> and pending to running is very high compared to the actual running time of
>>>> a paragraph.
>>>>
>>>> Without this, multi-tenancy support would be meaningless, as no one
>>>> can practically use it in a situation where multiple users are trying to
>>>> connect to the same instance of Zeppelin (and the related interpreter). A
>>>> possible solution would be to spawn a separate instance of the same
>>>> interpreter at every notebook/user level.
>>>>
>>>> Regards,
>>>> Sourav
>>>>
>>>> On Sat, Feb 27, 2016 at 12:48 PM, moon soo Lee <[email protected]> wrote:
>>>>
>>>>> Hi Zeppelin users and developers,
>>>>>
>>>>> The roadmap we have published at
>>>>> https://cwiki.apache.org/confluence/display/ZEPPELIN/Zeppelin+Roadmap
>>>>> is almost 9 months old, and it no longer reflects where the community
>>>>> is going. It's time to update.
>>>>>
>>>>> Based on the mailing list, jira issues, pull requests, feedback from
>>>>> users, conferences and meetings, I could summarize the major interests of
>>>>> users and developers in 7 categories: Enterprise ready, Usability
>>>>> improvement, Pluggability, Documentation, Backend integration, Notebook
>>>>> storage, and Visualization.
>>>>>
>>>>> And I could list related subjects under each category.
>>>>>
>>>>>    - Enterprise ready
>>>>>       - Authentication
>>>>>          - Shiro authentication ZEPPELIN-548
>>>>>          <https://issues.apache.org/jira/browse/ZEPPELIN-548>
>>>>>       - Authorization
>>>>>          - Notebook authorization PR-681
>>>>>          <https://github.com/apache/incubator-zeppelin/pull/681>
>>>>>       - Security
>>>>>       - Multi-tenancy
>>>>>       - Stability
>>>>>    - Usability Improvement
>>>>>       - UX improvement
>>>>>       - Better Table data support
>>>>>          - Download data as csv, etc PR-725
>>>>>          <https://github.com/apache/incubator-zeppelin/pull/725>,
>>>>>          PR-714
>>>>>          <https://github.com/apache/incubator-zeppelin/pull/714>, PR-6
>>>>>          <https://github.com/apache/incubator-zeppelin/pull/6>, PR-89
>>>>>          <https://github.com/apache/incubator-zeppelin/pull/89>
>>>>>          - Featureful table data display (pagination, etc)
>>>>>    - Pluggability ZEPPELIN-533
>>>>>    <https://issues.apache.org/jira/browse/ZEPPELIN-533>
>>>>>       - Pluggable visualization
>>>>>       - Dynamic Interpreter, notebook, visualization loading
>>>>>       - Repository and registry for pluggable components
>>>>>    - Improve documentation
>>>>>       - Improve contents and readability
>>>>>       - more tutorials, examples
>>>>>    - Interpreter
>>>>>       - Generic JDBC Interpreter
>>>>>       - (spark)R Interpreter
>>>>>       - Cluster manager for interpreter (Proposal
>>>>>       <https://cwiki.apache.org/confluence/display/ZEPPELIN/Cluster+Manager+Proposal>)
>>>>>       - more interpreters
>>>>>    - Notebook storage
>>>>>       - Versioning ZEPPELIN-540
>>>>>       <http://issues.apache.org/jira/browse/ZEPPELIN-540>
>>>>>       - more notebook storages
>>>>>    - Visualization
>>>>>       - More visualizations PR-152
>>>>>       <https://github.com/apache/incubator-zeppelin/pull/152>, PR-728
>>>>>       <https://github.com/apache/incubator-zeppelin/pull/728>, PR-336
>>>>>       <https://github.com/apache/incubator-zeppelin/pull/336>, PR-321
>>>>>       <https://github.com/apache/incubator-zeppelin/pull/321>
>>>>>       - Customize graph (show/hide label, color, etc)
>>>>>
>>>>>
>>>>> It will help anyone quickly grasp the overall interests of the project
>>>>> and its direction. And based on this roadmap, we can discuss and re-define
>>>>> the scope of the next release, 0.6.0, and its schedule.
>>>>>
>>>>> What do you think? Any feedback would be appreciated.
>>>>>
>>>>> Thanks,
>>>>> moon
>>>>>
>>>>>
>>>>
>
>
> --
> Vinayak Agrawal
>
>
> "To Strive, To Seek, To Find and Not to Yield!"
> ~Lord Alfred Tennyson
>
