Thank you Massimo for your detailed reply.

Some comments inline:

On 03/09/2016 09:50 AM, massimo di stefano wrote:
Hi,

I’ll try to address some of the issues raised in the last couple of mails:

1)  concern about adding extra materials to the live
2) Include or drop jupyter from 9.5 release
3) contributing new notebooks to the live
4) which notebook to include on Osgeo-Live 9.5


1) About the concern in adding extra materials to the live:

To address:
Cameron:
I concerned about how much new material we are attempting add related to 
Jupyter notebooks, all at the last moment.
Angelos:
Furthermore, as part of the Jupyter setup, a big number [2] [3] of Python 
projects/libraries were also added to OSGeoLive, as soft dependencies to 
support Notebooks, without official review or decision by the community,

 From the dependencies listed by Angelos and linked here:

[2] https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L30 
<https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L30>
[3] https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L43 
<https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L43>
My proposed PR —>  https://github.com/OSGeo/OSGeoLive-Notebooks/pull/5 
<https://github.com/OSGeo/OSGeoLive-Notebooks/pull/5>


uses the following packages:

###
python-matplotlib
python-scipy
goal-bin

python-geographiclib
python-geocoder
###

 From which only the last 2 are new to  the osgeolive.
The other packages where already installed in the previous version of the 
OSGeo-live
and are pretty common in the OSGeo software ecosystem.

In the PR comments, I added notes on its footprint, which is in the order of 10 
MB.

Thank you for the detailed assessment of the dependencies brought in by your work.
Not including gdal-bin was actually a bug ;)
You are correct that the other packages were already present in the previous release, I just wanted to point out that we are being strict to the Notebook itself, but we include other projects without even discussing about it...


2) Include jupyter YES or NO

I’ll leave this to the community, but i should clarify that:

-  ’there is a tremendous amount of work behind the finally complete jupiter 
packaging’ that makes the live a better product.
-   all the notebooks developed for the GSoC are working just fine with ipython 
notebook  as well as with jupiter notebook.

And I should point out that:
-  ipython notebook is not something new to the OSGeo-live, it is more than one 
year that is shipped within the OSGeo-live.
-  the notebook has been successful adopted by some of the main OSGeo projects 
during OSGeo-conference workshops
    (see [1] [2] as example)
- to have the GSoC merged into the current OSGeo-live, there is no need of 
jupiter ... if you really want to drop it :( .

For the record, I am in favor of including the Python stack in OSGeo-Live, including Jupyter. We DO need a working Jupyter quickstart to be able to include it though, and it has to be ready within the next 48h



3) contributing new notebooks to the live:

Although the notebook can be used as a “scripting tool” or like a simple ide 
IMHO the notebook is not just that, the notebook adds the capabilities to 
include rich descriptive narrative to a data processing workflow including the 
possibilities to interact with data through a nicely done widget interactive 
system  and finally enabling the printing of publication-ready reports (see 
latex inline rendering and pdf export).

+1


Also, on the OSGeo-live, we are aiming (AFAIK) to use the same `common dataset` 
among projects (or, at least, we attempt to) this way different software 
projects can be more easily compared,  say for complexity of their usage or for 
output quality/performance comparison.

Very valid point raised here regarding the use of the common dataset in the Notebooks. This should be a requirement for selection.


Having said that,
from a first look at all the other notebooks (or I should say, the only ones if 
the PR will be not accepted ) shipped within the OSGeo-live I noticed  that 
almost all the notebooks under the “projects” folder [3] are not the result of 
original work. Some of those notebooks are just a carbon copy of the examples 
.py (script) code from the relative `project src code` on GitHub and the 
potential offered by the notebook platform is not exploited at all. … no 
problem from my side in having them included on the OSGeo-live, but I hope the 
contributors of those notebook will make a more effective use of the tool.

Moreover, almost none of those notebooks make use of the data we already ship 
within the live…
Hint: the notebook developer (I said developer, right … not just contributors) 
can make use of one of the power of the notebook which is the capability of 
mixing python and bash in the same document . ...
eg. you need a geoJSON file? just use ‘ogr2ogr' to convert one of our ’shape 
file' to the format you need … or make a numpy loop generate user defined novel 
data .. perhaps from a nice query to PostGIS (via psycopg) … it  will take few 
lines of code and a nicely done HTML description for each step)

I agree on the term developer vs contributor.


4) which notebook to include on OSGeo-Live 9.5

trying to address this:

I'm proposing that we release just a few of the Notebooks first, seek community 
feedback on this small subset, adapt if required. But most importantly build an 
OSGeo-Live notebook community and buy in before going too wide.

while I’d love to see more people contributing to the notebook ecosystem on the 
OSGeo-live as well. In regards to the GSoC notebooks, I can’t see how possibly 
can be to apply this:

‘'' release just a few of the Notebooks ‘’’

To the GSoC’s notebooks.

If you have an idea of what the GSoC-2015 was all about (and as principal 
mentor you should!)  you will notice that those notebooks are linked together, 
they are a whole .. what you propose in the sentence above (if applied to the 
GSoC’s notebooks), sounds like publishing a book without all the pages….

As a co-mentor of the GSoC project, I have fully reviewed the Notebooks you provided and reproduced your results back then. You recently opened a PR about including them in OSGeoLive. Wearing my OSGeoLive maintainer hat, I had to re-evaluate your Notebooks in case of missing libraries, dead links etc. This process/review did not imply that your code was not working. I also have to report that the Notebooks from GSoC are working as expected in nightly builds of OSGeoLive.


In the proposed PR I already reduced the number of notebooks, I removed extra 
features or incomplete, placeholders and other notebook based on python3 (which 
is not shipped for this release)

Thank you for removing the placeholders, they were sending out the wrong impression of an incomplete work. I hope the interesting topics that got removed can be added again later with a full content.


as per vote .. I want to see my GSoC merged and that’s why I’m still working on 
the OSGeo-live.
Of course, I’m biased and I let the community decide.

Thank you for opening the motion on a different thread.


Cheers,
Massimo.

Best,
Angelos



[1] https://github.com/wenzeslaus/python-grass-addon 
<https://github.com/wenzeslaus/python-grass-addon>
[2] https://github.com/zarch/workshop-pygrass 
<https://github.com/zarch/workshop-pygrass>
[3] https://github.com/OSGeo/OSGeoLive-Notebooks/tree/master/projects 
<https://github.com/OSGeo/OSGeoLive-Notebooks/tree/master/projects>

On Mar 7, 2016, at 8:15 PM, Angelos Tzotsos <gcpp.kal...@gmail.com> wrote:

Hi,

This discussion is not only related to the work made by Massimo and other 
contributors, but has further implications on how we include new projects in 
OSGeoLive:

So far, Jupyter has been included in the development builds as a natural 
evolution of the IPython project (which was also recently included in 
OSGeoLive), so Jupyter never followed the path to be officially included [1]. 
As we speak (far past the feature freeze date) only the overview doc is 
committed so one could argue in favor of dropping Jupyter from this release 
completely...

Furthermore, as part of the Jupyter setup, a big number [2] [3] of Python 
projects/libraries were also added to OSGeoLive, as soft dependencies to 
support Notebooks, without official review or decision by the community, just 
by following volunteers' vision of what functionality should be demonstrated by 
the Notebooks. Part of this vision was to keep OSGeoLive relevant in terms of 
tools and trends in the open source geospatial world.

Now the proposal is to select a subset of Notebooks to present to the community 
in order to get feedback and build a notebook community to support further 
adoption. A natural follow-up question is: should we also select which 
supporting projects will be included? If we drop some of the Notebooks, their 
dependencies should also be dropped, right?

So how do we decide which notebooks/supporting libraries to keep? And how do we 
do that without being disrespectful to all volunteers who contributed their 
time to maintain the Jupyter/Python stack for some months now (including 
hacking notebooks, creating debian packages, mentoring or participating in the 
GSoC project)?

Here is my proposal:
1. Massimo and Brian are named official maintainers of Jupyter, as the only 
actual contributors of Notebooks [4]
2. Jupyter maintainers have to provide all the needed documentation (overview 
and quickstart) ASAP, else Jupyter is dropped from 9.5
3. Jupyter maintainers get to decide which Notebooks to include in the final 
iso.
4. If they disagree, the community votes weather to include all notebooks OR 
drop Jupyter from 9.5 release and re-evaluate for 10.0

Regards,
Angelos

[1] https://wiki.osgeo.org/wiki/Live_GIS_Disc#How_to_add_a_project_to_OSGeoLive 
<https://wiki.osgeo.org/wiki/Live_GIS_Disc#How_to_add_a_project_to_OSGeoLive>
[2] https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L30 
<https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L30>
[3] https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L43 
<https://github.com/OSGeo/OSGeoLive/blob/master/bin/install_jupyter.sh#L43>
[4] https://github.com/OSGeo/OSGeoLive-Notebooks/graphs/contributors 
<https://github.com/OSGeo/OSGeoLive-Notebooks/graphs/contributors>

On 03/08/2016 12:17 AM, Cameron Shorter wrote:
Hi all,
I should clarify my statement below, (as has been to me off list), as it might 
appear that I'm implying a lack of future, or quality of notebooks.

My comments below relate to level of external testing and size of community who 
have reviewed Massimo's notebooks.

I think that Massimo has done an excellent job pioneering notebooks within the 
OSGeo-Live framework, and these notebooks provide a great platform from which 
to demonstrate OSGeo functionality.
I think our next step is to work toward bringing a groundswell of community 
behind the development of these notebooks.

My suggested approach differs a little with that proposed by Massimo, although 
I think we are aiming toward the same long term goal (of wide adoption and 
community maintenance of Notebooks within the OSGeo-Live framework).

I'm proposing that we release just a few of the Notebooks first, seek community 
feedback on this small subset, adapt if required. But most importantly build an 
OSGeo-Live notebook community and buy in before going too wide.

This question is still unresolved within the core OSGeo-Live team, and we need 
to make a decision fast, as our Release Candidate is due next Monday 14 March. 
Opinions from our OSGeo-Live community would be greatly appreciated so we can 
make a wise decision moving forward.

Warm regards, Cameron

On 7/03/2016 10:54 pm, Cameron Shorter wrote:
Angelos, all,
I concerned about how much new material we are attempting add related to 
Jupyter notebooks, all at the last moment.

With OSGeo-Live, we have built our reputation around quality and stability, and 
I think we should be careful not to compromise that. We will attract more users 
to Jupyter notebooks if they try one excellent notebook, and look elsewhere for 
more, than if they try 10 notebooks which almost work.

So before adding a new Notebook, I suggest that it should be tested start to 
finish, and then thoroughly reviewed  by the author, and then at least one 
other person, preferably 2.

Am I right in understanding that we are currently proposing to add ~ 30 new 
notebooks? I'd be inclined to pick out 2 to 5 of these and focus on getting 
just these working.
(The remainder can be included on OSGeo-Live for testing and workshops, just 
ensure that you can only find it if provided with the correct URL)

That said, who do we have available to help test notebooks? If you can help 
out, please reply to this email, volunteering your services.

On 7/03/2016 10:34 pm, Angelos Tzotsos wrote:
2. Jupyter Notebooks: We currently have a git repository with notebooks to 
include in the final release and we also have an open pull request to merge the 
work from GSoC 2015 [5].
There is a special nightly build [6][7] including the GSoC notebooks.
We need to evaluate all our notebooks for this release and make a decision on 
the notebooks to be included.
Perhaps we need a team of volunteers to go through all notebooks and review 
them? Perhaps we need a spreadsheet listing all notebooks and their status? 
Thoughts?

--
Angelos Tzotsos, PhD
OSGeo Charter Member
http://users.ntua.gr/tzotsos <http://users.ntua.gr/tzotsos>

_______________________________________________
Live-demo mailing list
Live-demo@lists.osgeo.org <mailto:Live-demo@lists.osgeo.org>
http://lists.osgeo.org/mailman/listinfo/live-demo 
<http://lists.osgeo.org/mailman/listinfo/live-demo>
http://live.osgeo.org <http://live.osgeo.org/>
http://wiki.osgeo.org/wiki/Live_GIS_Disc 
<http://wiki.osgeo.org/wiki/Live_GIS_Disc>


--
Angelos Tzotsos, PhD
OSGeo Charter Member
http://users.ntua.gr/tzotsos

_______________________________________________
Live-demo mailing list
Live-demo@lists.osgeo.org
http://lists.osgeo.org/mailman/listinfo/live-demo
http://live.osgeo.org
http://wiki.osgeo.org/wiki/Live_GIS_Disc

Reply via email to