Dear Diane, On Tue, Sep 26, 2017 at 10:47:21PM -0700, Diane Trout wrote: > > > Since it is accepted for the R packages and the data are refering > > to R data I do not see any reason why this should not be accepted. > > I traced back from Rdatasets to the original R packages. > > Every one of the packages are licensed as some combination of GPL-2 and > GPL-3
As far as I understood CRAN only free software is permitted there. So if there is something on CRAN we should be able to distribute it license wise. (I've fought one exception (locfit), thought). > However it's likely that some of the datasets are not copyrightable. +1 > Here's my badly formatted notes of the name of the dataset > and the link to the R package documentation, and what GPL license it's > available under. > > How should this be formatted to be shipped with Debian? > > https://ghic.org/~diane/debian/statsmodels.datasets.README.txt I think regarding formatting and context its perfectly fine. > Does it go in README.source? or in upstream/metadata? or something > else? I think there is no "right" place to do it - but ftpmaster is used to read README.source in these cases. So why not using it? > I could also ask one of the campus librarians to help review the > datasets to determine if they should or shouldn't be copyrightable. I think ftpmaster is the instance who needs to decide this in our case (and due to the change of the binary packages it will be processed by them). I'd like to repeat myself: We really need to get statsmodels into testing soon since otherwise a lot of packages will removed from there and I'm specifically picky about one rdependency (metaphlan2) which requires always manual work by the release team due some complex dependency relations. So getting the statsmodels uploaded *soon* has way higher priority for me currently than beeing more picky than ftpmaster about copyrights of some datasets in documentation that could be probably read online as well since it creates work for several people. If we could upload statsmodels today (or latest tomorrow) I would immediately ping ftpmaster to check it quickly to enable us removing those pieces if they don't like it. > My current WIP patch for the documentation is here: > The Debian copyright file still needs to be updated, and the citations > need to be listed. > > https://ghic.org/~diane/debian/statsmodels-0001-add-patch-use-cached-da > tasets-and-cache-all-of-the-g.patch I `git am`-ed this to your branch (any reason to not push it?) BTW, please take over the changelog entry under your ID. I did not contribute anything relevant. The bad news is that when I tried to build the branch + your patch (as I said, please push to enable more easy testing for others) I'm running into: ... Finished /build/statsmodels-0.8.0/docs/source/examples/notebooks/generated/statespace_arma_0.html Finished /build/statsmodels-0.8.0/docs/source/examples/notebooks/generated/pca_fertility_factors.html Finished /build/statsmodels-0.8.0/docs/source/examples/notebooks/generated/ols.html Finished /build/statsmodels-0.8.0/docs/source/examples/notebooks/generated/regression_plots.html concurrent.futures.process._RemoteTraceback: """ Traceback (most recent call last): File "/usr/lib/python3.5/concurrent/futures/process.py", line 175, in _process_worker r = call_item.fn(*call_item.args, **call_item.kwargs) File "/usr/lib/python3.5/concurrent/futures/process.py", line 153, in _process_chunk return [fn(*args) for args in chunk] File "/usr/lib/python3.5/concurrent/futures/process.py", line 153, in <listcomp> return [fn(*args) for args in chunk] File "../tools/nbgenerate.py", line 97, in do_one kernel_name=kernel_name) File "../tools/nbgenerate.py", line 55, in execute_nb ep.preprocess(nb, {'metadta': {'path': 'notebooks/'}}) File "/usr/lib/python3/dist-packages/nbconvert/preprocessors/execute.py", line 141, in preprocess cwd=path) File "/usr/lib/python3/dist-packages/jupyter_client/manager.py", line 452, in start_new_kernel kc.wait_for_ready(timeout=startup_timeout) File "/usr/lib/python3/dist-packages/jupyter_client/blocking/client.py", line 120, in wait_for_ready raise RuntimeError('Kernel died before replying to kernel_info') RuntimeError: Kernel died before replying to kernel_info """ The above exception was the direct cause of the following exception: Traceback (most recent call last): File "../tools/nbgenerate.py", line 165, in <module> main() File "../tools/nbgenerate.py", line 162, in main kernel_name=args.kernel_name) File "../tools/nbgenerate.py", line 123, in do for dst in pool.map(func, nbs): File "/usr/lib/python3.5/concurrent/futures/_base.py", line 556, in result_iterator yield future.result() File "/usr/lib/python3.5/concurrent/futures/_base.py", line 398, in result return self.__get_result() File "/usr/lib/python3.5/concurrent/futures/_base.py", line 357, in __get_result raise self._exception RuntimeError: Kernel died before replying to kernel_info Makefile:62: recipe for target 'html' failed make[2]: *** [html] Error 1 make[2]: Leaving directory '/build/statsmodels-0.8.0/docs' debian/rules:35: recipe for target 'override_dh_installdocs' failed make[1]: *** [override_dh_installdocs] Error 2 make[1]: Leaving directory '/build/statsmodels-0.8.0' debian/rules:16: recipe for target 'binary' failed make: *** [binary] Error 2 dpkg-buildpackage: error: fakeroot debian/rules binary gave error exit status 2 Any idea what might be the problem here? Kind regards Andreas. -- http://fam-tille.de