On Sun Jun 2, 2024 at 1:09 AM CEST, Arnaud Vié wrote:
> I'm open to any kind of feedback or suggestions of course !
> In particular :
>
>    - if you have any specific website in mind that you would like to be
>    able to build sword modules from, let me know, we can try to add it.
>    (Currently I only included a few French websites, but I'm interested to add
>    some other languages).

Sword module CzeBKR is sourced from the Czech WikiSource [1]
and there seems to be the official way [2] how to get source
in some hopefully more useful formats (plain text, RTF, HTML,
EPubs). I was using my own home-grown Python script [3], but it
seems like with all web-scrapping scripts it rotten away (that
script is under some of kind of very free open source license,
let’s say MIT/X11 … I am going to add the proper LICENSE file
momentarily). It started at [4] (look at the source view), but it
doesn’t seem to be that useful anymore.

>    - And if you are knowledgeable about the intellectual property laws in
>    other countries, I'm interested : currently, I've added a section to the
>    README explaining why the usage of the scraper on any public website is
>    allowed in France with references to the related texts, but it would
>    probably be useful to have similar information for users from other
>    countries.

I am absolutely certain, there are no problems with CzeBKR:

    1. It is WikiSource, so we have somebody else to blame ;)
    2. The original Bible of Kralice [5] is from the sixteenth
       century and it is absolutely in the public domain.
    3. Source for the WikiSource was a scan [6] of the book
       from 1918, without any authors shown. The works of only
       possible editor of that Bible I know about [7] (and he is
       not shown on the title page, but he was working in the
       early 20th century with the International Bible Society on
       the revision of the Bible) are under the Bern Convention
       (death in 1929 + 75 years) in the public domain as well.
    4. We are in EU as well.

If you want to use CzeBKR as your test case, I am ready to help
you with any testing or Czech issues or whatever.

Blessed Sunday!

Matěj

[1] https://cs.wikisource.org/wiki/Bible_kralick%C3%A1_(1918) 
[2] https://ws-export.wmcloud.org/?lang=cs&title=Bible_kralick%C3%A1_%281918%29
[3] https://gitlab.com/crosswire-bible-society/CzeBKR/-/blob/master/kralicka.py
[4] 
https://cs.wikisource.org/wiki/Speci%C3%A1ln%C3%AD:Exportovat_str%C3%A1nky/Bible_kralick%C3%A1_(1918)
[5] https://en.wikipedia.org/wiki/Bible_of_Kralice
[6] http://archive.org/details/biblsvatanebvec00socigoog
[7] https://cs.wikipedia.org/wiki/Jan_Karafi%C3%A1t
-- 
http://matej.ceplovi.cz/blog/, @mcepl@floss.social
GPG Finger: 3C76 A027 CA45 AD70 98B5  BC1D 7920 5802 880B C9D8
 
The ratio of literacy to illiteracy is a constant, but nowadays
the illiterates can read.
    -- Alberto Moravia

Attachment: E09FEF25D96484AC.asc
Description: application/pgp-keys

Attachment: signature.asc
Description: PGP signature

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to