Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-19 Thread David Haslam
When an apostrophe is used to make an English noun a possessive, if the noun already ends with the letter s - the apostrophe is placed after the s. There are even some rare exceptions such as the singular noun cockatrice - in which the possessive just has an apostrophe but no letter s afterwards

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-19 Thread Nathan Phillip Brink
On 2023-12-19 04:26, Matěj Cepl wrote: On Tue Dec 19, 2023 at 2:17 AM CET, Timothy Allen wrote: 2. Apostrophes In English, the apostrophe used for possession (“the boy’s train”) and omission (“don’t let’s start") is traditionally set with the same character used as the closing single quote, so

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-19 Thread Matěj Cepl
On Tue Dec 19, 2023 at 1:30 AM CET, Timothy Allen wrote: > As a data point, when I was writing scripts for manipulating and > updating the BSB module, I found the `xml.etree.ElementTree` module in > the Python standard library to be many times faster than the SAX API. > The SAX API is a perhaps

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-19 Thread Matěj Cepl
On Tue Dec 19, 2023 at 2:17 AM CET, Timothy Allen wrote: > I tried running it over my BSB module, and I hit problems fairly > quickly, some of which are more easily solved than others. > > 1. No support for language “en” > > This was easy enough to handle, there's a configuration variable near >

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Timothy Allen
On 19/12/23 00:06, Matěj Cepl wrote: I have decided not to rely on very kind help by David with his Windows tools and I have written (hopefully) completely platform neutral pure Python 3 script for checking pairwise-characters. So, far it was used only for fixing https://gitlab.com/crosswire-bibl

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Timothy Allen
On 19/12/23 01:45, Matěj Cepl wrote: 2. I use SAX API (xml.sax from the standard library) and it seems to me like better suited for the Bible processing than the traditional DOM (or LXML) interface. It nicely hides away all hard work going on in the background and let me work only on

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Kristof Szabo
Ok, all good then, we are covered, this is a different use case. On Mon, Dec 18, 2023 at 3:46 PM Matěj Cepl wrote: > On Mon Dec 18, 2023 at 2:38 PM CET, Kristof Szabo wrote: > > I wrote some time back https://github.com/krisek/sword-test, with quite > a > > few test cases, which, I think, cove

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Matěj Cepl
On Mon Dec 18, 2023 at 2:38 PM CET, Kristof Szabo wrote: > I wrote some time back https://github.com/krisek/sword-test, with quite a > few test cases, which, I think, covers your use case as well. Couple of differences on the first look: 1. Functionally, I prefer my script which stops when the fi

Re: [sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Kristof Szabo
Hi Matěj, I wrote some time back https://github.com/krisek/sword-test, with quite a few test cases, which, I think, covers your use case as well. I was in touch with Dom on this at the time, but somehow the discussion stopped how to include these in the module build pipeline. If you think it is

[sword-devel] Python script for checking pairwise characters (PROFF-OF-CONCEPT)

2023-12-18 Thread Matěj Cepl
Hello, I have decided not to rely on very kind help by David with his Windows tools and I have written (hopefully) completely platform neutral pure Python 3 script for checking pairwise-characters. So, far it was used only for fixing https://gitlab.com/crosswire-bible-society/CzeCEP/-/issues/2 and