It's not quite as simple as working with the fully spelled out names. SWORD allows other alternates as well. For example, perhaps the following would work just as well for Apostle-Works:
A-W
AW
Wrks
Wrk
Wks
Wk
and any proper prefix of Apostle-Works that does not conflict with another books abbreviations:
Apostle-Work
Apostle-Wor
Apostle-Wo
Apostle-W
Apostle-
Apostle
Apostl
...
Ap

How about prefixes on both sides of the dash?
Ap-Works
Apo-Works
Ap-Wo

How about abbreviations of just one side or the other:
Apo-Wrks
Apostle-Wrk
A-Wks

In Him,
    DM


On 09/30/2010 01:24 PM, Weston Ruter wrote:
I think the fundamental problem here is that the SWORD reference parser is too simple. Namely, the parser needs to not blindly split on a hyphen character but rather tokenize the input stream and contextually determine what each token is as it processes the tokens in sequence. For example, if I had the following passage span (assuming the language has "Apostle-Works" as the book name for "Acts"):

Apostle-Works 4:32 - Romans 3:21

In this case, the parser would come across that first hyphen and could contextually determine it's not a passage span separator hyphen since the following token "Works" is not a recognized as a book, and also that "Apostle" is not a full book in itself but "Apostle-Works" is. Otherwise, there could be a pre-processor that does a first pass inspecting the token stream and replacing localized book name token sequences with their internal OSIS names and then just split on the hyphen as usual.

Does that sound right?

On Thu, Sep 30, 2010 at 9:52 AM, DM Smith <dmsm...@crosswire.org <mailto:dmsm...@crosswire.org>> wrote:

    On 09/30/2010 11:11 AM, David Troidl wrote:
    Hi Robert,

    There are many Unicode characters for hyphens and dashes.  Could
    you substitute, for example, the hyphen from General Punctuation
    (&#x2010;)?  This would give the proper appearance, without
    conflicting with the 'normal' hyphen separator.
    I think this is at core a user input problem. Telling users that
    they have to use a special character that is not on their keyboard
    is a problem. I don't think it will do at all.

    If we parse the user input to figure out whether a hyphen is a
    range specifier or part of a name and if part of a name then
    substitute it with something else, then we should add that to the
    SWORD reference parser.



    Peace,

    David

    On 9/29/2010 5:28 PM, Robert Hunt wrote:
    On 30/09/10 10:17, Greg Hellings wrote:

    OP was not talking about a transliteration from the sounds of
    his email, but rather the original language where the hyphen is
    a letter.

    You are equivalently proposing an English speaker to not use
    the letter s in the Bible names list. It might be
    comprehensible but it would be horrible usability and I
    probably wouldn't take such software seriously!

    Exactly!

    Perhaps allowing each locale to define its own numerals and
    hyphen-like character would be a good solution?

    Yes, I'm sure there's probably dozens of languages in the world
    that are likely to have hyphens in book names. Even in English,
    hyphen is a valid letter as you can see in the sentence above.
    (It's just fortunate that it doesn't occur in book names.

    Surely this issue has come up many times before???

    Robert.

    On Sep 29, 2010 4:08 PM, "Daniel Owens" <dhow...@pmbx.net
    <mailto:dhow...@pmbx.net>> wrote:
    >
    > On 09/29/2010 03:55 PM, Robert Hunt wrote:
    >> New Zealand.
    >>
    >> Hello all,
    >>
    >> I am spending today studying the documentation on the Crosswire
    >> Sword wiki so I'm likely to have a few questions. Please let
    me know
    >> if this is not the right forum to ask questions.
    >>
    >> I see in http://www.crosswire.org/wiki/DevTools:SWORD that
    >> localised book names are not allowed hyphens in them
    (because the
    >> hyphen is used for verse ranges). In the Philippine language
    that we
    >> worked with as Bible translators, the hyphen is a letter in the
    >> alphabet and appears in several book names!
    >>
    >> Is this still a current limitation? If so, what is the
    suggested
    >> work-around.
    >>
    >> Thanks,
    >> Robert.
    >>
    > This problem came up with Vietnamese, and I was just told to
    drop the
    > hyphens. The result was not ideal, but in the end it is still
    > comprehensible in Vietnamese. I think the hyphen was needed
    because
    > Vietnamese is monosyllabic, but more recent
    "transliterations" of
    > foreign names have simply dropped the hyphens. Would the
    names still be
    > comprehensible without the hyphen?
    >
    > Daniel


    _______________________________________________
    sword-devel mailing list:sword-devel@crosswire.org  
<mailto:sword-devel@crosswire.org>
    http://www.crosswire.org/mailman/listinfo/sword-devel
    Instructions to unsubscribe/change your settings at above page


    _______________________________________________
    sword-devel mailing list:sword-devel@crosswire.org  
<mailto:sword-devel@crosswire.org>
    http://www.crosswire.org/mailman/listinfo/sword-devel
    Instructions to unsubscribe/change your settings at above page


    _______________________________________________
    sword-devel mailing list: sword-devel@crosswire.org
    <mailto:sword-devel@crosswire.org>
    http://www.crosswire.org/mailman/listinfo/sword-devel
    Instructions to unsubscribe/change your settings at above page




--
Weston Ruter
http://weston.ruter.net/
@westonruter <http://twitter.com/westonruter> - Google Profile <http://www.google.com/profiles/WestonRuter#about>


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to