really weird. do I need to open a Sword bug? (it might be better if  you
could, as you could explain it better than I could, but you're a busy
man...)


On Mon, Apr 14, 2025 at 3:00 PM Karl Kleinpaste <k...@kleinpaste.org> wrote:

> On 3/29/25 13:30, Fred wrote:
>
> So I fired up Xiphos, installed the 3 greek modules from Crosswire and did
> some searches.
> Oddly enough, doing a search for that word, using "exact phrase" says it
> can be found in Matt 8:17, Mark 13:35, II Cor 6:18, II Tim 1:5, Titus 3:9
> and 1 Peter 4:19.
>
> doing the same search in the tischmorph module turns up only the ones in
> Rev and II Cor.
>
> So, looked up the matthew reference and find that word isn't there!
> ...
> Two oddities here, anybody got any clues?
>
>
> Sorry for being tardy about this, another instance of "I marked this for
> later, then didn't notice when 'later' came and went..."
>
> The short, useless answer is "I don't have any good explanation."
>
> TischMorph:
> When I search lemma:G3841, I get 10 verses:
> 2Cor 6:18; Rev 1:8; 4:8; 11:17; 15:3; 16:7; 16:14; 19:6; 19:15; 21:22
> Of that, Rev 16:14, 19:15 are included because they are ref'd by the same
> Strong's but in the form παντοκρατορος.
>
> When I search παντοκρατωρ, I get:
> lucene: nothing?
> phrase, regex: The usual 8 verses (above, excluding the 2 alternates).
> Certainly, I expect lucene search should cough up the same 8.
>
> When I adjust the search term to the more general παντοκρ*, then lucene
> search gives me what I expect. ???
>
> I have no idea how you're getting Matt/Mark/2Tim/Tit/1Pet references. I
> don't see that.
>
> However, when I use diatheke:
> diatheke -b TischMorph -s lucene -k παντοκρατωρ | sed -e 's/8R/8 ; R/' -e
> 's/II/; II/' | semis | sort
> ("sed" is to fix some rough output from diatheke)
> then I get the usual 8 verses.
>
> In 2TGreek:
> lucene: nothing?
> regex: The usual 8.
> phrase: The usual 8 plus Mat 8:17; Mark 13:35; 2Tim 1:5, Tit 3:9?!? But no
> 1Pet reference.
> This is freakish.
>
> Again, adjust search to use παντοκρ*, I now get a proper set including the
> extras from παντοκρατορος (10 verses).
> How is this distinguished?
>
> I cannot begin to explain any of this. Surely, lucene search should be
> returning the proper set on a no-wildcard single word. This suggests to me
> that there is something funny about how the lucene index is being handled.
> "Exact phrase" adding verses that are manifestly not to be included is a
> different sort of problem.
>
> The bottom line for Xiphos is that it turns off Greek Accents and uses
> StripText on search terms (for consistency of "no accents that would create
> false differences"), then executes the search. What Xiphos displays is what
> the engine sends back. Xiphos has no more control than that.
>
> I'm thoroughly mystified by all of this, especially since I can see that I
> (being in Xiphos' code) have nothing additional I can do about how the
> search behaves. The code just prepares the search terms and hands it off
> for execution.
>
> I rebuilt lucene indices to ensure there was nothing funny from a possibly
> old index generation.
>
> How can παντοκρατωρ and παντοκρ* produce different lucene results?
> _______________________________________________
> sword-devel mailing list: sword-devel@crosswire.org
> http://crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to