Re: [sword-devel] Soft hyphens?

2017-04-01 Thread David Haslam
Someone once developed an algorithm called *KUCut* to insert zero width spaces into Thai text. Not sure of the current state of play, but I do know that the text used as the test bed for machine learning was the *ThaiKJV* of Philip Pope, which was the source text for our module. An unrelated disc

Re: [sword-devel] Soft hyphens?

2017-04-01 Thread DM Smith
Can Lucene code be improved? Short answer: No. Long answer: I’ve suggested improvements in the past when it was felt that JSword and SWORD should be able to use the same Lucene indexes. Going from memory, the argument against any change was that a mechanism would be needed to know when the index

Re: [sword-devel] Soft hyphens?

2017-04-01 Thread David Haslam
Interesting. Question prompted by an addition to /Tentative suggestions/ in https://crosswire.org/wiki/CrossWire_KJV#KJV_module: Can the Lucene code be improved ? David -- View this message in context: http://sword-dev.350566.n4.nabble.com/Soft-hyphens-tp4657045p4657048.html Sent from the S

Re: [sword-devel] Is the wiki down or the whole server?

2017-04-01 Thread DM Smith
Was just getting to it. Glad its up now. > On Apr 1, 2017, at 8:43 AM, David Haslam wrote: > > Maybe it's a traffic vs bandwidth problem. > > It started working again soon afterwards. > > David > > > > -- > View this message in context: > http://sword-dev.350566.n4.nabble.com/Is-the-wiki-

Re: [sword-devel] Soft hyphens?

2017-04-01 Thread DM Smith
SWORD uses Lucene’s StandardAnalyzer which in turn uses WhitespaceTokenizer. It doesn’t use WordDelimiterFilter. As such it doesn’t handle hyphenated words well, including soft hyphen. In Him, DM > On Apr 1, 2017, at 8:56 AM, David Haslam wrote: > > Does SWORD search using Lucene igno

[sword-devel] Soft hyphens?

2017-04-01 Thread David Haslam
Does SWORD search using Lucene ignore the presence of a soft hyphen in any word? i.e. If the user searches for 'violence' and the word in the text was 'vio­lence' would it be found? NB. The second instance contains a soft hyphen \xAD between 'vio' and 'lence'. Best regards, David -- View thi

Re: [sword-devel] Is the wiki down or the whole server?

2017-04-01 Thread David Haslam
Maybe it's a traffic vs bandwidth problem. It started working again soon afterwards. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/Is-the-wiki-down-or-the-whole-server-tp4657043p4657044.html Sent from the SWORD Dev mailing list archive at Nabble.com. __

[sword-devel] Is the wiki down or the whole server?

2017-04-01 Thread David Haslam
I cannot access the developers' wiki right now. It was just after I had reviewed an edit. Please check asap. Thanks. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/Is-the-wiki-down-or-the-whole-server-tp4657043.html Sent from the SWORD Dev mailing list archive