Re: [sword-devel] Thai and Lucene

2005-02-15 Thread DM Smith
The analyzer that is used to index a module must be used as the analyzer to parse the search request. The analyzer that Sword is currently using is for English. As part of the distribution of Lucene, there are analyzers for Russian and German. Also, in lucene's beta sandbox there are analyzers

Re: [sword-devel] Thai and Lucene

2005-02-15 Thread Chris Little
Adrian Korten wrote: g'day, I've been wondering whether Thai would benefit from Lucene. Even if it does support utf-8, I doubt that Lucene supports Thai when no word breaks are provided. Even if it had smarts to handle Thai word-breaking like ICU, it would stumble over the Biblical words. S

Re: [sword-devel] Thai and Lucene

2005-02-14 Thread Adrian Korten
g'day, I've been wondering whether Thai would benefit from Lucene. Even if it does support utf-8, I doubt that Lucene supports Thai when no word breaks are provided. Even if it had smarts to handle Thai word-breaking like ICU, it would stumble over the Biblical words. S, I haven't tried it.