RE: question on custom filter

2009-07-20 Thread OBender
Never mind, I think I got it. -Original Message- From: OBender [mailto:osya_ben...@hotmail.com] Sent: Monday, July 20, 2009 4:42 PM To: java-user@lucene.apache.org Subject: RE: question on custom filter No, it reversed in the e-mail. Funny though, when I insert it in to the Excel it

RE: question on custom filter

2009-07-20 Thread OBender
t (left to right, e.g., EREV left most then TOV). Should UTF-8 CSV file preserve the natural (language specific) order of words? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:49 PM To: java-user@lucene.apache.org Subject: Re: question on cus

RE: question on custom filter

2009-07-20 Thread OBender
u :) ? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:34 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, I think your input is incorrect. The hebrew text you pasted in your example appears incorrect. Its gonna be h

Re: question on custom filter

2009-07-20 Thread Robert Muir
From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 3:18 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, based on your previous comments (that you see text displayed > in the wrong order), I again recommend that you en

RE: question on custom filter

2009-07-20 Thread OBender
I've checked, and it appears to be enabled. -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:18 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, based on your previous comments (that you see text display

Re: question on custom filter

2009-07-20 Thread Robert Muir
onday, July 20, 2009 3:03 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, i ran your code and it did what I expected (but not what you pasted): > > First token is: (טוֹב,0,4) > Second token is: (עֶרֶב,5,10) > > I also loaded up your Simp

RE: question on custom filter

2009-07-20 Thread OBender
3:03 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, i ran your code and it did what I expected (but not what you pasted): First token is: (טוֹב,0,4) Second token is: (עֶרֶב,5,10) I also loaded up your SimpleWhitespaceAnalyzer in Luke, with the same results. On M

Re: question on custom filter

2009-07-20 Thread Robert Muir
al String fieldName, final Reader >> reader ) >>        { >>                TokenStream ts  = new WhitespaceTokenizer( reader ); >>                ts                      = new XFilter( ts ); >> >>                return ts; >>        } >> } >> >> -Original M

Re: question on custom filter

2009-07-20 Thread Robert Muir
riginal Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 2:26 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, I think something in your environment / display environment > might be causing some confu

RE: question on custom filter

2009-07-20 Thread OBender
che.org Subject: Re: question on custom filter Obender, I think something in your environment / display environment might be causing some confusion. Are you using microsoft windows? If so, please verify that support for right-to-left languages is enabled [control panel/regional and language options].

Re: question on custom filter

2009-07-20 Thread Robert Muir
[טוֹב עֶרֶב] >> Then first token that filter receives is: >> [עֶרֶב] (0,5) >> and the second is: >> [טוֹב] (6,10) >> Which means that it counts from right to left (words and indexes). >> >> Am I missing something? >> >> -Original Message-

RE: question on custom filter

2009-07-20 Thread OBender
from right to left (words and indexes). > > Am I missing something? > > -Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 1:43 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender,

RE: question on custom filter

2009-07-20 Thread OBender
Hold on a second, the phrase that you included link to is not in the correct order of words! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 2:07 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, This is not

RE: question on custom filter

2009-07-20 Thread OBender
ly 20, 2009 2:07 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, This is not true. the text you pasted is the following in unicode: \N{HEBREW LETTER TET} \N{HEBREW LETTER VAV} \N{HEBREW POINT HOLAM} \N{HEBREW LETTER BET} \N{SPACE} \N{HEBREW LETTER AYIN} \N{HEBREW

Re: question on custom filter

2009-07-20 Thread Robert Muir
s and indexes). > > Am I missing something? > > -Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 1:43 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, I don't think its as dif

RE: question on custom filter

2009-07-20 Thread OBender
ssage- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 1:43 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, I don't think its as difficult as you think. Your filter does not need to be aware of this issue at all. In unicode, rig

Re: question on custom filter

2009-07-20 Thread Robert Muir
Obender, I don't think its as difficult as you think. Your filter does not need to be aware of this issue at all. In unicode, right-to-left languages are encoded in the data in logical order. The rendering system is what converts it to display in right-to-left for RTL languages. For example in Ar