Never mind, I think I got it.
-Original Message-
From: OBender [mailto:osya_ben...@hotmail.com]
Sent: Monday, July 20, 2009 4:42 PM
To: java-user@lucene.apache.org
Subject: RE: question on custom filter
No, it got reversed in the e-mail. Funny though, when I insert it into Excel
it
t (left to right, e.g., EREV left most then TOV). Should
a UTF-8 CSV file preserve the natural (language-specific) order of words?
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:49 PM
To: java-user@lucene.apache.org
Subject: Re: question on cus
u
:) ?
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:34 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, I think your input is incorrect. The Hebrew text you pasted
in your example appears incorrect. It's gonna be h
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 3:18 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, based on your previous comments (that you see text displayed
> in the wrong order), I again recommend that you en
I've checked, and it appears to be enabled.
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 3:18 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, based on your previous comments (that you see text display
onday, July 20, 2009 3:03 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, I ran your code and it did what I expected (but not what you pasted):
>
> First token is: (טוֹב,0,4)
> Second token is: (עֶרֶב,5,10)
>
> I also loaded up your Simp
3:03 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, I ran your code and it did what I expected (but not what you pasted):
First token is: (טוֹב,0,4)
Second token is: (עֶרֶב,5,10)
I also loaded up your SimpleWhitespaceAnalyzer in Luke, with the same results.
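
The offsets reported above can be checked without Lucene. The following is only a minimal sketch in plain Java, assuming the input is the two pointed Hebrew words separated by a single space, stored in logical order. OffsetDemo and tokenize are hypothetical names used for illustration; this is ordinary whitespace splitting, not the WhitespaceTokenizer implementation itself.

```java
import java.util.ArrayList;
import java.util.List;

public class OffsetDemo {
    /**
     * Hypothetical helper: splits on whitespace and formats each token as
     * (text,start,end), where start/end are char offsets into the input.
     */
    public static List<String> tokenize(String text) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            // Skip any whitespace before the next token.
            while (i < text.length() && Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            int start = i;
            // Consume characters up to the next whitespace or end of input.
            while (i < text.length() && !Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            if (i > start) {
                out.add("(" + text.substring(start, i) + "," + start + "," + i + ")");
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // TET VAV HOLAM BET, SPACE, AYIN SEGOL RESH SEGOL BET (logical order).
        String text = "\u05D8\u05D5\u05B9\u05D1 \u05E2\u05B6\u05E8\u05B6\u05D1";
        for (String token : tokenize(text)) {
            System.out.println(token);
        }
    }
}
```

Offsets count chars from the start of the stored string, so the word beginning with TET is at 0 to 4 and the word beginning with AYIN is at 5 to 10, regardless of how a terminal or mail client draws them.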
On M
al String fieldName, final Reader
>> reader )
>> {
>> TokenStream ts = new WhitespaceTokenizer( reader );
>> ts = new XFilter( ts );
>>
>> return ts;
>> }
>> }
>>
>> -Original M
riginal Message-
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 2:26 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, I think something in your environment / display environment
> might be causing some confu
che.org
Subject: Re: question on custom filter
Obender, I think something in your environment / display environment
might be causing some confusion.
Are you using Microsoft Windows? If so, please verify that support for
right-to-left languages is enabled [Control Panel / Regional and
Language Options].
[טוֹב עֶרֶב]
>> Then first token that filter receives is:
>> [עֶרֶב] (0,5)
>> and the second is:
>> [טוֹב] (6,10)
>> Which means that it counts from right to left (words and indexes).
>>
>> Am I missing something?
>>
>> -Original Message-
from right to left (words and indexes).
>
> Am I missing something?
>
> -Original Message-
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 1:43 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender,
Hold on a second, the phrase that you included a link to does not have its
words in the correct order!
-Original Message-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 2:07 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, This is not
ly 20, 2009 2:07 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, this is not true.
The text you pasted is the following in Unicode:
\N{HEBREW LETTER TET}
\N{HEBREW LETTER VAV}
\N{HEBREW POINT HOLAM}
\N{HEBREW LETTER BET}
\N{SPACE}
\N{HEBREW LETTER AYIN}
\N{HEBREW
s and indexes).
>
> Am I missing something?
>
> -Original Message-
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Monday, July 20, 2009 1:43 PM
> To: java-user@lucene.apache.org
> Subject: Re: question on custom filter
>
> Obender, I don't think its as dif
ssage-
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Monday, July 20, 2009 1:43 PM
To: java-user@lucene.apache.org
Subject: Re: question on custom filter
Obender, I don't think it's as difficult as you think. Your filter does
not need to be aware of this issue at all.
In Unicode, rig
Obender, I don't think it's as difficult as you think. Your filter does
not need to be aware of this issue at all.
In Unicode, right-to-left languages are encoded in the data in logical order.
The rendering system is what converts the text to right-to-left display
for RTL languages.
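
One way to see logical order independently of any display is to print the Unicode name of each code point in storage order. This is just a sketch, assuming the same two-word Hebrew string discussed in this thread, written with escapes so that no renderer can reorder it. LogicalOrderDemo and codePointNames are hypothetical names; Character.getName is the standard JDK method (Java 7 and later).

```java
import java.util.ArrayList;
import java.util.List;

public class LogicalOrderDemo {
    /** Hypothetical helper: Unicode names of the code points, in storage (logical) order. */
    public static List<String> codePointNames(String s) {
        List<String> names = new ArrayList<>();
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            names.add(Character.getName(cp));
            i += Character.charCount(cp);
        }
        return names;
    }

    public static void main(String[] args) {
        // Escapes keep the source unambiguous even in an editor that reorders RTL text.
        String text = "\u05D8\u05D5\u05B9\u05D1 \u05E2\u05B6\u05E8\u05B6\u05D1";
        List<String> names = codePointNames(text);
        for (int i = 0; i < names.size(); i++) {
            System.out.println(i + ": " + names.get(i));
        }
    }
}
```

Index 0 is the first letter a reader would pronounce, even though a right-to-left renderer draws it at the right edge. A filter that works on char offsets therefore never has to think about display direction.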
For example in Ar