RE: lucene farsi problem

2008-05-11 Thread Steven A Rowe
;ژ"/"zhe"/U+698 with the letter > > > > > > > > > > "ز"/"ze"/U+632, which has just one dot over it. > > > > > > > > > > > > > > > > > > > > Unless you were mistaken in all of your emails when > > > > > you included > > > &

RE: lucene farsi problem

2008-05-10 Thread esra
e", then what I said >> in my >> > > > > > > > previous email still stands: there is no problem here. >> > > > >

RE: lucene farsi problem

2008-05-09 Thread Steven A Rowe
Hi Esra, On 05/07/2008 at 11:49 AM, Steven A Rowe wrote: > At Chris Hostetter's suggestion, I am rewriting the patch > attached to LUCENE-1279, including the following changes: > > - Merged the contents of the CollatingRangeQuery class into > RangeQuery and RangeFilter > - Switched the Locale par

Re: lucene farsi problem

2008-05-08 Thread Grant Ingersoll
cross posting, but why the word 'Farsi' instead of 'Persian'? No one says Lucnce français or Español, or Deutsch - so why Farsi? Please read the following article, I found it quite enlightening. http://www.cais-soas.com/CAIS/Languages/persian_not_farsi.htm PV -- View thi

RE: lucene farsi problem

2008-05-08 Thread Vizzini
ointer. Knowledge is good. > > Steve > > On 05/07/2008 at 2:54 AM, Vizzini wrote: >> >> Sorry for cross posting, but why the word 'Farsi' instead of >> 'Persian'? No one says Lucnce français or Español, or Deutsch - so why >> Farsi? &

RE: lucene farsi problem

2008-05-07 Thread Steven A Rowe
good. Steve On 05/07/2008 at 2:54 AM, Vizzini wrote: > > Sorry for cross posting, but why the word 'Farsi' instead of > 'Persian'? No one says Lucnce français or Español, or Deutsch - so why Farsi? > > Please read the following article, I found it quite enlighte

RE: lucene farsi problem

2008-05-07 Thread Steven A Rowe
esra wrote: > > > > > > > > > > > > > > > > > > Hi Steven, > > > > > > > > > > > > > > > > > > sorry i made a mistake. unicodes are like this: > > > > > > > > > > > > > > > > &

Re: lucene farsi problem

2008-05-06 Thread Vizzini
- View this message in context: http://www.nabble.com/lucene-farsi-problem-tp16977096p17098552.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additio

RE: lucene farsi problem

2008-05-06 Thread esra
gt; > > > > > > ژ = U+632 >> > > > > > > > and the first letter of "ساب ووفر " is س = U+633 >> > > > > > > >> > > > > > > you can also check them here >> > > > > > > > >>

RE: lucene farsi problem

2008-05-04 Thread Steven A Rowe
t; > > > > > Esra > > > > > > > > > > > > > > > > > > > > > Steven A Rowe wrote: > > > > > > > > > > > > > > > > Hi Esra, > > > > > > > > > &g

RE: lucene farsi problem

2008-05-03 Thread esra
Esra, >> > > > > > >> > > > > > Going back to the original problem statement, I see something >> that >> > > > > > looks illogical to me - please correct me if I'm wrong: >> > > > > > >> > &g

RE: lucene farsi problem

2008-05-02 Thread Steven A Rowe
t; > On Apr 30, 2008, at 3:21 AM, esra wrote: > > > > > > > i am using lucene's "IndexSearcher" to search the given xml by > > > > > > > keyword which contains farsi information. while searching i use > > > > > > > ra

RE: lucene farsi problem

2008-05-02 Thread esra
t; > > > while searching i use ranges like >> > > > > >> > > > > آ-ث | ج-خ | د-ژ | س-ظ | ع-ق | ک-ل | م-ی >> > > > > >> > > > > when i do search for "د-ژ" range the results are wrong , they >

RE: lucene farsi problem

2008-05-02 Thread Steven A Rowe
> are the results of " س-ظ "range. > > > > > > > > > > for example when i do search for "د-ژ" one of the the results is > > > > > "ساب ووفر", this result also shown on the " س-ظ " range's

RE: lucene farsi problem

2008-05-02 Thread esra
gt; >> > > As IndexSearcher use "compareTo" method and this method uses >> > > unicodes for comparing, i found the unicodes of the characters. >> > > >> > > د=U+62F >> > > ژ = U+698 >> > >

RE: lucene farsi problem

2008-05-02 Thread Steven A Rowe
3 > > > > It appears to me that *both* the "د-ژ" range [ U+062F - U+0698 ] and > > the "س-ظ" range [ U+0633 - U+0638 ] contain the first letter of "ساب > > ووفر", which is "س" = U+0633. > > > > You stated that U+0633 should be contained i

RE: lucene farsi problem

2008-05-02 Thread esra
the > "س-ظ" range [ U+0633 - U+0638 ] contain the first letter of "ساب ووفر", > which is "س" = U+0633. > > You stated that U+0633 should be contained in the [ U+0633 - U+0638 ] > range - I agree - but why do you think U+0633 should not be cont

Re: lucene farsi problem

2008-05-01 Thread Grant Ingersoll
re analyzers for different languages , will this be usefull if so do you know where to find a farsi analyzer? I would bu glad if you help. thanks , Esra -- View this message in context: http://www.nabble.com/lucene-farsi-problem- tp16977096p16977096.html Sent from the Lucene - Java Users m

RE: lucene farsi problem

2008-05-01 Thread Steven A Rowe
Hi Esra, Going back to the original problem statement, I see something that looks illogical to me - please correct me if I'm wrong: On Apr 30, 2008, at 3:21 AM, esra wrote: > i am using lucene's "IndexSearcher" to search the given xml by > keyword which contains farsi information. > while search

Re: lucene farsi problem

2008-05-01 Thread esra
n on the " س-ظ " range's result list which >>>> is the >>>> corret range. >>>> >>>> As IndexSearcher use "compareTo" method and this method uses >>>> unicodes for >>>> comparing, i found the unicodes

RE: lucene farsi problem

2008-05-01 Thread esra
quot; method and this method uses >> unicodes for >> comparing, i found the unicodes of the characters. >> >> د=U+62F >> ژ = U+698 >> and the first letter of "ساب ووفر " is س = U+633 >> >> Do you

RE: lucene farsi problem

2008-04-30 Thread Steven A Rowe
On 04/30/2008 at 12:50 PM, Steven A Rowe wrote: > Caveat: I don't speak, read, write, or dream in Farsi - I > just know that it mostly shares its orthography with Arabic, > and that they are both written and read right-to-left. > > How are you constructing the queries? Using QueryParser? If > so

RE: lucene farsi problem

2008-04-30 Thread Steven A Rowe
> > Do you have any idea how to solve this problem, there are > analyzers for > different languages , > will this be usefull if so do you know where to find a farsi analyzer? > > I would bu glad if you help. > > thanks , > > Esra > > -- View this mess

Re: lucene farsi problem

2008-04-30 Thread Grant Ingersoll
33 Do you have any idea how to solve this problem, there are analyzers for different languages , will this be usefull if so do you know where to find a farsi analyzer? I would bu glad if you help. thanks , Esra -- View this message in context: http://www.nabble.com/lucene

Re: lucene farsi problem

2008-04-30 Thread esra
method uses >> unicodes for >> comparing, i found the unicodes of the characters. >> >> د=U+62F >> ژ = U+698 >> and the first letter of "ساب ووفر " is س = U+633 >> >> Do you have any idea how to solve this problem, there are analyzers >>

Re: lucene farsi problem

2008-04-30 Thread Grant Ingersoll
problem, there are analyzers for different languages , will this be usefull if so do you know where to find a farsi analyzer? I would bu glad if you help. thanks , Esra -- View this message in context: http://www.nabble.com/lucene-fa

lucene farsi problem

2008-04-30 Thread esra
a farsi analyzer? I would bu glad if you help. thanks , Esra -- View this message in context: http://www.nabble.com/lucene-farsi-problem-tp16977096p16977096.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. -