;ژ"/"zhe"/U+698 with the letter
> > > > > > > > > > "ز"/"ze"/U+632, which has just one dot over it.
> > > > > > > > > >
> > > > > > > > > > Unless you were mistaken in all of your emails when
> > > > > you included
> > > &
e", then what I said
>> in my
>> > > > > > > > previous email still stands: there is no problem here.
>> > > > >
Hi Esra,
On 05/07/2008 at 11:49 AM, Steven A Rowe wrote:
> At Chris Hostetter's suggestion, I am rewriting the patch
> attached to LUCENE-1279, including the following changes:
>
> - Merged the contents of the CollatingRangeQuery class into
> RangeQuery and RangeFilter
> - Switched the Locale par
cross posting, but why the word 'Farsi' instead of
'Persian'? No one says Lucnce français or Español, or Deutsch -
so why
Farsi?
Please read the following article, I found it quite enlightening.
http://www.cais-soas.com/CAIS/Languages/persian_not_farsi.htm
PV
-- View thi
ointer. Knowledge is good.
>
> Steve
>
> On 05/07/2008 at 2:54 AM, Vizzini wrote:
>>
>> Sorry for cross posting, but why the word 'Farsi' instead of
>> 'Persian'? No one says Lucnce français or Español, or Deutsch - so why
>> Farsi?
&
good.
Steve
On 05/07/2008 at 2:54 AM, Vizzini wrote:
>
> Sorry for cross posting, but why the word 'Farsi' instead of
> 'Persian'? No one says Lucnce français or Español, or Deutsch - so why Farsi?
>
> Please read the following article, I found it quite enlighte
esra wrote:
> > > > > > > > >
> > > > > > > > > Hi Steven,
> > > > > > > > >
> > > > > > > > > sorry i made a mistake. unicodes are like this:
> > > > > > > > >
> > > > > > > &
-
View this message in context:
http://www.nabble.com/lucene-farsi-problem-tp16977096p17098552.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additio
gt; > > > > > > ژ = U+632
>> > > > > > > > and the first letter of "ساب ووفر " is س = U+633
>> > > > > > >
>> > > > > > > you can also check them here
>> > > > > > > >
>>
t; > > > > > Esra
> > > > > > >
> > > > > > >
> > > > > > > Steven A Rowe wrote:
> > > > > > > >
> > > > > > > > Hi Esra,
> > > > > > > >
> &g
Esra,
>> > > > > >
>> > > > > > Going back to the original problem statement, I see something
>> that
>> > > > > > looks illogical to me - please correct me if I'm wrong:
>> > > > > >
>> > &g
t; > On Apr 30, 2008, at 3:21 AM, esra wrote:
> > > > > > > i am using lucene's "IndexSearcher" to search the given xml by
> > > > > > > keyword which contains farsi information. while searching i use
> > > > > > > ra
t; > > > while searching i use ranges like
>> > > > >
>> > > > > آ-ث | ج-خ | د-ژ | س-ظ | ع-ق | ک-ل | م-ی
>> > > > >
>> > > > > when i do search for "د-ژ" range the results are wrong , they
>
> are the results of " س-ظ "range.
> > > > >
> > > > > for example when i do search for "د-ژ" one of the the results is
> > > > > "ساب ووفر", this result also shown on the " س-ظ " range's
gt;
>> > > As IndexSearcher use "compareTo" method and this method uses
>> > > unicodes for comparing, i found the unicodes of the characters.
>> > >
>> > > د=U+62F
>> > > ژ = U+698
>> > >
3
> >
> > It appears to me that *both* the "د-ژ" range [ U+062F - U+0698 ] and
> > the "س-ظ" range [ U+0633 - U+0638 ] contain the first letter of "ساب
> > ووفر", which is "س" = U+0633.
> >
> > You stated that U+0633 should be contained i
the
> "س-ظ" range [ U+0633 - U+0638 ] contain the first letter of "ساب ووفر",
> which is "س" = U+0633.
>
> You stated that U+0633 should be contained in the [ U+0633 - U+0638 ]
> range - I agree - but why do you think U+0633 should not be cont
re
analyzers
for
different languages ,
will this be usefull if so do you know where to find a farsi
analyzer?
I would bu glad if you help.
thanks ,
Esra
--
View this message in context:
http://www.nabble.com/lucene-farsi-problem-
tp16977096p16977096.html
Sent from the Lucene - Java Users m
Hi Esra,
Going back to the original problem statement, I see something that looks
illogical to me - please correct me if I'm wrong:
On Apr 30, 2008, at 3:21 AM, esra wrote:
> i am using lucene's "IndexSearcher" to search the given xml by
> keyword which contains farsi information.
> while search
n on the " س-ظ " range's result list which
>>>> is the
>>>> corret range.
>>>>
>>>> As IndexSearcher use "compareTo" method and this method uses
>>>> unicodes for
>>>> comparing, i found the unicodes
quot; method and this method uses
>> unicodes for
>> comparing, i found the unicodes of the characters.
>>
>> د=U+62F
>> ژ = U+698
>> and the first letter of "ساب ووفر " is س = U+633
>>
>> Do you
On 04/30/2008 at 12:50 PM, Steven A Rowe wrote:
> Caveat: I don't speak, read, write, or dream in Farsi - I
> just know that it mostly shares its orthography with Arabic,
> and that they are both written and read right-to-left.
>
> How are you constructing the queries? Using QueryParser? If
> so
>
> Do you have any idea how to solve this problem, there are
> analyzers for
> different languages ,
> will this be usefull if so do you know where to find a farsi analyzer?
>
> I would bu glad if you help.
>
> thanks ,
>
> Esra
>
> -- View this mess
33
Do you have any idea how to solve this problem, there are analyzers
for
different languages ,
will this be usefull if so do you know where to find a farsi
analyzer?
I would bu glad if you help.
thanks ,
Esra
--
View this message in context:
http://www.nabble.com/lucene
method uses
>> unicodes for
>> comparing, i found the unicodes of the characters.
>>
>> د=U+62F
>> ژ = U+698
>> and the first letter of "ساب ووفر " is س = U+633
>>
>> Do you have any idea how to solve this problem, there are analyzers
>>
problem, there are analyzers
for
different languages ,
will this be usefull if so do you know where to find a farsi analyzer?
I would bu glad if you help.
thanks ,
Esra
--
View this message in context:
http://www.nabble.com/lucene-fa
a farsi analyzer?
I would bu glad if you help.
thanks ,
Esra
--
View this message in context:
http://www.nabble.com/lucene-farsi-problem-tp16977096p16977096.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
27 matches
Mail list logo