Re: Multi language indexing

2007-05-08 Thread bhecht
Hi Doron, Thank you very much for your time and for the detailed explanations. This is exactly what I meant, and I am happy to see I understood correctly. I am now using Snowball, which seems to work very well. Thanks again and good day, Barak Hecht.

Re: Multi language indexing

2007-05-07 Thread Doron Cohen
bhecht <[EMAIL PROTECTED]> wrote on 07/05/2007 10:26:27:
> I have implemented my own analyzer for each country.
> So as I see it, when I index these records, I want to
> provide lucene, with a specific analyzer per record
> i'm indexing.
>
> When a user performs a query in my JSF form, I will
> us

Re: Multi language indexing

2007-05-07 Thread bhecht
Sorry, I didn't understand that I need to use the PerFieldAnalyzerWrapper for this task, and tried to index the document twice. Sorry for the previous post. Thanks for the great help. But since you asked, I will be happy to explain what my goal is, and maybe see if I'm approaching this correctly

Re: Multi language indexing

2007-05-07 Thread karl wettin
On 7 May 2007 at 15:45, bhecht wrote:
> OK, thanks, I think I got it. Just to see if I understood correctly: When I do the search on both stemmed and unstemmed fields, I will do the following: 1) If I know the country of the requested search - I will use the stemmed analyzer, and then the

Re: Multi language indexing

2007-05-07 Thread bhecht
OK, thanks, I think I got it. Just to see if I understood correctly: When I do the search on both stemmed and unstemmed fields, I will do the following: 1) If I know the country of the requested search - I will use the stemmed analyzer, and then the unstemmed field

Re: Multi language indexing

2007-05-07 Thread karl wettin
On 7 May 2007 at 13:27, bhecht wrote:
> The last option seems to be the right one for me, using a stemmed and unstemmed field. I assume when you mean "unstemmed", you mean indexing the field using the UN_TOKENIZED parameter.

No, I mean TOKENIZED, but not using a stemmer analyzer.

-- karl
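The distinction karl draws here can be illustrated with a minimal sketch in plain Java, with no Lucene dependency. Everything in it is an assumption made for illustration: the class `StemmedUnstemmedSketch`, its method names, and the trailing-"s" rule, which is only a toy stand-in for a real stemmer such as Snowball. It is not Lucene's API; it only shows which terms each of the three options would put in the index.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; none of these names come from Lucene.
class StemmedUnstemmedSketch {

    // "Untokenized": the whole field value is kept as one single term.
    static List<String> untokenized(String value) {
        return Collections.singletonList(value);
    }

    // "Tokenized, unstemmed": lowercase, split into words, no stemming.
    static List<String> tokenized(String value) {
        List<String> terms = new ArrayList<>();
        for (String t : value.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // Toy stemmer (assumption): strips one trailing "s" from longer words.
    static String toyStem(String term) {
        if (term.length() > 3 && term.endsWith("s")) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    // "Tokenized and stemmed": same tokens, each passed through the toy stemmer.
    static List<String> tokenizedAndStemmed(String value) {
        List<String> terms = new ArrayList<>();
        for (String t : tokenized(value)) {
            terms.add(toyStem(t));
        }
        return terms;
    }
}
```

With the value "Main Streets", the untokenized form can only be matched by the exact full string, while the tokenized forms can be matched word by word; the stemmed form additionally conflates "streets" with "street".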

Re: Multi language indexing

2007-05-07 Thread bhecht
OK, thanks for the reply. The last option seems to be the right one for me, using a stemmed and unstemmed field. I assume when you mean "unstemmed", you mean indexing the field using the UN_TOKENIZED parameter. Now my problem starts, when trying to implement this with "Hibernate Search", which al

Re: Multi language indexing

2007-05-07 Thread karl wettin
On 7 May 2007 at 12:16, bhecht wrote:
> My question regarding "the way to go", was if it is a good solution to index a content of a table, using more than 1 analyzer, determining the analyzer by the country value of each record.

I'm not sure what you mean, but I'll try. Do you ask if it makes

Re: Multi language indexing

2007-05-07 Thread bhecht
I know indexing and searching need to use the same analyzer. My question regarding "the way to go" was whether it is a good solution to index the content of a table using more than one analyzer, choosing the analyzer by the country value of each record. Couldn't find a post that describes exactly my

Re: Multi language indexing

2007-05-07 Thread karl wettin
On 7 May 2007 at 10:02, bhecht wrote:
> This means I index and search using the same analyzer. I was interested to know if this is the way to go?

That would be the way to go (unless you are really sure what you're doing).

-- karl
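Why the same analyzer matters at index time and at search time can be sketched with a toy inverted index in plain Java. This is an illustration under stated assumptions, not Lucene code: the class `SameAnalyzerSketch`, the `plain` and `stemming` analyzers, and the trailing-"s" stemming rule are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

// Hypothetical toy index; none of these names come from Lucene.
class SameAnalyzerSketch {

    // Analyzer 1 (assumption): lowercase and split on whitespace.
    static List<String> plain(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).trim().split("\\s+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // Analyzer 2 (assumption): like plain, plus a toy stemmer that strips a trailing "s".
    static List<String> stemming(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : plain(text)) {
            terms.add(t.length() > 3 && t.endsWith("s") ? t.substring(0, t.length() - 1) : t);
        }
        return terms;
    }

    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private final Function<String, List<String>> indexAnalyzer;

    SameAnalyzerSketch(Function<String, List<String>> indexAnalyzer) {
        this.indexAnalyzer = indexAnalyzer;
    }

    // Index a document: every term produced by the index-time analyzer points to docId.
    void add(int docId, String text) {
        for (String term : indexAnalyzer.apply(text)) {
            postings.computeIfAbsent(term, k -> new HashSet<>()).add(docId);
        }
    }

    // Return the documents that contain every term produced by the query-time analyzer.
    Set<Integer> search(String query, Function<String, List<String>> queryAnalyzer) {
        Set<Integer> result = null;
        for (String term : queryAnalyzer.apply(query)) {
            Set<Integer> docs = postings.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new HashSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        return result == null ? new HashSet<>() : result;
    }
}
```

If "Main Streets" is indexed with the stemming analyzer, the index holds the term "street". A query for "streets" analyzed the same way is reduced to "street" and finds the document; the same query analyzed with the plain analyzer looks up "streets" and finds nothing.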