RE: Lucene, HTML and Hebrew

2008-01-30 Thread Itamar Syn-Hershko
OK, I've been processing things for a while. I came up with an idea that I want your advice on -- is there a way I could stem the Hebrew words in my analyzer yet keep a note of some sort of the original term which was assembled by this stem, WITHOUT affecting frequency/proximity data? This is I gu

RE: Lucene, HTML and Hebrew

2008-01-24 Thread Steven A Rowe
Hi Itamar, On 01/24/2008 at 2:55 PM, Itamar Syn-Hershko wrote: > > Lucene does not store proximity relations between data in different > > fields, only within individual fields > > So are 2 calls for doc->add with the same field but different > texts are considered as 1 field (latter call being i

RE: Lucene, HTML and Hebrew

2008-01-24 Thread Itamar Syn-Hershko
l Message- From: Steven A Rowe [mailto:[EMAIL PROTECTED] Sent: Wednesday, January 23, 2008 1:06 AM To: java-user@lucene.apache.org Subject: RE: Lucene, HTML and Hebrew Hi Itamar, In another thread, you wrote: > Yesterday I sent an email to this group querying about some very > impor

RE: Lucene, HTML and Hebrew

2008-01-22 Thread Steven A Rowe
On 01/22/2008 at 8:49 PM, Grant Ingersoll wrote: > On Jan 22, 2008, at 6:06 PM, Steven A Rowe wrote: > > On 01/21/2008 at 2:59 PM, Itamar Syn-Hershko wrote: > > > 2) How would I set the boosts for the headers and footnotes? > > > I'd rather have it stored within the index file than have to > > > ap

Re: Lucene, HTML and Hebrew

2008-01-22 Thread Grant Ingersoll
On Jan 22, 2008, at 6:06 PM, Steven A Rowe wrote: 2) How would I set the boosts for the headers and footnotes? I'd rather have it stored within the index file than have to append it to each and every query I will execute, but I'm open to suggestions. I'm more interested in performance and flex

RE: Lucene, HTML and Hebrew

2008-01-22 Thread Steven A Rowe
Hi Itamar, In another thread, you wrote: > Yesterday I sent an email to this group querying about some > very important (to me...) features of Lucene. I'm giving it > another chance before it goes unnoticed or forgotten. If it > was too long please let me know and I will email a shorter > list of

Lucene, HTML and Hebrew

2008-01-21 Thread Itamar Syn-Hershko
Hi all, I'm starting in the process of creating Hebrew support for Lucene. Specifically I'm using Clucene (which is an awesome and strong port), but that shouldn't matter for my questions. Please, if you know of any info or similar project let me know, it can save me loads of time and headaches.