Re: Modelling relational data in Lucene Index?

2006-11-06 Thread KEGan
Hi, I am actually doing something what the original poster mentioned. Previously, I have use Hibernate and Lucene. But I found that for my particular project my data is quite flat, so in the next version I totally take out Hibernate (and the complexity with it :)) and use Lucene as the "main stor

idf in scores

2006-11-06 Thread Antony Bowesman
I've been trying to understand how idf is arrived at from a query. I have a single Document with 9 fields. One field "subject" has the phrase "RFC2822 - Internet Message Format" and a second "body" has the contents of rfc2822. The other fields contain additional meta data. If I search for su

Re: Filter query method

2006-11-06 Thread spinergywmy
Hi Chris, My scenario is: I will key in the first search value in the text box, then the first search result will be return. Next, I will clear the first search value and key in the second search value within the same text box. The 2nd search value will search the first result, for inst

Re: Filter query method

2006-11-06 Thread Chris Hostetter
:Is there anyone built your own filter query before, in order to perform : search within search results. Meaning after the first search, the result is : cached and the second search searches the result that return from the first : searched, and is not searching the whole index again. : :Ju

Filter query method

2006-11-06 Thread spinergywmy
Hi, Is there anyone built your own filter query before, in order to perform search within search results. Meaning after the first search, the result is cached and the second search searches the result that return from the first searched, and is not searching the whole index again. Just won

Re: whats the correct way to do normalisation?

2006-11-06 Thread Erik Hatcher
On Nov 6, 2006, at 11:27 AM, hans meiser wrote: Hi, Did you take a look at IsoLatin1AccentFilter ? It nearly do the same i need, but not perfectly. public final Token next() throws java.io.IOException { final Token t = input.next(); if (t == null) return null; return new Token

RE: "Catalog" backend for document stored fields?

2006-11-06 Thread Robichaud, Jean-Philippe
[sorry for the long delay for my answer, we are having some issues with our mail server...] Thanks for your comment. Yes it would make sense if the log files were not so big. In fact, I'm only indexing a subset of the log information. Because I store the information in Lucene, it is easier and f

Doc classification / categorization with Lucene ?

2006-11-06 Thread Dmitry Goldenberg
Hello, What are the best practices for document classification / categorization using Lucene? Any recommendations as far as manual vs. automatic, which products to use or not to use? Does Lucene offer anything out of the box? Thanks, - Dmitry

Re: whats the correct way to do normalisation?

2006-11-06 Thread hans meiser
Hi, > Did you take a look at IsoLatin1AccentFilter ? It nearly do the same i need, but not perfectly. public final Token next() throws java.io.IOException { final Token t = input.next(); if (t == null) return null; return new Token(removeAccents(t.termText()), t.startO

Re: whats the correct way to do normalisation?

2006-11-06 Thread Patrick Turcotte
Hi, Did you take a look at IsoLatin1AccentFilter ? Patrick On 11/6/06, hans meiser <[EMAIL PROTECTED]> wrote: Hi, Lucene indexes documents from 3 different countries here (English, German and French). I want to normalize some characters like umlauts. ä -> ae I did it in the following way

RE: Intermittent search performance problem

2006-11-06 Thread Vladimir Olenin
Any profiler can add it's own overhead. You might try the "-verbose:gc" JVM flag (if you haven't tried it yet). The fastest way to check if you problems are GC related. Check JVM docs (or plainly - 'java -help' for more flags. There are some '-X' flags for more detailed info, as well as flags to du

whats the correct way to do normalisation?

2006-11-06 Thread hans meiser
Hi, Lucene indexes documents from 3 different countries here (English, German and French). I want to normalize some characters like umlauts. ä -> ae I did it in the following way: New Analyzer: public class SpecialCharsAnalyzer extends StandardAnalyzer { public SpecialCharsAnalyzer() {

Re: injecting fields looked up from DB at the runtime - Solr/Lucene question

2006-11-06 Thread Yonik Seeley
On 11/6/06, Vladimir Olenin <[EMAIL PROTECTED]> wrote: So, if I'll be using Solr, what's the right strategy? Is it possible to redefine SolrDocument class through configuration? There isn't currently a SolrDocument class... it's all hypothetical. But yes, I imagine it would work by allowing one

RE: injecting fields looked up from DB at the runtime - Solr/Lucene question

2006-11-06 Thread Vladimir Olenin
So, if I'll be using Solr, what's the right strategy? Is it possible to redefine SolrDocument class through configuration? If not, would it be safe to inject these properties through Aspects while maintaining the whole framework intact and in workable condition? (eg, if at some point for some reaso

Re: Modelling relational data in Lucene Index?

2006-11-06 Thread Emmanuel Bernard
I had a quick look at SOLR and DBSight. They seem to achieve a different goal than Hibernate Lucene. The formers belong to the project box category: you set up a server that will handle the search for you. The application will then delegate the work to those servers. The latter belongs to the fr

Re: reg : document number

2006-11-06 Thread Erick Erickson
From the comments in the IndexModifier.java file (didn't see this in the "regular" javadoc... /** * Deletes all documents containing term. * This is useful if one uses a document field to hold a unique ID string for * the document. Then to delete such a document, one merely constructs a

how to boost recent document

2006-11-06 Thread Bhavin Pandya
Hi guys, How to boost recent documents ...so that it top ups... I mean what coulld be proper strategy... - Bhavin pandya

how to boost recent doc

2006-11-06 Thread Bhavin Pandya
Hi guys, I want to boost recent documentsso that it top ups... what could be best strategy... - Bhavin pandya