Hi,
I am actually doing something like what the original poster mentioned.
Previously, I used Hibernate and Lucene. But I found that for my
particular project my data is quite flat, so in the next version I am
taking out Hibernate entirely (and the complexity with it :)) and using Lucene as the
"main stor
I've been trying to understand how idf is arrived at from a query. I have a
single Document with 9 fields. One field "subject" has the phrase "RFC2822 -
Internet Message Format" and a second "body" has the contents of rfc2822.
The other fields contain additional metadata. If I search for su
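For reference, a minimal sketch of how the idf factor could be computed, assuming the formula used by DefaultSimilarity in Lucene releases of that period: idf = 1 + ln(numDocs / (docFreq + 1)). The class name IdfSketch is hypothetical and no Lucene classes are used. With a single document that contains the term, numDocs and docFreq are both 1, which gives a value of about 0.307.

```java
final class IdfSketch {
    /**
     * Inverse document frequency, assuming the DefaultSimilarity formula
     * idf = 1 + ln(numDocs / (docFreq + 1)).
     *
     * @param docFreq number of documents that contain the term
     * @param numDocs total number of documents in the index
     */
    static float idf(int docFreq, int numDocs) {
        return (float) (Math.log(numDocs / (double) (docFreq + 1)) + 1.0);
    }

    public static void main(String[] args) {
        // One document in the index, and it contains the term.
        System.out.println(idf(1, 1));
    }
}
```

Because docFreq + 1 appears in the denominator, a term found in every document of a very small index gets an idf below 1.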
Hi Chris,
My scenario is:
I will key in the first search value in the text box, and the first
search result will be returned. Next, I will clear the first search value and
key in the second search value in the same text box. The second search
value will search within the first result, for inst
:Has anyone built their own filter query before, in order to perform a
: search within search results? Meaning that after the first search, the result is
: cached and the second search searches the results returned by the first
: search, rather than searching the whole index again.
:
:Ju
Hi,
Has anyone built their own filter query before, in order to perform a
search within search results? Meaning that after the first search, the result is
cached and the second search searches the results returned by the first
search, rather than searching the whole index again.
Just won
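The idea in the question above can be sketched in plain Java, assuming the first result set is cached as a java.util.BitSet of document ids and the second search only looks at ids set in it. Lucene filters of that period also expose their matches as a BitSet, but this sketch uses no Lucene classes: SearchWithinResults, search and searchWithin are hypothetical names, and the substring match stands in for a real query.

```java
import java.util.BitSet;
import java.util.Locale;

final class SearchWithinResults {
    private final String[] docs;

    SearchWithinResults(String[] docs) {
        this.docs = docs;
    }

    /** First search: checks every document and returns the matching ids. */
    BitSet search(String term) {
        BitSet all = new BitSet(docs.length);
        all.set(0, docs.length);
        return searchWithin(all, term);
    }

    /** Follow-up search: checks only documents whose id is set in 'previous'. */
    BitSet searchWithin(BitSet previous, String term) {
        String needle = term.toLowerCase(Locale.ROOT);
        BitSet hits = new BitSet(docs.length);
        for (int id = previous.nextSetBit(0); id >= 0; id = previous.nextSetBit(id + 1)) {
            if (docs[id].toLowerCase(Locale.ROOT).contains(needle)) {
                hits.set(id);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        SearchWithinResults index = new SearchWithinResults(new String[] {
            "lucene index search", "lucene filter", "hibernate search", "solr filter"
        });
        BitSet first = index.search("lucene");               // cached first result
        BitSet second = index.searchWithin(first, "filter"); // searched within it
        System.out.println(first + " then " + second);
    }
}
```

The cached BitSet costs one bit per document, so keeping it per user session is cheap compared with re-running the first query.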
On Nov 6, 2006, at 11:27 AM, hans meiser wrote:
Hi,
Did you take a look at IsoLatin1AccentFilter ?
It does nearly what I need, but not perfectly.
public final Token next() throws java.io.IOException {
    final Token t = input.next();
    if (t == null)
        return null;
    return new Token
[sorry for the long delay in answering, we are having some issues with our
mail server...]
Thanks for your comment. Yes it would make sense if the log files were not
so big. In fact, I'm only indexing a subset of the log information.
Because I store the information in Lucene, it is easier and f
Hello,
What are the best practices for document classification / categorization using
Lucene? Any recommendations as far as manual vs. automatic, which products to
use or not to use? Does Lucene offer anything out of the box?
Thanks,
- Dmitry
Hi,
> Did you take a look at IsoLatin1AccentFilter ?
It does nearly what I need, but not perfectly.
public final Token next() throws java.io.IOException {
    final Token t = input.next();
    if (t == null)
        return null;
    return new Token(removeAccents(t.termText()), t.startO
Hi,
Did you take a look at IsoLatin1AccentFilter ?
Patrick
On 11/6/06, hans meiser <[EMAIL PROTECTED]> wrote:
Hi,
Lucene indexes documents from 3 different countries here
(English, German and French). I want to normalize some
characters like umlauts. ä -> ae
I did it in the following way
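The poster's own code is cut off in this archive, so what follows is not it. As a separate illustration, a minimal plain-Java sketch of the character mapping alone, assuming the usual German transliteration. Only the a-umlaut to "ae" mapping comes from the message; the other mappings and the name UmlautNormalizer are assumptions. In an analyzer, a method like this would be applied to each token's text inside a token filter.

```java
final class UmlautNormalizer {
    /**
     * Replaces German umlauts and sharp s with ASCII digraphs.
     * Characters are written as Unicode escapes so the source stays ASCII-only.
     */
    static String normalize(String text) {
        StringBuilder out = new StringBuilder(text.length() + 8);
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '\u00e4': out.append("ae"); break; // a-umlaut (from the message)
                case '\u00f6': out.append("oe"); break; // o-umlaut (assumed)
                case '\u00fc': out.append("ue"); break; // u-umlaut (assumed)
                case '\u00c4': out.append("Ae"); break; // capital a-umlaut (assumed)
                case '\u00d6': out.append("Oe"); break; // capital o-umlaut (assumed)
                case '\u00dc': out.append("Ue"); break; // capital u-umlaut (assumed)
                case '\u00df': out.append("ss"); break; // sharp s (assumed)
                default: out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(normalize("M\u00fcller"));
    }
}
```

The same normalization has to run at both index time and query time, otherwise a query containing an umlaut will not match the normalized terms.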
Any profiler can add its own overhead. You might try the "-verbose:gc"
JVM flag (if you haven't tried it yet). It is the fastest way to check whether your
problems are GC related. Check the JVM docs (or simply 'java -help') for
more flags. There are some '-X' flags for more detailed info, as well as
flags to du
Hi,
Lucene indexes documents from 3 different countries here
(English, German and French). I want to normalize some
characters like umlauts. ä -> ae
I did it in the following way:
New Analyzer:
public class SpecialCharsAnalyzer extends StandardAnalyzer {
    public SpecialCharsAnalyzer() {
On 11/6/06, Vladimir Olenin <[EMAIL PROTECTED]> wrote:
So, if I use Solr, what's the right strategy? Is it possible to
redefine the SolrDocument class through configuration?
There isn't currently a SolrDocument class... it's all hypothetical.
But yes, I imagine it would work by allowing one
So, if I use Solr, what's the right strategy? Is it possible to
redefine the SolrDocument class through configuration? If not, would it be
safe to inject these properties through Aspects while maintaining the
whole framework intact and in workable condition? (eg, if at some point
for some reaso
I had a quick look at SOLR and DBSight. They seem to achieve a different
goal than Hibernate Lucene.
The former belong to the project box category: you set up a server that
will handle the search for you. The application will then delegate the
work to those servers.
The latter belongs to the fr
From the comments in the IndexModifier.java file (didn't see this in the
"regular" javadoc...)
/**
* Deletes all documents containing term.
* This is useful if one uses a document field to hold a unique ID string for
* the document. Then to delete such a document, one merely constructs a
Hi guys,
How can I boost recent documents so that they come out on top?
I mean, what would be a proper strategy?
- Bhavin pandya
Hi guys,
I want to boost recent documents so that they come out on top.
What would be the best strategy?
- Bhavin pandya
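One possible strategy, sketched under assumptions: derive a boost factor from a document's age with exponential decay, so a new document gets the full extra boost and the extra halves every halfLifeDays. The class RecencyBoost, its parameters and the decay shape are all hypothetical choices, not something prescribed by Lucene. The resulting factor could be applied at index time (for example through Document.setBoost in Lucene versions of that period), or the question could be sidestepped by sorting results on a date field.

```java
final class RecencyBoost {
    private static final double MILLIS_PER_DAY = 86_400_000.0;

    /**
     * Boost factor in the range (1, 1 + maxExtra].
     *
     * @param ageMillis    document age in milliseconds (negative values count as 0)
     * @param halfLifeDays age in days at which the extra boost has halved
     * @param maxExtra     extra boost given to a brand-new document
     */
    static float boost(long ageMillis, double halfLifeDays, double maxExtra) {
        double ageDays = Math.max(0L, ageMillis) / MILLIS_PER_DAY;
        return (float) (1.0 + maxExtra * Math.pow(0.5, ageDays / halfLifeDays));
    }

    public static void main(String[] args) {
        long day = 86_400_000L;
        System.out.println("new:         " + boost(0L, 30.0, 1.0));
        System.out.println("30 days old: " + boost(30L * day, 30.0, 1.0));
        System.out.println("1 year old:  " + boost(365L * day, 30.0, 1.0));
    }
}
```

An index-time boost is fixed when the document is written, so with this approach older documents keep their original boost until they are reindexed.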
18 matches