Thanks a million. You really helped me get headed in the right direction. I'm
going to start building some test cases this afternoon so I can really
begin to get my hands dirty and see how everything works. It's good to
know that there isn't one "right" way for my purposes, which is mostly
what I wanted
Yes, updating a document in Lucene is "expensive" for two
reasons:
1> deleting and adding a document does mean there's internal
work being done. But it's not all *that* expensive. So this really
comes down to how many records you expect to update
every 15 minutes. You've gotta try it.
2
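To make the "deleting and adding" point in item 1 concrete, here is a minimal sketch of why an update costs two operations. It is a toy in-memory inverted index written for illustration only; ToyIndex and its methods are hypothetical names, not the Lucene API, and it says nothing about Lucene's real performance.

```python
from collections import defaultdict


class ToyIndex:
    """In-memory inverted index; a stand-in for illustration, not Lucene."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> set of document ids
        self.docs = {}                    # document id -> original text

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for term in text.lower().split():
            self.postings[term].add(doc_id)

    def delete(self, doc_id):
        text = self.docs.pop(doc_id, None)
        if text is None:
            return  # nothing indexed under this id
        for term in text.lower().split():
            self.postings[term].discard(doc_id)
            if not self.postings[term]:
                del self.postings[term]

    def update(self, doc_id, text):
        # No in-place edit: remove the old version, then index the new one.
        self.delete(doc_id)
        self.add(doc_id, text)

    def search(self, term):
        return sorted(self.postings.get(term.lower(), set()))


index = ToyIndex()
index.add(1, "Lucene in Action")
index.add(2, "Hibernate Search guide")
index.update(1, "Solr in Action")
```

After the update, document 1 is no longer found under its old terms, only under the new ones. The work per update is proportional to the size of the old and new document, which is why the number of records changed per interval is the figure to measure.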
I'm going to try to cover all the replies so far, but mostly this first
one since it was the most helpful. Thanks to everyone who
replied; you've given me a lot of great ideas to think about and
look into. I'm going to begin some small test indexes with our data so
we have somet
Hi
I was going to suggest looking at Hibernate Search. It comes with
event listeners that modify your indexes when the persistent entity
changes. It uses Lucene under the hood, so if you need to access Lucene
directly then you can.
Indexing can be done sync or async and the documentation shows how to
On Thu, Mar 26, 2009 at 6:28 PM, Matt Schraeder wrote:
> I'm new to Lucene and just beginning my project of adding it to our web
> app. We are indexing data from a MS SQL 2000 database and building
> full-text search from it.
>
> Everything I have read says that building the index is a resource h
There are many things you need to synchronize with the database. Besides
just changed fields, you may need to deal with deleted database records,
etc.
In general, it's not efficient to pull over data that's changing
often and may not have much effect on search. It'll overload Lucene
unnecessarily
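
One way to picture the synchronization concerns above (changed records plus deleted records) is a small planning step that runs before any indexing. This is only a sketch under stated assumptions: it supposes each database row has an id and a last-modified timestamp, and that the set of ids currently in the index is known. plan_sync and its parameters are hypothetical names, not part of Lucene or any database driver.

```python
def plan_sync(db_rows, indexed_ids, last_sync):
    """Decide what to re-index and what to remove from the index.

    db_rows:     dict mapping row id -> last-modified timestamp (assumed column)
    indexed_ids: set of row ids currently present in the search index
    last_sync:   timestamp of the previous synchronization run
    """
    # Rows that are new to the index or were modified since the last run.
    to_reindex = sorted(
        row_id
        for row_id, modified in db_rows.items()
        if row_id not in indexed_ids or modified > last_sync
    )
    # Ids still in the index whose rows no longer exist in the database.
    to_delete = sorted(indexed_ids - set(db_rows))
    return to_reindex, to_delete


to_reindex, to_delete = plan_sync(
    db_rows={1: 100, 2: 250, 4: 300},
    indexed_ids={1, 2, 3},
    last_sync=200,
)
```

Here row 2 changed after the last run, row 4 is new, and row 3 was deleted from the database, so only those three ids need any index work; row 1 is left alone.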
You've got a great grasp of the issues, comments below. But before you
do, a lot of this kind of thing is incorporated in SOLR, which is built on
Lucene. Particularly updating an index then using it.
So you might take a look over there. It even has a DataImportHandler...
WARNING: I've only been mo
I'm new to Lucene and just beginning my project of adding it to our web
app. We are indexing data from a MS SQL 2000 database and building
full-text search from it.
Everything I have read says that building the index is a resource heavy
operation so we should use it sparingly. For the most part