hello! http://downloads.supportandmore.de/amrv-10-11.html?jhID=8aus2
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
Hi,
Could I please ask another question regarding Lucene "under the hood" /
performance.
I wondered how "AND" queries are implemented?
Say we query for "+hello +world".
Would Lucene simply find 2 lists of documents ("documents containing HELLO",
and "documents containing WORLD"),
and then in
Thanks again.
- Original Message -
From: Erick Erickson
To: java-user@lucene.apache.org; sol myr
Cc:
Sent: Sunday, October 23, 2011 7:18 PM
Subject: Re: performance question - number of documents
"Why would it matter...top 5 matches" Because Lucene has to calculate
the
Hi,
We've noticed some Lucene performance phenomenon, and would appreciate an
explanation from anyone familiar with Lucene internals
(I know Lucene as a user, but haven't looked under its hood).
We have a Lucene index of about 30 million records.
We ran 2 queries: "AND" and "OR" ("+john +doe" v
Hi,
I noticed that the new Lucene 3.4 supports "BlockJoinQuery" (allowing for
'join' or 'relation' between documents).
I understand the documented limitations on the feature (nowhere near the power
of SQL join), but it's still very useful for me :)
My question: does anyone please have a code ex
Thanks so much, this helped a lot :)
- Original Message -
From: Uwe Schindler
To: java-user@lucene.apache.org; 'sol myr'
Cc:
Sent: Tuesday, October 4, 2011 12:14 PM
Subject: RE: [Lucene] Frequencies and positions - are they stored per field?
Hi,
Term Vectors are somehow
Thanks a lot.
But then what's the added value of Field.TermVector?
Can't it be deduced from the overall Lucene index? Or is it just inefficient to
deduce?
Thanks again :)
- Original Message -
From: Uwe Schindler
To: java-user@lucene.apache.org; 'sol myr'
Cc:
Se
Hi,
I use Lucene, but an not familiar with its internals.
I'd appreciate help understanding whether Term Frequences and Positions - are
stored per Document of per Field?
On the one hand, I never ask for "Field.TermVector" because I read it's only
required for "MoreLikeThis" (which I don't ne
Hi,
Our Lucene index grew to about 4 GB .
Unfortunately it brought up a performance problem of slow file merging.
We have:
1. A writer thread: once an Hour it looks for modified documents, and
updates the Lucene index.
Usually there are only few modifications, but sometimes we switch the
entire co
Hi,
I need to run and "AND" query with a twist: give higher ranking for
"exact match".
So when searching for BIG BEN
- Give high rank for the Document "BIG BEN is in London"
- Lower rank for "It's a BIG day for my dear friend BEN"
Following good advice from this list, I combined 2 separate que
Hi,
I'm rephrasing a previous performance question, in light of new data...
I have a Lucene index of about 0.5 GB.
Currently performance is good - up to 200 milliseconds per search (with
complex boolean queries, but never retrieving more than 200 top results).
The question: how much can the index
---
> Instant Scalable Full-Text Search On Any Database/Application
> site: http://www.dbsight.net
> demo: http://search.dbsight.com
> Lucene Database Search in 3 minutes:
>
> http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes
>
>
&g
Hi,
I need to search a Catalog.
Most users search *this* year's catalog, but on rare occasions they may ask
for old products (from previous years).
I'm trying to select between 2 options:
1) Keep huge big index for all years (where documents have a "year" field,
so I can filter out the current ye
l-Text Search On Any Database/Application
> site: http://www.dbsight.net
> demo: http://search.dbsight.com
> Lucene Database Search in 3 minutes:
> http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes
>
>
>
> On 3/22/2011 1:30 AM, sol myr wrote:
&g
Hi,
What are my options for distributing an application that uses Lucene?
Our current application works against a database of INVENTORY. We schedule
hourly checks for modified items (timestamp-based), and update a single
Lucene index.
Now we want to distribute out application, to a Grid, with fail
Hi,
I have several documents that share the same (large) searchable data.
For example, say my Documents represent movies, and 2 movies share the same
actorBiography of Brad Pitt (assuming I want
to search movies by actorBiography words, far-fetched as it might seem):
Document1:
- movieName=
other complications.
--
Ian.
On Wed, Feb 9, 2011 at 8:43 AM, sol myr wrote:
> Hi,
>
> I'm building my own BooleanQuery (rather than using Query Parser). That's
> because I need different defaults from my users:
> If a user types: java program
> I need to run the query: +java* +
Hi,
I'm building my own BooleanQuery (rather than using Query Parser). That's
because I need different defaults from my users:
If a user types: java program
I need to run the query: +java* +program* (namely AND search, with Prefix so as
to hit "programS", "programMER").
So naively I split the
Hi,
I'm trying to understand the behavior of file merging / optimization.
I see that whenever my IndexWriter calls 'commit()', it creates a new file (or
fileS).
I also see these files merged when calling 'optimize()' , as much as allowed by
the parameter 'NoCFSRatio' .
But I'm still trying to f
newReader;
searcher = new IndexSearcher(reader);
}
instead of
reader.reopen(true);
Bye.
*Raf*
On Sun, Jan 16, 2011 at 11:06 AM, sol myr wrote:
> Hi,
>
> Thank you kindly for replying.
> Unfortunately, reopen() doesn't help me see the changes.
> Here's my test:
> First
Hi,
Thank you kindly for replying.
Unfortunately, reopen() doesn't help me see the changes.
Here's my test:
First I write & commit a document, and run a search - which correctly finds
this document.
Then I write & commit another document, re-open the reader and run another
search - this should
Hi,
We're writing a web application, which naturally needs
- "IndexSearcher" when users use our search screen
- "IndexWriter" in a background process that periodically updates and optimizes
our index.
Note our writer is exclusive - no other applications/threads ever write to our
index files.
Wh
option 2 can automatically record create/update/delete
changes
cleanly in a separate table.
Either of these options help you to "remember to reindex" just the changed
items.
Cheers
Mark
- Original Message
From: sol myr
To: java-user@lucene.apache.org
Sent: Thu, 13 Jan
Hi,
Our main data storage is MySql tables.
We index it on Lucene in order to improve the search (boosting, proximate
spelling, etc).
We naturally maintain it - for example, to insert a new "Contract" entity, we
have:
addContract(Contract cont){
// INSERT into MySQL:
hibernateSession.sa
/changes/Changes.html#3.0.3.changes_in_runtime_behavior
so its a feature not a bug :)
Simon
On Tue, Jan 11, 2011 at 7:46 AM, sol myr wrote:
> Hi,
>
> Continuing my question - I now suspect a bug in Lucene 3.0.3, because I ran
> the test with Lucene 3.0.0 and it worked okay (no junk fil
Hi,
Continuing my question - I now suspect a bug in Lucene 3.0.3, because I ran the
test with Lucene 3.0.0 and it worked okay (no junk files)... could anyone
please confirm?
--- On Mon, 1/10/11, sol myr wrote:
From: sol myr
Subject: Newbie question: optimized files?
To: java-user
Hi,
I'm new to Lucene (using 3.0.3), and just started to check out the behavior of
the 'optimize()' method (which is quite important for our application).
Could it be that 'optimize' cancels out the 'compoundFile' mode? Or am I doing
something wrong?
Here's my test: I create an indexWriter wi
27 matches
Mail list logo