Resending again, since my question didn't get much attention
-- Forwarded message --
From: Kasun Perera
Date: Tue, Jun 19, 2012 at 3:26 PM
Subject: Different Weights to Lucene fields with Okapi Similarity
To: java-user@lucene.apache.org
Based on this link http://www2002.org/CDR
You may want to read:
http://www.searchworkings.org/blog/-/blogs/simon-says%3A-optimize-is-bad-for
-you
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Message-
> From: Scott Smith [mailto:ssm...@mainstreamdata.com]
> Se
Do you use Lucene or Solr?
We faced the problem in Solr due too big Caches, which where (re)warmed up
after a commit and the never ending full GCs.
Greets Ralf
-Ursprüngliche Nachricht-
Von: Scott Smith [mailto:ssm...@mainstreamdata.com]
Gesendet: Montag, 16. Juli 2012 22:29
An: java-us
We have an application that has to do "real time" indexing of a number of
documents. What it does is wake up about every 20 seconds and updates the
index with any changes that have been queued since the last time it ran. This
involves adding and deleting several hundred documents. This is all
Dear Carsten,
abstract of your post:
you need the offset to perform your search/ranking like the position is needed
for phrase queries.
You are using reader.getTermFreqVector to get the offset.
This is to slow for your application and you think about a switch to version 4.0
imho you should usi
Do you really HAVE to keep all those indexes opened?
You could use a LRU or LFU cache of reasonable size with opened
indexes, and open new searcher if it's not in the cache.
If your indexes are quite small, the open call shouldn't be too expensive.
On Mon, Jul 16, 2012 at 11:51 AM, Ian Lea wrot
your spikes could be due to garbage collection. Since you are on java
1.7 you could try this commandline (blind shot):
java -server
-Xms1G
-Xmx1G
-Xss128k
-XX:+UseParNewGC
-XX:+UseConcMarkSweepGC
-XX:CMSInitiatingOccupancyFraction=75
-XX:+UseCMSInitiatingOccupancyOnly
or maybe tr
OOV or OOM? Always best to post a full stack trace, and version of
lucene, and OS.
Anyway - give your app more memory? Close searchers after use or some
period of inactivity?
Best long term solution is probably to merge the many small indexes
into one, or a few, larger indexes and restrict quer
So "content" is a String variable in your program holding a multi-line
value, is it? I'd double check exactly what that is holding before
you store it in the index.
--
Ian.
On Mon, Jul 16, 2012 at 4:56 AM, sam wrote:
> I had done that,I used the docment.add(new
> field("content",content,field