Different Weights to Lucene fields with Okapi Similarity

2012-07-16 Thread Kasun Perera
Resending again, since my question didn't get much attention -- Forwarded message -- From: Kasun Perera Date: Tue, Jun 19, 2012 at 3:26 PM Subject: Different Weights to Lucene fields with Okapi Similarity To: java-user@lucene.apache.org Based on this link http://www2002.org/CDR

RE: Lucene reorganizing indexes

2012-07-16 Thread Uwe Schindler
You may want to read: http://www.searchworkings.org/blog/-/blogs/simon-says%3A-optimize-is-bad-for -you - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Scott Smith [mailto:ssm...@mainstreamdata.com] > Se

AW: Lucene reorganizing indexes

2012-07-16 Thread Ralf Heyde
Do you use Lucene or Solr? We faced the problem in Solr due too big Caches, which where (re)warmed up after a commit and the never ending full GCs. Greets Ralf -Ursprüngliche Nachricht- Von: Scott Smith [mailto:ssm...@mainstreamdata.com] Gesendet: Montag, 16. Juli 2012 22:29 An: java-us

Lucene reorganizing indexes

2012-07-16 Thread Scott Smith
We have an application that has to do "real time" indexing of a number of documents. What it does is wake up about every 20 seconds and updates the index with any changes that have been queued since the last time it ran. This involves adding and deleting several hundred documents. This is all

Re: Offsets in 3.6/4.0

2012-07-16 Thread karsten-solr
Dear Carsten, abstract of your post: you need the offset to perform your search/ranking like the position is needed for phrase queries. You are using reader.getTermFreqVector to get the offset. This is to slow for your application and you think about a switch to version 4.0 imho you should usi

Re: many index reader problem

2012-07-16 Thread Danil ŢORIN
Do you really HAVE to keep all those indexes opened? You could use a LRU or LFU cache of reasonable size with opened indexes, and open new searcher if it's not in the cache. If your indexes are quite small, the open call shouldn't be too expensive. On Mon, Jul 16, 2012 at 11:51 AM, Ian Lea wrot

Re: In memory Lucene configuration

2012-07-16 Thread Simon Willnauer
your spikes could be due to garbage collection. Since you are on java 1.7 you could try this commandline (blind shot): java -server -Xms1G -Xmx1G -Xss128k -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=75 -XX:+UseCMSInitiatingOccupancyOnly or maybe tr

Re: many index reader problem

2012-07-16 Thread Ian Lea
OOV or OOM? Always best to post a full stack trace, and version of lucene, and OS. Anyway - give your app more memory? Close searchers after use or some period of inactivity? Best long term solution is probably to merge the many small indexes into one, or a few, larger indexes and restrict quer

Re: about some date store

2012-07-16 Thread Ian Lea
So "content" is a String variable in your program holding a multi-line value, is it? I'd double check exactly what that is holding before you store it in the index. -- Ian. On Mon, Jul 16, 2012 at 4:56 AM, sam wrote: > I had done that,I used the docment.add(new > field("content",content,field