Re: New codecs keep Freq skip/omit Pos

2011-04-21 Thread Robert Muir
On Thu, Apr 21, 2011 at 9:52 PM, Alex vB wrote: > > PforDelta W Freq W Pos         20.6 GB > PforDelta W/O Freq W/O Pos               1.6 GB > Standard 4.0 W Freq W Pos              28.1 GB > Standard 4.0 W/O Freq W/O Pos    6.2 GB > Pfor W Freq W Pos                         22 GB > Pfor W/O Freq

Re: New codecs keep Freq skip/omit Pos

2011-04-21 Thread Robert Muir
On Thu, Apr 21, 2011 at 9:52 PM, Alex vB wrote: > Hello everybody, > > I am currently testing several new Lucene 4.0 codec implementations to > compare with an own solution. > The difference is that I am only indexing frequencies and not positions. I > would like to have this for the other codecs.

New codecs keep Freq skip/omit Pos

2011-04-21 Thread Alex vB
Hello everybody, I am currently testing several new Lucene 4.0 codec implementations to compare with an own solution. The difference is that I am only indexing frequencies and not positions. I would like to have this for the other codecs. I know there was already a post for this topic http://lucen

"Umlaute" getting lost

2011-04-21 Thread Clemens Wyss
I keep my search terms in a dedicated RAMDirectory (the termIndex). In there I palce all the term of my real index. When putting the terms into the termIndex I can still see [using the debugger] the Umlaute (äöü). Unfortunately when searching the termIndex the documents no more contain these Um

Re: termFreqVector is always null ?

2011-04-21 Thread Patrick Diviacco
Nevermind, I've solved by indexing the fields with with Field.TermVector.YES doc.add(new Field("tags", "foo bar", Store.NO, Index.ANALYZED, Field.TermVector.YES)); On 21 April 2011 10:57, Patrick Diviacco wrote: > Hi, > > for any document, the termFreqVector is always null. > I'm sure

Re: How to make search distributed and scalable

2011-04-21 Thread Weiwei Wang
thanks, dude, i will take a look at it 2011/4/21 Kim Kokkonen > You might check out katta, which nicely combines lucene and hadoop: > > http://katta.sourceforge.net/ > > This page has some architectural talks: > > http://katta.sourceforge.net/about/talks > > -Kim > > > > On 4/19/2011 12:11 AM, W

termFreqVector is always null ?

2011-04-21 Thread Patrick Diviacco
Hi, for any document, the termFreqVector is always null. I'm sure the documents are in the collection and the field exist. So where is the problem ? for (int i = 0; i < reader.numDocs(); i++){ TermFreqVector tfv = reader.getTermFreqVector(i, "tags"); thanks

Re: Locking Issue with Concurrency

2011-04-21 Thread yogesh
i search the data i got error An unexpected network error occurred -- View this message in context: http://lucene.472066.n3.nabble.com/Locking-Issue-with-Concurrency-tp2842509p2846120.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.