Re: Filtering search results

2011-11-22 Thread Felipe Carvalho
I'd really love to use Solr, but unfortunately this is not an option for this project I'm working on, Would anybody have a pointer to an article or a few lines of code showing how to use this using Lucene core? I've done quite some digging on Google, but I could only people suggesting to use Solr

RE: Best document format / markup for text indexing?

2011-11-22 Thread Paul Allan Hill
> What is the best format/markup/ebook standard/document standard/other to use > for easiest and best text search support? The helpful Tika libraries can parse any number of formats and then index the text into Lucene, so I'm thinking the question is what is the better format when you want to d

Fuzzy Search Sorting

2011-11-22 Thread Romiko Derbynew
Hi Guys, I am using Lucene with neo4j database. Currently if I do a fuzzy search via a rest call using the Query API with this data GivenName: John FamilyName: Smith GivenName: Bob FamilyName: Smith GivenName: Adam FamilyName: Smith GivenName: Bill FamilyName: Smath If I query the index li

Re: JVM Crash org.apache.lucene.store.DataInput.readVInt()I

2011-11-22 Thread Roberto Fonti
Thanks Uwe and Simon and for your detailed answers. I'm reviewing my code to find where I'm using a closed index. Roberto On Nov 22, 2011, at 11:53 AM, Uwe Schindler wrote: > Hi, > > The JDK issues you are talking about are also fixed in 1.6.9_29 (not only > 1.7.0u1). So your crash is not rel

RE: JVM Crash org.apache.lucene.store.DataInput.readVInt()I

2011-11-22 Thread Uwe Schindler
Hi, The JDK issues you are talking about are also fixed in 1.6.9_29 (not only 1.7.0u1). So your crash is not related to any of the "famous java6/7 bugs" (in addition, the vint bug does *not* crash your JVM at all it just corrupts your index by returning wrong values - and this one is definitely fi

Re: JVM Crash org.apache.lucene.store.DataInput.readVInt()I

2011-11-22 Thread Simon Willnauer
hey, are you using mmap directory by any chance? Stuff like this often happens when you try to read from a already closed mmaped file. Typically you try to search or read from a already closed IndexSearcher / IndexReader, is it possible that you use a stale IndexReader in a in-flight search? The u

JVM Crash org.apache.lucene.store.DataInput.readVInt()I

2011-11-22 Thread Roberto Fonti
My JVM (1.6.0_29) keeps crashing on intensive use when indexing documents with Lucene. I get: # # A fatal error has been detected by the Java Runtime Environment: # # SIGSEGV (0xb) at pc=0x2b6b196d767c, pid=26417, tid=1183217984 # # JRE version: 6.0_29-b11 # Java VM: Java HotSpot(TM) 64-Bit

highlighter by using term offsets

2011-11-22 Thread starz10de
I'm writing a highlighter by using term offsets as follows: IndexReader reader = IndexReader.open( indexPath ); TermPositionVector tpv = (TermPositionVector)reader.getTermFreqVector( hits[i].doc,"contents"); When I run the searcher, I face this error in TermPositionVector t