Re: need to find locations of query hits in doc: works fine for regular text but not for phone numbers

2012-06-13 Thread Jack Krupansky
Try putting the phone number in quotes in the query: String qstr = "\"800-555-1212\""; And check query.toString() to see how the query parser analyzed the term, both with and without quotes. And make sure you initialized the query parser with "contents" as the default field. -- Jack Krupansky
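A minimal sketch of the quoting step suggested above, in plain Java with no Lucene dependency. The class name PhraseQuoting and the method quoteAsPhrase are hypothetical helpers, not part of Lucene. The helper wraps a raw term in double quotes and backslash-escapes any embedded quote or backslash, which is assumed here to be enough for the query parser to read it as a single phrase.

```java
public final class PhraseQuoting {
    private PhraseQuoting() {}

    // Hypothetical helper: wrap a raw term in double quotes so a query
    // parser would see it as one phrase. Embedded quotes and backslashes
    // are backslash-escaped so they cannot end the phrase early.
    public static String quoteAsPhrase(String term) {
        StringBuilder sb = new StringBuilder(term.length() + 2);
        sb.append('"');
        for (int i = 0; i < term.length(); i++) {
            char c = term.charAt(i);
            if (c == '"' || c == '\\') {
                sb.append('\\');
            }
            sb.append(c);
        }
        sb.append('"');
        return sb.toString();
    }

    public static void main(String[] args) {
        // Same value as the literal "\"800-555-1212\"" in the message above.
        String qstr = quoteAsPhrase("800-555-1212");
        System.out.println(qstr);
    }
}
```

The resulting string would then be handed to the query parser, once quoted and once unquoted, and the two parsed queries printed with toString() to compare how each was analyzed.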

need to find locations of query hits in doc: works fine for regular text but not for phone numbers

2012-06-13 Thread Ilya Zavorin
Hello All, I am using Lucene 3.4. I need to find the locations of query hits in a document. What I've implemented works fine for textual queries but does not work for phone numbers. Here's how I index my docs: String oc = "Joe dialed 800-555-1212 but got a busy signal"; doc.add(new Field("contents",

ant build target:jflex throws Stackoverflow error

2012-06-13 Thread bin01
Hello, I have checked out Lucene 3.6 and I am trying to run the ant jflex target. It throws a StackOverflowError when it tries to execute the target jflex-UAX29URLEmailTokenizer. Any idea why this is happening? I have checked out the trunk version of JFlex 1.5 with revision greater than 597 as

Re: OutOfMemoryError when opening the index ?

2012-06-13 Thread Yang
OK, found it: we are using Cloudera CDHu3u, and they change the ulimit for child jobs. But I still don't know how to change their default settings yet. On Wed, Jun 13, 2012 at 2:15 PM, Yang wrote: > I got the OutOfMemoryError when I tried to open a Lucene index. > > it's very weird since this is o

Re: Index pruning

2012-06-13 Thread Andrzej Bialecki
On 30/05/2012 03:39, Greg Bowyer wrote: Hi all, I am playing about with the index pruning contrib package; I want to see if it will make a faster and slightly smaller index for me. However, when I try either the Carmel or RIDF methods, it just ends up deleting all my postings for the two fields of inte