FieldCache

2011-10-21 Thread Peyman Faratin
Hi, I have a field that is indexed as follows:
    for (String c : article.getCategories()) {
        doc.add(new Field("categories", c.toLowerCase(), Field.Store.YES, Field.Index.ANALYZED));
    }
I have a search space of 2 million docs and I need to access the category field of each hitdoc. I woul

Language Identifier with Lucene?

2011-10-21 Thread Luca Rondanini
Hi all, I usually use Nutch for this but, just for fun, I tried to create a language identifier based on Lucene only. I had a really small set of "training data": 10 files (roughly 2M each) for 10 languages. I indexed those files using an NGram analyzer. I have to say that I was not expecting mu
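For readers unfamiliar with the idea, here is a rough sketch of n-gram based language identification in plain Java. It is not the poster's Lucene setup (no index or NGram analyzer is involved); the class name, the method names, the overlap score, and the tiny training strings are all made up for illustration.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: character n-gram profiles compared by overlap.
// This only illustrates the general idea, not the Lucene-based approach from the post.
class NGramLanguageSketch {

    // Count every character n-gram of length n in the lowercased text.
    static Map<String, Integer> profile(String text, int n) {
        Map<String, Integer> counts = new HashMap<>();
        String s = text.toLowerCase(Locale.ROOT);
        for (int i = 0; i + n <= s.length(); i++) {
            counts.merge(s.substring(i, i + n), 1, Integer::sum);
        }
        return counts;
    }

    // Sum, over the n-grams present in both profiles, of the smaller count.
    static int overlap(Map<String, Integer> a, Map<String, Integer> b) {
        int score = 0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            Integer other = b.get(e.getKey());
            if (other != null) {
                score += Math.min(e.getValue(), other);
            }
        }
        return score;
    }

    // Return the language whose training text shares the most n-grams with the input.
    // On a tie, the language that comes first in the training map wins.
    static String identify(String text, Map<String, String> training, int n) {
        Map<String, Integer> query = profile(text, n);
        String best = null;
        int bestScore = -1;
        for (Map.Entry<String, String> e : training.entrySet()) {
            int score = overlap(query, profile(e.getValue(), n));
            if (score > bestScore) {
                bestScore = score;
                best = e.getKey();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Toy training data, far smaller than the roughly 2M per language mentioned in the post.
        Map<String, String> training = new LinkedHashMap<>();
        training.put("en", "the quick brown fox jumps over the lazy dog and then the dog sleeps");
        training.put("it", "la volpe veloce salta sopra il cane pigro e poi il cane dorme");
        System.out.println(identify("the dog jumps", training, 3));
    }
}
```

With real training data the profiles would normally be built once and reused, and a normalized score would be more robust than a raw overlap count.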

Re: Index forward slashes

2011-10-21 Thread Erick Erickson
Think about using some different analysis chains. For instance, WhitespaceTokenizer, perhaps followed by LowerCaseFilter. You can compose pretty arbitrary analysis chains to suit your needs.
Best
Erick
On Fri, Oct 21, 2011 at 10:59 AM, Michael Szediwy wrote:
> Hi,
>
> what can I do if I want to
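To see why a whitespace-based chain helps here, the following plain-Java sketch mimics the effect of splitting on whitespace and then lowercasing. It does not use the Lucene classes themselves; the class and method names are hypothetical and only illustrate that "/" survives as part of a token when whitespace is the only split point.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for a "whitespace tokenizer + lowercase filter" chain.
// It imitates the behaviour with String operations; it is not Lucene code.
class SlashTokenSketch {

    // Split on runs of whitespace only, then lowercase each token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints [read, /usr/local/lib, and, a/b]
        System.out.println(tokenize("Read /usr/local/Lib and A/B"));
    }
}
```

Note that with this kind of chain the slash stays inside a larger token such as "a/b"; a query has to be analyzed the same way at search time for the tokens to match.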

No longer able to set merge factor since updating to Lucene 3.4

2011-10-21 Thread Paul Taylor
Hi, I upgraded from 3.1 to 3.4, and now it is complaining about the deprecated method indexWriter.setMergeFactor(), saying it can only be used with the default LogMergePolicy. But I never set the merge policy, so shouldn't I be using the default anyway? Paul

Re: About "join.search" in 3.4 version.

2011-10-21 Thread Michael McCandless
On Thu, Oct 20, 2011 at 9:06 PM, Mead Lai wrote:
> Thank you, Mike.
> Are you sure that Solr has implemented the 'Join' function?
> I just skimmed through some tour guides about Solr, and I am not sure about that.
> I appreciate it very much.
Woops, I'm sorry: I believe Solr's join functionality was only imp

Re: Return Lucene field name when a query is matched

2011-10-21 Thread Ian Lea
Here's the output from a little test program on a 2 doc index.
Query: title:bends author:bends
Hit: title=the bends, author=radiohead
exp: 0.24439742 = (MATCH) product of:
  0.48879483 = (MATCH) sum of:
    0.48879483 = (MATCH) weight(title:bends in 1), product of:
      0.5564505 = queryWeight(t

Index forward slashes

2011-10-21 Thread Michael Szediwy
Hi, what can I do if I want to have "/" (slashes) as tokens to search? Thanks & Regards Michael

Re: Lucene java doc help

2011-10-21 Thread Ian Lea
The "Expert" note means that ordinary non-expert users like me should not be using it directly. It will likely be called behind the scenes by some other method not flagged as expert, and it is those that we should be using. Only developers and clever people doing clever extensions and the like wo