Hi
I have a field that is indexed as follows
for(String c: article.getCategories()){
doc.add(new Field("categories", c.toLowerCase(),
Field.Store.YES, Field.Index.ANALYZED));
}
I have a search space of 2 million docs and I need to access the category field
of each hitdoc. I woul
Hi all,
I usually use Nutch for this but, just for fun, I tried to create a language
identifier based on Lucene only.
I had a really small set of "training data": 10 files (roughly 2M each) for
10 languages. I indexed those files using an NGram analyzer.
I have to say that I was not expecting mu
Think about using some different analysis chains. For instance,
WhitespaceTokenizer,
perhaps followed by LowercaseFilter. You can compose pretty arbitrary
analysis chains
to suit your needs.
Best
Erick
On Fri, Oct 21, 2011 at 10:59 AM, Michael Szediwy
wrote:
> Hi,
>
> what can I do if I want to
Hi upgraded from 3.1 to 3.4, now it is compliaing about deprecated method
indexWriter.setMergeFactor();
Saying it can only be used with the default LogMergePolicy ,but I never
set the merge policy so shouldn't I be using the default anyway ?
Paul
---
On Thu, Oct 20, 2011 at 9:06 PM, Mead Lai wrote:
> Thank you, Mike.
> Are you sure the 'Solr' has implemented 'Join' function.
> I just skims through some tour guids about Solr, and not sure about that.
> Appreciate you very much.
Woops, I'm sorry: I believe Solr's join functionality was only
imp
Here's the output from a little test program on a 2 doc index.
Query: title:bends author:bends
Hit: title=the bends, author=radiohead
exp: 0.24439742 = (MATCH) product of:
0.48879483 = (MATCH) sum of:
0.48879483 = (MATCH) weight(title:bends in 1), product of:
0.5564505 = queryWeight(t
Hi,
what can I do if I want to have "/" (slashes) as tokens to search?
Thanks & Regards
Michael
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
The "Expert" note means that ordinary non-expert users like me should
not be using it directly. It will likely be called behind the scenes
by some other method not flagged as expert, and it is those that we
should be using. Only developers and clever people doing clever
extensions and the like wo