Re: Classifier4J and Lucene

2005-10-23 Thread msftblows
interesting information you have here...I will look into this and let you know what I come up with. Thanks! -Original Message- From: Chris Hostetter <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Sun, 23 Oct 2005 10:14:13 -0700 (PDT) Subject: Re: Classifier4J and

Re: Classifier4J and Lucene

2005-10-23 Thread Chris Hostetter
: Not sure if this makes sense...but curious if anyone has ideas, or has : done something like this. I have a few ideas, none of which are mutually exclusive... 1) look at the Explain output for the various queries you are generating to help you understand why your boosts aren't having as much

Re: Classifier4J and Lucene

2005-10-23 Thread Jeff Rodenburg
Sounds like you might have to consider both, if the first one doesn't solve your issue. A company field sounds like it's a single entry, i.e. one that can't be "spammed up" with multiple terms, i.e. "Oracle Oracle Oracle". It also sounds as if you're searching multiple fields, and that some fields

Classifier4J and Lucene

2005-10-23 Thread msftblows
Hey- I have an indexer at my company that I wrote a while back that indexes database content (users and their profile)...one of the next req. of the project is to avoid 'spam' in hits. For example if I do a search for oracle, and oracle is in 25 places in someone's bio field...and another person
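
To make the 'spam' concern in the original post concrete, here is a rough sketch in plain Java (no Lucene dependency) of how a term-frequency factor treats 25 repeats of a term versus a single mention. It assumes the square-root tf factor that, as far as I know, Lucene's DefaultSimilarity used in releases of this era. The class name and the `flatTf` alternative are hypothetical, shown only for comparison, and are not something proposed in the thread.

```java
public class TfDampingSketch {

    // Square-root term-frequency factor (assumed to match the default
    // similarity's tf factor in Lucene releases of this period).
    static float sqrtTf(float freq) {
        return (float) Math.sqrt(freq);
    }

    // Hypothetical alternative: any non-zero number of matches scores the
    // same, so repeating a term gives no extra tf weight.
    static float flatTf(float freq) {
        return freq > 0 ? 1.0f : 0.0f;
    }

    public static void main(String[] args) {
        float spammyBio = 25f; // "oracle" appears 25 times in one bio field
        float plainBio = 1f;   // "oracle" appears once in another

        System.out.println("sqrt tf: " + sqrtTf(spammyBio) + " vs " + sqrtTf(plainBio));
        System.out.println("flat tf: " + flatTf(spammyBio) + " vs " + flatTf(plainBio));
    }
}
```

Under the square-root factor the repeated term still gets five times the tf weight of the single mention. Under the flat variant both get the same weight, leaving other scoring factors such as length normalization and boosts to separate the two documents.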