Re: Generating Query

2009-08-12 Thread AHMET ARSLAN
> thanks for the suggestion, but unfortunately it does not > work. What are you trying to do? Both Adriano's query and mine satisfy what you were asking for. What didn't work? - To unsubscribe, e-mail: java-user-unsubsc

Lucene Vs Sphinx benchmarking for large dataset

2009-08-12 Thread Anshum
I just happened to benchmark a slightly modified version of Lucene against a slightly modified version of Sphinx :) I have posted my results here: http://ai-cafe.blogspot.com I will also be posting more updates on the blog in a while! -- Anshum Gupta Naukri Labs! http://ai-cafe.blogspot.com The facts expressed here

Re: term query boost problem

2009-08-12 Thread Christian Bongiorno
Turns out the index is being built with lower-case terms, which is why we aren't getting hits the way we expect. When I change my search terms to lower case I see more of what I expect. Gonna keep working on this and post updates. On Wed, Aug 12, 2009 at 12:46 PM, Christian Bongiorno < christ...@bongio
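
A minimal sketch of the workaround described above, assuming the index was built by an analyzer that lower-cases terms. A TermQuery is not passed through an analyzer, so the search term has to be normalized by hand before the query is built. The class and method names are hypothetical and only illustrate the normalization step; no Lucene calls are shown.

```java
import java.util.Locale;

// Hypothetical helper, not part of Lucene: normalizes a raw search term
// so that it matches an index whose terms were lower-cased at index time.
final class TermNormalizer {

    // Trim surrounding whitespace and lower-case with a fixed locale,
    // so the result does not depend on the JVM's default locale.
    static String normalize(String rawTerm) {
        return rawTerm.trim().toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // The normalized string is what would be passed to new Term(field, text).
        System.out.println(normalize("Leather"));
    }
}
```

If the index uses an analyzer that does more than lower-casing (stemming, stop words), this step alone would not be enough; running the search text through the same analyzer is the more general fix.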

Re: Generating Query

2009-08-12 Thread bourne71
Thanks for the suggestion, but unfortunately it does not work >< Ahmet Arslan wrote: > >> I am trying to build a query that looks like the >> following: >> url:(+news +politics)^1.5 content:(+news +politics)^2.0 >> >> But I can't seem to find any reference to it. I tried >> hardcoding it like t

Complexity of PhraseQuery slop?

2009-08-12 Thread Jason Rutherglen
In trying to calculate the cost of various slop settings for phrase queries, what's the time complexity? O(n) or O(n^2)?

Re: term query boost problem

2009-08-12 Thread Grant Ingersoll
You have a bunch of log statements in there; what are they printing out? Also, IndexSearcher.explain() is your friend for understanding why a doc matched the way it did. On Aug 12, 2009, at 3:46 PM, Christian Bongiorno wrote: I have a situation where I have a series of term queries as par

Re: Contribute to Lucene

2009-08-12 Thread Chris Hostetter
: that you use. Also, we are nearing 2.9 release, so it would : be great to have as many eyeballs as possible on that. There is a lot of new : goodness coming in 2.9, so it will be particularly important to get good : feedback, as this will help set the stage ...even for people that might be re

term query boost problem

2009-08-12 Thread Christian Bongiorno
I have a situation where I have a series of term queries as part of a BooleanQuery. Example: term: 'sole type' -> leather BooleanClause.Occur.SHOULD term: 'title' -> 'Men's Golf shoes' BooleanClause.Occur.SHOULD ... But some terms are incredibly powerful indicators of a match term: 'band t

Re: How to tune Analyzer for Text Extraction

2009-08-12 Thread xs2Abhishek
Hi, Well, you completely understood my problem. The point you mentioned about how much to extract after the word Location is something I'll have to figure out. So let's say that the input to my system would be: " Location : Montvale, NJ Duration : 7 months " Now the problem is when the in

Re: How to tune Analyzer for Text Extraction

2009-08-12 Thread xs2Abhishek
Hi, Thanks for your replies, they really helped me a lot. Thanks & Regards, Abhishek -- View this message in context: http://www.nabble.com/How-to-tune-Analyzer-for-Text-Extraction-tp24926082p24938899.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. ---

Fwd: Sign up for ApacheCon US by 14 August and save up to $500!

2009-08-12 Thread Grant Ingersoll
Forwarding the ApacheCon announcement. Also note we have a lot of Lucene ecosystem talks and a meetup scheduled, as well as training on both Lucene and Solr, so I hope you will join us. Cheers, Grant Begin forwarded message: From: Sally Khudairi Date: August 7, 2009 9:55:10 PM EDT To: an

Re: How to tune Analyzer for Text Extraction

2009-08-12 Thread Julien Nioche
Hi, you should also have a look at GATE (http://gate.ac.uk), which comes with a NER application called ANNIE. You could use it to analyse your docs before indexing them with Lucene or Solr. As Grant mentioned, UIMA can also be used for that, as there are a number of NER annotators available for it

Re: How to tune Analyzer for Text Extraction

2009-08-12 Thread Grant Ingersoll
On Aug 11, 2009, at 5:27 PM, xs2Abhishek wrote: Hi, I am trying to make a decision on whether or not I can use Lucene for my requirements, which mainly include data tagging. I have to be able to parse or index a .txt file and then be able to extract text accordingly. For example, if the inpu

Re: Contribute to Lucene

2009-08-12 Thread Grant Ingersoll
On Aug 12, 2009, at 5:51 AM, Amin Mohammed-Coleman wrote: Hi I would like to contribute/help in the development of Lucene and I'm not sure where to start. I understand Lucene is a mature project with some really great contributors and I was wondering whether you can advise me on where

Fwd: Contribute to Lucene

2009-08-12 Thread Amin Mohammed-Coleman
Hi I would like to contribute/help in the development of Lucene and I'm not sure where to start. I understand Lucene is a mature project with some really great contributors and I was wondering whether you can advise me on where to start. I would be happy to do any documentation, fix bugs or anyt

Re: Generating Query

2009-08-12 Thread AHMET ARSLAN
> I am trying to build a query that looks like the > following: > url:(+news +politics)^1.5 content:(+news +politics)^2.0 > > But I can't seem to find any reference to it. I tried > hardcoding it like the > following: > BooleanQuery query = new BooleanQuery(); > query.add(new TermQuery(new Term(fie

Re: Generating Query

2009-08-12 Thread Adriano Crestani
Try this:
BooleanQuery b1 = new BooleanQuery();
b1.add(new TermQuery(new Term("url", "news")), BooleanClause.Occur.MUST);
b1.add(new TermQuery(new Term("url", "politics")), BooleanClause.Occur.MUST);
b1.setBoost(1.5f);
BooleanQuery b2 = new BooleanQuery();
b2.add(n

Generating Query

2009-08-12 Thread bourne71
Hi, I am trying to build a query that looks like the following: url:(+news +politics)^1.5 content:(+news +politics)^2.0 But I can't seem to find any reference to it. I tried hardcoding it like the following: BooleanQuery query = new BooleanQuery(); query.add(new TermQuery(new Term(field, "+news +
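
For reference, the target syntax in the question can be assembled as a plain string, one boosted group per field. This is only a sketch: the class and method names are hypothetical, the terms are not escaped, and handing the resulting string to a query parser is assumed but not shown. Note that "+news +politics" is query syntax for two separate required terms, not the text of a single term.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;

// Hypothetical helper, not part of Lucene: builds query-syntax strings
// of the form field:(+term1 +term2)^boost.
final class BoostedQueryString {

    // Builds one boosted group in which every term is marked as required with "+".
    static String clause(String field, List<String> terms, float boost) {
        String required = terms.stream()
                .map(term -> "+" + term)
                .collect(Collectors.joining(" "));
        // Locale.ROOT keeps the decimal separator a dot regardless of the default locale.
        return String.format(Locale.ROOT, "%s:(%s)^%.1f", field, required, boost);
    }

    public static void main(String[] args) {
        List<String> terms = Arrays.asList("news", "politics");
        String query = clause("url", terms, 1.5f) + " " + clause("content", terms, 2.0f);
        // prints: url:(+news +politics)^1.5 content:(+news +politics)^2.0
        System.out.println(query);
    }
}
```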

Re: Query Boosting

2009-08-12 Thread AHMET ARSLAN
> thanks, I understand how boosting works, what I need will > be a boost in the query that will increase the score of a page if all > keywords/query is found in the page to increase its ranking. You can find the answer to your question in the last two messages of this thread: http://www.nabble.com/G