> thanks for the suggestion, but unfortunately it does not
> work.
What are you trying to do? Both Adriano's and my query satisfies what you were
asking for. What didn't work?
-
To unsubscribe, e-mail: java-user-unsubsc
I just happened to benchmark a little modified version of lucene with a
little modified version of sphinx :)
Have posted my results here
http://ai-cafe.blogspot.com
Would also be updating more @ the blog in a while!
--
Anshum Gupta
Naukri Labs!
http://ai-cafe.blogspot.com
The facts expressed here
turns out the index is being built with lower-case terms which is why we
aren't getting hits the way we expect. When I change my search terms to
lower I see more of what I expect.
Gonna keep working on this and post updates.
On Wed, Aug 12, 2009 at 12:46 PM, Christian Bongiorno <
christ...@bongio
thanks for the suggestion, but unfortunately it does not work ><
Ahmet Arslan wrote:
>
>> I am trying to build a query that looks like the
>> following:
>> url:(+news +politics)^1.5 content:(+news +politics)^2.0
>>
>> But I can't seems to find any reference to it. I try
>> hardcoding it like t
In trying to calculate the cost of various slop settings for phrase
queries, what's the time complexity? O(n) or O(n^2)?
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user
You have a bunch of log statements in there, what are they printing out?
Also, IndexSearcher.explain() is your friend for understanding why a
doc matched the way it did.
On Aug 12, 2009, at 3:46 PM, Christian Bongiorno wrote:
I have a situation where I have a series of terms queries as par
: that you use. Also, we are nearing 2.9 release, so it would
: be great to have as many eyeballs as possible on that. There is a lot of new
: goodness coming in 2.9, so it will be particularly important to get good
: feedback, as this will help set the stage
...even for people that might be re
I have a situation where I have a series of terms queries as part of a
BooleanQuery.
example:
term: 'sole type' -> leather
BooleanClause.SHOULD_OCCURR
term: 'title' -> 'Men's Golf shoes'
BooleanClause.SHOULD_OCCURR
...
But, some terms are incredibly powerful indicators of match
term: 'band t
Hi,
Well you completely understood my problem:wistle:, the point you mentioned
about how much to extract after the word Location is something i'll have to
figure out. So lets say that the input to my system would be:-
"
Location : Montvale, NJ
Duration : 7 months
"
Now the problem is when the in
Hi,
Thanks for your replies, it really helped me a lot.
Thanks&Regards,
Abhishek
--
View this message in context:
http://www.nabble.com/How-to-tune-Analyzer-for-Text-Extraction-tp24926082p24938899.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
---
Forwarding the ApacheCon announcement. Also note we have a lot of
Lucene ecosystem talks and a meetup scheduled, as well as training on
both Lucene and Solr, so I hope you will join us.
Cheers,
Grant
Begin forwarded message:
From: Sally Khudairi
Date: August 7, 2009 9:55:10 PM EDT
To: an
Hi,
you should also have a look at GATE (http://gate.ac.uk) which comes with a
NER application called ANNIE. You could use it to analyse your docs before
indexing them with Lucene or SOLR.
As Grant mentioned, UIMA can also be used for that as there are a number of
NER annotators available for it
On Aug 11, 2009, at 5:27 PM, xs2Abhishek wrote:
Hi,
I am trying to make a decision on weather or not I can use Lucene
for my
requirements, which mainly include data tagging. I have to be able
to parse
or index a .txt file and then be able to extract text accordingly.
For e.g
if the inpu
On Aug 12, 2009, at 5:51 AM, Amin Mohammed-Coleman wrote:
Hi
I would like to contribute/help in the development of Lucene and I'm
not
sure where to start. I understand Lucene is a mature project with
some
really great contributors and I was wondering whether you can advise
me on
where
Hi
I would like to contribute/help in the development of Lucene and I'm not
sure where to start. I understand Lucene is a mature project with some
really great contributors and I was wondering whether you can advise me on
where to start. I would be happy to do any documentation, fix bugs or
anyt
> I am trying to build a query that looks like the
> following:
> url:(+news +politics)^1.5 content:(+news +politics)^2.0
>
> But I can't seems to find any reference to it. I try
> hardcoding it like the
> following:
> BooleanQuery query = new BooleanQuery();
> query.add(new TermQuery(new Term(fie
Try this:
BooleanQuery b1 = new BooleanQuery();
b1.add(new TermQuery(new Term("url", "news")),
BooleanClause.Occur.MUST);
b1.add(new TermQuery(new Term("url", "politics")),
BooleanClause.Occur.MUST);
b1.setBoost(1.5f);
BooleanQuery b2 = new BooleanQuery();
b2.add(n
Hi,
I am trying to build a query that looks like the following:
url:(+news +politics)^1.5 content:(+news +politics)^2.0
But I can't seems to find any reference to it. I try hardcoding it like the
following:
BooleanQuery query = new BooleanQuery();
query.add(new TermQuery(new Term(field, "+news +
> thanks, I understand how boosting works, what I need will
> be a boost in the query that will increase the score of a page if all
> keywords/query is found in the page to increase its ranking.
You can find answer of your question in the last two messages at this thread:
http://www.nabble.com/G
19 matches
Mail list logo