Hello Xia,
XD> what's the difference from dotLucene?
Why dotLucene? dotLucene is the .NET port of Lucene, so your question
is pretty much the same as 'what's the difference from Lucene?'
dotLucene, like Lucene itself, is not a search application; it's a
library. So that's the difference :).
Some of
Add a field that stores the time at which each document was added to the index.
2006/3/17, hu andy <[EMAIL PROTECTED]>:
>
> Because I will delete the indexed documents periodically, the index
> files
> must be deleted after that. If I just want to delete some documents added
> before some past day from the index, how should I
what's the difference from dotLucene?
2006/3/18, Artem Vasiliev <[EMAIL PROTECTED]>:
>
> Hi guys!
>
> I'd like to thank the developers and contributors of Lucene project
> for the fantastic library. And thanks Otis and Erik for a great book!
>
> I'm writing an open source file searcher application
On 17 Mar 2006, at 21:01, karl wettin wrote:
On 17 Mar 2006, at 16:36, Java Programmer wrote:
My problem concerns result grouping, the best example will be
Google search
where you have results sorted by relevance, and also grouped by
domain (they
have little indent/margin). In my project I want
Hi!
This is my first post to Lucene ML, so please excuse the following message
if completely wrong :-). We are trying to upgrade Jackrabbit to support
Lucene 1.9.1. First, we needed to change the access modifier of
a method from protected to public, but this was not a problem. The seco
Hi guys!
I'd like to thank the developers and contributors of Lucene project
for the fantastic library. And thanks Otis and Erik for a great book!
I'm writing an open source file searcher application 'sharehound'
(http://sharehound.sourceforge.net/) based on Lucene. It can now
search SMB file sha
Guys, thanks for your help yesterday, I solved my problem! I was
actually using an IndexSearcher in another thread that I had forgotten
all about. Whoever suggested that IndexReader was to blame was right on
the money. I now make sure I close my Readers and, bingo, the open
files are managed ni
Hello,
Suppose that I have indexed a document with Lucene. How can I extract the most
important terms in this document by using a programming language? Does Lucene
provide an API to do that? (For example, if this document is indexed by TFIDF,
the most important terms are the terms with bigges
Michael Wechner wrote:
Maybe it would make sense to sort it alphabetically [ ... ]
+1 This should be sorted alphabetically by business name or last name.
That's what it says on the page, although a few entries are out of
place. Please feel free to fix this.
Doug
Erik Hatcher wrote:
I'm increasingly getting more and more requests for Lucene consulting
myself, and simply don't have the bandwidth to tackle most of them.
I have said "yes" a few times recently though, so don't count me out
though ;)
If you are skilled with Lucene, and interested in m
I have a need to issue a query that is typically performed on a relational
database. I may have to give up on this idea, but I thought I would ask if
there is a way to handle this type of query. Let's say my documents all have
2 fields fieldA and fieldB. Is there a query that can return hits for all
do
I'm increasingly getting more and more requests for Lucene consulting
myself, and simply don't have the bandwidth to tackle most of them.
I have said "yes" a few times recently though, so don't count me out
though ;)
If you are skilled with Lucene, and interested in making yourself
avail
On 17 Mar 2006, at 16:36, Java Programmer wrote:
My problem concerns result grouping, the best example will be
Google search
where you have results sorted by relevance, and also grouped by
domain (they
have little indent/margin). In my project I want to get similar
functionality, without very
: > 1 web server running PHP
: > 1 MySQL server (which will still be used, but not for fulltext
: > queries)
: > 1 server running Lucene
: > The Lucene index will be created out of the MySQL data.
: >
: > My question: How can I send a query from the webserver using PHP to
: > the
: > Lucene server
: Apparently there is a way of retrieving the unnormalized score from a
: Hit but I have been unable to track this down. I need to return this
: value because an external client wants to compile results from multiple
: queries itself.
Not from the Hits class itself, but the raw score is availabl
I believe the topic you are referring to is typically referred to as
"clustering" ... you may want to search for that.
I've never really looked at it, but carrot2 seems to be a favorite among
those who do result clustering.
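
For the simpler case described in the original question (hits already sorted by relevance, then grouped by domain), a minimal sketch in plain Java might look like the following. It does not use Lucene or carrot2; the GroupByHost class and its Result holder are hypothetical names made up for illustration, and the host is assumed to be recoverable from a stored URL.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class GroupByHost {
    /** Hypothetical result holder: a URL plus its relevance score. */
    public static final class Result {
        public final String url;
        public final float score;

        public Result(String url, float score) {
            this.url = url;
            this.score = score;
        }
    }

    /**
     * Groups results by host. The input is assumed to be sorted by
     * descending score already; the LinkedHashMap keeps hosts in the
     * order of their best-scoring result, and each list keeps the
     * relevance order within that host.
     */
    public static Map<String, List<Result>> group(List<Result> sorted) {
        Map<String, List<Result>> groups = new LinkedHashMap<>();
        for (Result r : sorted) {
            String host = URI.create(r.url).getHost();
            groups.computeIfAbsent(host, h -> new ArrayList<>()).add(r);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<Result> hits = new ArrayList<>();
        hits.add(new Result("http://a.example/1", 0.9f));
        hits.add(new Result("http://b.example/1", 0.8f));
        hits.add(new Result("http://a.example/2", 0.5f));

        // Print each host, with its hits indented underneath.
        for (Map.Entry<String, List<Result>> e : group(hits).entrySet()) {
            System.out.println(e.getKey());
            for (Result r : e.getValue()) {
                System.out.println("    " + r.url);
            }
        }
    }
}
```

This only regroups hits that have already been fetched, so it says nothing about how to page through a large result set efficiently.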
: Date: Fri, 17 Mar 2006 16:36:44 +0100
: From: Java Programmer <[EMAIL P
Good question. 'Top' reports the JVM at 99.9% CPU, but the individual CPUs
(top/1) don't seem to add up to 99.9. This server is actually two 8-CPU
servers whose backplanes are cabled together, so there may be some issue
here. The network load is heavy, but doesn't seem to be the bottleneck (on
the
Hi,
Apparently there is a way of retrieving the unnormalized score from a
Hit but I have been unable to track this down. I need to return this
value because an external client wants to compile results from multiple
queries itself.
Any help much appreciated.
Cheers,
Nick.
Peter Keegan wrote:
I did some additional testing with Chris's patch and mine (based on Doug's
note) vs. no patch and found that all 3 produced the same throughput - about
330 qps - over a longer period.
Was CPU utilization 100%? If not, where do you think the bottleneck now
is? Network? Or
Hello,
I tried to search for a solution myself, but without any good result, so I
want to ask the group.
My problem concerns result grouping, the best example will be Google search
where you have results sorted by relevance, and also grouped by domain (they
have little indent/margin). In my project I wan
Otis,
I'm interested in the "remote" option.
With thanks,
Adam
On 17/03/06, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
Hello,
Somebody asked me if I knew any good Lucene people who'd be interested in some work that involves a good amount of Lucene... Here is some info.
The company is in New York City
I experimented with this by using a Similarity class that returns a
constant (1) for all values and found that had no noticeable effect on query
performance.
Peter
On 12/6/05, Chris Hostetter <[EMAIL PROTECTED]> wrote:
>
>
> : I was wondering if there is a standard way to retrieve documents WITHOU
I did some additional testing with Chris's patch and mine (based on Doug's
note) vs. no patch and found that all 3 produced the same throughput - about
330 qps - over a longer period. So, there seems to be a point of diminishing
returns to adding more CPUs. The dual-core Opterons (8 CPU) still win
Interestingly, the last two consulting jobs I've had dealt with this
very issue - having user-entered terms be interpreted as a partial
string to match in any indexed term. Care must be taken to avoid the
classic TooManyClauses exception or a more insidious OutOfMemory
exception.
By using
On Mar 17, 2006, at 7:54 AM, jens bertheau wrote:
I am currently working on switching from MySQL fulltext search to
Lucene.
The indexing and searching already work pretty well. I have the
following
environment:
1 web server running PHP
1 MySQL server (which will still be used, but not for f
Thank you very much - that did the trick! :)
On 17.03.2006 at 13:51, Eric Jain wrote:
Perhaps you could subclass the QueryParser and override the
getFieldQuery method:
// Turn every field term into a prefix query, so "term" behaves like "term*".
protected Query getFieldQuery(String field, String term) {
    return new PrefixQuery(new Term(field, term));
}
--
Hi,
I am currently working on switching from MySQL fulltext search to Lucene.
The indexing and searching already work pretty well. I have the following
environment:
1 web server running PHP
1 MySQL server (which will still be used, but not for fulltext queries)
1 server running Lucene
The Lucene
Florian Hanke wrote:
I'd like to append an * (create a WildcardQuery) to each search term in
a query, such that a query that is entered as e.g. "term1 AND term2" is
modified (effectively) to "term1* AND term2*".
Parsing the search string is not very elegant (of course). I'm thinking
that overri
Hello all,
I'd like to append an * (create a WildcardQuery) to each search term
in a query, such that a query that is entered as e.g. "term1 AND
term2" is modified (effectively) to "term1* AND term2*".
Parsing the search string is not very elegant (of course). I'm
thinking that overriding Q
Thanks for all the replies to my previous posting. I
was not subscribed to the list properly and did not
see all of the replies. Please disregard this post.
Thanks again,
Paul
--- Paul Lynch <[EMAIL PROTECTED]> wrote:
> Hi,
>
> have implemented the DistanceComparatorSource
> example
> from Luce
My client is looking for a Lucene expert to assist with a migration from
Verity to Lucene
(up to six weeks' work, starting this coming Monday, 20 March). This can be
based in New York or London.
The ideal candidate will have had large scale application migration
experience of migrating f
What version of Lucene are you using? Are you removing the index
completely and rebuilding it from scratch with the compound flag
enabled (by default since 1.4)? You really shouldn't have massive
numbers of files created when using the compound format, so I suspect
something is fishy wi
On Mar 17, 2006, at 6:15 AM, <[EMAIL PROTECTED]>
<[EMAIL PROTECTED]> wrote:
Thanks to everyone for the explanation. Given that RangeQuery is
clearly
unsuitable for our requirements, ConstantScoreRangeQuery looks ideal.
However, we're building our queries (at the moment) using QueryParser.
I
Thanks to everyone for the explanation. Given that RangeQuery is clearly
unsuitable for our requirements, ConstantScoreRangeQuery looks ideal.
However, we're building our queries (at the moment) using QueryParser.
Is there any way we can get QueryParser to use a ConstantScoreRangeQuery
instead of
Hi Paul,
Then, if no comparator is found in the cache, a new one is created (line
> 193) and then stored in the cache (line 202). HOWEVER, both the cache
> lookup() and store() do NOT take into account locale; if we, on the same
> index reader, try to do one search sorted by Locale.FRENCH and one