Hi John,
I heard of many users who used Lucene for this use-case, it's
definitely a valid one. Indexes are stored mostly on disk, with a tiny
part of them being held in memory to guarantee good access speed.
Lucene supports both inverted indexes and KD trees up to 8 dimensions.
Lookup, sorting an
Greetings;
I'd like to play around with Lucene to offload some of my database lookups.
Is this a valid use of Lucene in your opinion(s)?
Indexes - they are stored on the file system as some kind of tree (I'm
guessing)?
Lookups and sorting - Can I lookup by date and sort asc/desc and paginate?
You can provide your own Similarity implementation, overriding
whichever of the methods you need in order to achieve your aims. Use
it via the setxxx methods mentioned in the javadocs and unless you
deliberately sort by some other field everything should fall into
place.
--
Ian.
2011/11/9 强继朋
lucene,
I hava a problem i don't know how to do, it's about Score Formula of
lucene. In the package of lucene, it provide a method in Class Similarity. My
question : if i want to only use some factors of Formula, such as TF and IDF.
And then i add some additional factors, in aim to
Thanks Steve. Helpful this slide. Greetings.
- Mensaje original -
De: "Steven A Rowe"
Para: java-user@lucene.apache.org
Enviados: Domingo, 10 de Julio 2011 21:45:48 (GMT-0500) Auto-Detected
Asunto: RE: Some question about Lucene
This slide show is a few years old, but
[mailto:yhdelg...@uci.cu]
Sent: Sunday, July 10, 2011 9:30 PM
To: java-user@lucene.apache.org
Subject: Some question about Lucene
Hello
I'm a new Lucene user. I have the following question: is posible to build a
crawler/spider with Lucene library or Lucene is only for index/search phases.
Hello
I'm a new Lucene user. I have the following question: is posible to build a
crawler/spider with Lucene library or Lucene is only for index/search phases. I
am studying three project: Nutch, Lucene and Solr but I don't see what is the
main difference between them.
Greetings .
--
Simon, no problem. I am looking at it now. I will just post my
approach and let people tear it apart / get things moving :)
On Fri, Jul 31, 2009 at 2:45 PM, Simon
Willnauer wrote:
> @Michael: add yourself as a Watcher for the issue.
> @Robert: I can start working on this within the next weeks - ca
@Michael: add yourself as a Watcher for the issue.
@Robert: I can start working on this within the next weeks - can you help too?
simon
On Fri, Jul 31, 2009 at 7:49 PM, Robert Muir wrote:
> Michael, makes sense. most of the issues probably have some
> workaround, so reply back if you need.
>
> Th
Michael, makes sense. most of the issues probably have some
workaround, so reply back if you need.
Thanks for your feedback though, it is helpful to know that its important!
On Fri, Jul 31, 2009 at 1:36 PM, Michael Thomsen wrote:
> Not really. At this point, I just needed to know where the UCS4
>
Not really. At this point, I just needed to know where the UCS4
support stands. I'm reasonably familiar with the various analyzers and
what they can do. It's just the state of UCS4 support that might be an
issue for us.
Thanks,
Mike
On Fri, Jul 31, 2009 at 12:25 PM, Robert Muir wrote:
> Michael
Michael just out of curiousity, did you have a particular Analyzer in
mind you were planning on using, or rather certain features in Lucene
you were concerned would work with these codepoints?
On Fri, Jul 31, 2009 at 12:19 PM, Simon
Willnauer wrote:
> Hey Robert, good to see that you found the lin
Hey Robert, good to see that you found the link :)
On Fri, Jul 31, 2009 at 6:06 PM, Robert Muir wrote:
> Michael, as Simon mentioned I created an issue describing where you
> might run into trouble, at least in lucene core.
>
> The low-level lucene stuff, it treats these just fine (as surrogate pa
Michael, as Simon mentioned I created an issue describing where you
might run into trouble, at least in lucene core.
The low-level lucene stuff, it treats these just fine (as surrogate pairs).
But most analyzers run into some trouble. (things like
WhitespaceAnalyzer are ok)
Also wildcard queries
Thanks for your quick response!
Mike
On Fri, Jul 31, 2009 at 10:25 AM, Simon
Willnauer wrote:
> If I understand you correctly you are asking if lucene can deal with
> encodings that use more than 16 bit. Well yes and no but mainly no.
> The support for unicode 4.0 was introduced in Java 1.5 and l
If I understand you correctly you are asking if lucene can deal with
encodings that use more than 16 bit. Well yes and no but mainly no.
The support for unicode 4.0 was introduced in Java 1.5 and lucene core
has still back-compat requirements for java 1.4. Lucene's analyzers
make use of char[] all
Is Lucene capable of handling UCS4 data natively?
Thanks,
Mike
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
Thanks Michael for your answer :)
Actually writer.addIndexesNoOptimize method can not help us because our
aim is to split indexes rather than to merge them. But you information
about setting autoCommit=true is very helpful for us because so we will
avoid sharing of stored fields and will be ab
Ivan Vasilev wrote:
Hi Lucene Guys,
As I see in the Lucene web site in file formats page the version
2.3 will have some changes in file formats that are very important
for us. First I will say what we do and then will ask my questions.
We distribute the index on some machines. The impleme
Hi Lucene Guys,
As I see in the Lucene web site in file formats page the version 2.3
will have some changes in file formats that are very important for us.
First I will say what we do and then will ask my questions.
We distribute the index on some machines. The implementation is made so
that
Can you provide a self contained test or at least some code for this?
On Jul 19, 2007, at 5:32 PM, Mark Miller wrote:
Hopefully someone will be able to give you some further insight
into this. To me, it looks like a corrupted index. If TermVectors
where not stored, at worst you should be see
Hopefully someone will be able to give you some further insight into
this. To me, it looks like a corrupted index. If TermVectors where not
stored, at worst you should be seeing a NullPointerException. Has this
index had anything interesting happen to it? Made with an older version
of Lucene, u
Hi all, I use query (+body:12) (+title:12) , but I got some wrong
message bellow:
java.io.IOException: read past EOF
at
org.apache.lucene.store.BufferedIndexInput.refill(BufferedIndexInput.java:137)
at
org.apache.lucene.store.BufferedIndexInput.readByte(BufferedIndexInput.java:38)
Thanks. Do you know about any existing application that is
built on top of lucene that provides this functionality?
Tanya
-Original Message-
From: Erick Erickson [mailto:[EMAIL PROTECTED]
Sent: Friday, June 01, 2007 7:18 AM
To: java-user@lucene.apache.org
Subject: Re: question about luce
e
From: Will Johnson <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Friday, 1 June, 2007 2:02:17 PM
Subject: RE: question about lucene
Solr, which is built on top of lucene and adds highlighting among other
features, gets close to what you want. Check out:
http://wiki.apache.
: java-user@lucene.apache.org
Subject: Re: question about lucene
Nope. But here's what I think you can do (although I haven't
tried this exactly, so caveat emptor).
Document doc = new Document();
doc.add("text", line1);
doc.add("text", line2);
doc.add("text&qu
07, Tanya Levshina <[EMAIL PROTECTED]> wrote:
Wow, it was fast! Thanks. Do you know about any existing application that
is
built on top of lucene that provides this functionality?
Tanya
-Original Message-
From: Erick Erickson [mailto:[EMAIL PROTECTED]
Sent: Friday, June 01, 2007 7
: question about lucene
No. Lucene is an *engine*, not an app that has a lot of stuff built on top
of it out of the box.
You have to index enough information to figure this out somehow.
Best
Erick
On 6/1/07, Tanya Levshina <[EMAIL PROTECTED]> wrote:
>
> Hi,
>
>
>
> I
No. Lucene is an *engine*, not an app that has a lot of stuff built on top
of it out of the box.
You have to index enough information to figure this out somehow.
Best
Erick
On 6/1/07, Tanya Levshina <[EMAIL PROTECTED]> wrote:
Hi,
I've just downloaded Lucene, tried demo and looked at the do
Hi,
I've just downloaded Lucene, tried demo and looked at the documentation. The
Indexing and Searching work great and fast but I also need to display all
the actual "hits": the lines from the files that match a particular query.
Does Lucene provide means to do it?
Thanks a lot,
Tanya
27;t support DB2 database?
Regards,
lilyyan
>From: "Chris Lu" <[EMAIL PROTECTED]>
>Reply-To: java-user@lucene.apache.org
>To: java-user@lucene.apache.org
>Subject: Re: ask for a question about Lucene
>Date: Thu, 5 Oct 2006 22:02:06 -0700
>
>You can use D
e: ask for a question about Lucene
Date: Thu, 5 Oct 2006 20:00:07 -0400
My gut feel is that, with 150 records, using Lucene is overkill. This
assumes that your database already exists. You'd have to extract the data
from the DB, store it in a lucene index, then worry about keeping them
synchron
ser@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Re: ask for a question about Lucene
Date: Thu, 5 Oct 2006 22:02:06 -0700
You can use DBSight. It's free for your data. And you just need to
follow this example you will know how to use it, no java coding
needed. And you can schedul
You can use DBSight. It's free for your data. And you just need to
follow this example you will know how to use it, no java coding
needed. And you can schedule jobs to synchronize with the database.
http://wiki.dbsight.com/index.php?title=Step_by_step
The website is:
http://www.dbsight.net
Chri
My gut feel is that, with 150 records, using Lucene is overkill. This
assumes that your database already exists. You'd have to extract the data
from the DB, store it in a lucene index, then worry about keeping them
synchronized.
I'd suggest, though, that the fastest way to satisfy yourself about
Hello ALL,
i'm new to Lucene and wandering where i can start from Lucene? : )
basically my application is: when user input some keywords (can be more than
one words) within an academic research site, the output will be the
researchers' academic interests.
there are will be a DB2 database tha
36 matches
Mail list logo