Hello,
I just got this working in three or four steps:
1. go to http://www.apache.org/dyn/closer.cgi/lucene/java/
2. click on any of the mirrors and download "lucene-2.0.0.zip"
3. unzip into preferred directory (step not shown), then use jar to
look at snowball items:
jar tvf /opt/lucene-2.0.
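For anyone who would rather do the same check from Java than from the `jar` command line, here is a minimal sketch that lists the archive entries whose names mention "snowball". The class name `JarEntryFilter` and the helper `entriesContaining` are made up for illustration, and the archive path is passed in by the caller because the path in the command above is cut off in the archive.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;

public class JarEntryFilter {

    // Returns the names of all entries in the archive that contain the
    // keyword, compared case-insensitively (hypothetical helper).
    static List<String> entriesContaining(String archivePath, String keyword) throws IOException {
        List<String> matches = new ArrayList<>();
        String needle = keyword.toLowerCase();
        try (JarFile jar = new JarFile(archivePath)) {
            Enumeration<JarEntry> entries = jar.entries();
            while (entries.hasMoreElements()) {
                String name = entries.nextElement().getName();
                if (name.toLowerCase().contains(needle)) {
                    matches.add(name);
                }
            }
        }
        return matches;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java JarEntryFilter <path-to-jar-or-zip>");
            return;
        }
        // args[0] is whatever archive you unpacked or downloaded.
        for (String name : entriesContaining(args[0], "snowball")) {
            System.out.println(name);
        }
    }
}
```

Unlike `jar tvf`, this prints only the entry names, not sizes or timestamps.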
I want to again thank Grant for recently taking a dedicated lead
effort on the documentation revamping. I haven't yet built a recent
archive from trunk to see what its out-of-the-box experience is
like, but I'm guessing that at least the issue mentioned below has
been alleviated (?).
It was very nice to see Mark's message about LIA, but I want to reply
and second Eric's comments about the Lucene distributable. I just
downloaded and unpacked the 2.0 .zip to test the experience of someone
new to Lucene but Java-savvy (our target audience).
I opened docs/index.html and the first
Hi All,
Any pointers in this direction?
Thanks in advance.
Kapil
Kapil Chhabra wrote:
Just to mention, I have tokenized FIELD2 on "," and indexed it.
FIELD2:3 should return 1,2
FIELD2:(FIELD2:3) should return something like the output of:
FIELD2:1 OR FIELD2:2
Regards,
Kapil Chhabra
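One way to read the request above is as a two-pass lookup: first collect the FIELD1 values of the rows whose FIELD2 contains 3, then use those values as the FIELD2 terms of a second lookup. The sketch below only illustrates that logic with plain Java collections; it does not use the Lucene API, and the names `NestedLookupSketch` and `field1Where` are invented for the example. The sample rows are the ones posted in this thread, except the row for FIELD1 = 9, which is cut off in the archive and is left out.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class NestedLookupSketch {

    // FIELD1 -> tokens of FIELD2 after splitting on ",".
    // The FIELD1 = 9 row is truncated in the archive, so it is omitted here.
    static final Map<Integer, List<Integer>> ROWS = new LinkedHashMap<>();
    static {
        ROWS.put(1, List.of(2, 3, 4, 6));
        ROWS.put(2, List.of(3, 1, 5, 7));
        ROWS.put(3, List.of(1, 2));
        ROWS.put(4, List.of(1, 8, 10));
        ROWS.put(5, List.of(2, 9));
        ROWS.put(6, List.of(1));
        ROWS.put(7, List.of(2, 9));
        ROWS.put(8, List.of(4, 9));
    }

    // Returns the FIELD1 values of every row whose FIELD2 contains at least
    // one of the wanted tokens (the equivalent of OR-ing the terms together).
    static Set<Integer> field1Where(Set<Integer> wanted) {
        Set<Integer> result = new TreeSet<>();
        for (Map.Entry<Integer, List<Integer>> row : ROWS.entrySet()) {
            for (Integer token : row.getValue()) {
                if (wanted.contains(token)) {
                    result.add(row.getKey());
                    break;
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Set<Integer> inner = field1Where(Set.of(3)); // FIELD2:3
        Set<Integer> outer = field1Where(inner);     // FIELD2:1 OR FIELD2:2
        System.out.println("FIELD2:3 -> " + inner);           // prints [1, 2]
        System.out.println("FIELD2:(FIELD2:3) -> " + outer);  // prints [1, 2, 3, 4, 5, 6, 7]
    }
}
```

Against a real index the same idea would presumably mean running the inner query first and building the outer OR query from its hits, since the second pass depends on the results of the first.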
Hi Eric,
Take a look at http://lucene.apache.org/java/docs/releases.html
Clicking through to the appropriate mirror, download/unzip
lucene-2.0.0.zip. In it you'll find the jars, contrib, README, demo,
etc. It really should contain everything you need outside of the
actual Lucene source
Did you look in contrib/snowball? It should have been unpacked along with
everything else when you unpacked Lucene 2.0; it's named something like
lucene-snowball-2.0.0.jar.
That said, I've also had to poke around to find things, and I still don't
understand large portions of what Lucene does for
You sound like a man that wants a manual. Lucene In Action is the way to
go. It targets 1.4 instead of 2.0, so I'm sure you will still have
complaints, but if you want to get up to speed on Lucene fast, LIA is
the way to go. LIA will give you the overview and important classes, and
then the doc
>>> Grant Ingersoll 12/26/2006 2:43 PM >>>
>I'm sorry you are not finding what you need. The snowball analyzers
>come in a separate jar, in the release zip, under the contrib/
>snowball directory. You may also want/need the analyzers in contrib/
>analyzers for other languages. The README de
Hi Eric,
I'm sorry you are not finding what you need. The snowball analyzers
come in a separate jar, in the release zip, under the contrib/
snowball directory. You may also want/need the analyzers in contrib/
analyzers for other languages. The README delivered w/ the release
probably sh
I've been having a hard time finding any kind of reasonable
documentation on Lucene. It seems that the javadocs are mostly empty,
and the stuff on the wiki really doesn't explain anything. Is there a
better place to be looking?
So far, I've managed to get some basic stuff working. Now I'm try
Suman Ghosh wrote:
Andrzej,
I have been trying to solve a similar problem where I need to boost
score based on the document type. Your approach is very interesting
and I want to give it a try.
I have an implementation-specific question. When you mention to put as
many "1" as the boost need to be
Hi Alex,
This seems like a pretty small setup, so I think we may need some
more info. I don't know a lot about EJBs, but what I wonder is if
your bean is opening an IndexSearcher for every connection. Perhaps
you could describe your architecture a little bit more and that will
help us
First, it probably would have been a good thing to start a new thread on
this topic, since it's only vaguely related to disk space ...
That said, sure. Note that there's no requirement in Lucene that all
documents in an index have the same fields. Also, there's no reason you
can't use two separat
Hi,
I have another related problem. I am adding news articles for a company
to the Lucene index. As of now, if an article is mapped to more than
one company, it is added to the index once per company. As the number of
companies mapped to each article increases, this will not be a scalable
impl
Harini Raghavan wrote:
Yes, I think I hit an IOException. I assumed that the .tmp files are not
required and deleted them manually from the index directory, as they were
more than 10G. Is that ok?
Yes, they are indeed not necessary so deleting them is fine. This
(deleting partially created file
Yes, I think I hit an IOException. I assumed that the .tmp files are not
required and deleted them manually from the index directory, as they were
more than 10G. Is that ok?
Michael McCandless wrote:
Harini Raghavan wrote:
Thank you for the response. I don't have readers open on the index,
bu
Hi,
Please see the following data-structure
+--------+-----------+
| FIELD1 | FIELD2    |
+--------+-----------+
| 1      | 2,3,4,6,  |
| 2      | 3,1,5,7,  |
| 3      | 1,2,      |
| 4      | 1,8,10,   |
| 5      | 2,9,      |
| 6      | 1,        |
| 7      | 2,9,      |
| 8      | 4,9,      |
| 9      |