Hello,
Im using Lucene 3.6.2. and my file, which i indexed ,is something like
this :
FIELD-1 FIELD-2 FIELD-3 FIELD-4
DOC1: A ABC
DOC2: B ABC
DOC3: C ABC
DOC4: D ABC
DOC5: E WWW ABC
DOC6: F DEF
DOC7: G DEF
DOC8: H D
Lucene is a library, it depends on your implementation how to keep index files
open. Yes, Lucene supports mmap.
If you want to run Lucene as a separate server, use Apache Solr or
ElasticSeach, which runs the indexing software as a separate process,
accessible via REST API.
Uwe
-
Uwe Schin
In Unicorn (Facebook's search backend), we used mmap'd indices. We could
load them on a separate process - which meant that we could make scoring
changes and test rapidly since we did not have to reload the index for
every run. Is this true for Lucene also? I'm assuming it would be if the
entire
You may also be interested in this talk @ BerlinBuzzwords2013:
http://intrafind.de/tl_files/documents/INTRAFIND_BerlinBuzzwords2013_The-Typed-Index.pdf
Unfortunately the slides are not available.
Uwe
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@the
Hi Michael,
Instead of putting the annotation in Payloads, why not put them in as
"synonyms", ie at the same spot as the original string (see SynonymFilter in
the LIA book). So your string would look like (to the index):
W. A. Mozart was born in Salzburg
artist city
so you ca
Eventualy, I have choosen yet another solution. I work with those
"payloads" as with synonyms. In my TokenFilter with every occurence of
token with "payload", I inject new term - containing this "payload" with
zeroed PossitionIncrementAttribute. It solves nearly all my issues =)
Thanks everyone fo
On Thu, Jun 20, 2013 at 7:09 PM, Tom Burton-West wrote:
> Sine my objective is to undersdand BlockGroupingCollector, I thought I
> would extract a test for BlockGroupingCollector, maybe in another file as
> Robert suggested.
I think that makes sense!
> I was looking at testRandom because that is