Solr already support de-duplication when adding new documents. You can
refer to the doc at
https://cwiki.apache.org/confluence/display/solr/De-Duplication
On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko <
vjeran.marci...@email.t-com.hr> wrote:
> Hello,
>
> I'm pretty much Lucene newb, so wonderi
Thanks Mike...
Regards
--
View this message in context:
http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297021.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubscri
Yes, you can ... see the TieredMergePolicy setters.
But again I would strongly recommend using Lucene's defaults here.
Mike McCandless
http://blog.mikemccandless.com
On Tue, Sep 20, 2016 at 5:44 PM, lukes wrote:
> Thanks a lot Mike. Can we control of how often natural merge should happen,
> or
Thanks a lot Mike. Can we control of how often natural merge should happen,
or what are the factors that define when to kick off "natural merging" ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297017.html
Sent from the Luce
Lucene takes care of merging ("natural merging") as you add documents,
commit, etc.
If your index is still going to be changing it's best to never
forceMerge and let natural merging run at its defaults.
And, yes, deleted documents are reclaimed by merging.
Mike McCandless
http://blog.mikemccand
20 September 2016, Apache Lucene™ 6.2.1 available
The Lucene PMC is pleased to announce the release of Apache Lucene 6.2.1
Apache Lucene is a high-performance, full-featured text search engine library
written entirely in Java. It is a technology suitable for nearly any
application that requires f
Hi,
In my application i am committing(indexWriter.commit() on every
single/batch of documents, but now as a result there are lots of segments
getting generated(One thing can be, i don't do commit, but just add document
to indexWriter. But now, if system crashes then uncommited documents
wouldn't
A wild shot in the dark: Are the square brackets really part of the
field name? They have never officially been supported, from the Ref
Guide:
"Field names should consist of alphanumeric or underscore characters
only and not start with a digit. This is not currently strictly
enforced, but other fi
Hi all,
I have an application that works fine with 6.0.1, but if I go to 6.1.0 or 6.2.0
then I occasionally get a corrupted index where the SegmentMerger keeps
breaking on a numeric field.
This is the exception I get:
... (stack of application code) ...
Caused by: java.lang.IllegalArgumentExce
Hi there,
I'm migrating an application from Lucene 4.7.0 to Lucene 6.0.1.
I'm facing a problem with this piece of code :
public List getDistinctValues(IndexReader reader, EventField field)
throws IOException {
List values = new ArrayList();
Fields fields = MultiFields.getFields(reader);
Hmm I don't think Lucene's facets make that easy today, but it would
maybe be minor code change: in IntTaxonomyFacets.getTopChildren where
it checks if values[ord] > 0 you just need > yourMinCount instead.
Maybe open an issue?
Mike McCandless
http://blog.mikemccandless.com
On Mon, Sep 19, 2016 a
11 matches
Mail list logo