red, why not index their ordinals, and
> then sort by that?
>
> On Mon, Jul 2, 2018, 6:16 AM Nicola Buso wrote:
> > Hi Uwe,
> >
> > as said the sorting is calculated elsewhere upfront and the terms
> > are
> > provided to Lucene in the order calculated (in an
> Achterdiek 19, D-28357 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
> > -Original Message-
> > From: Nicola Buso
> > Sent: Monday, June 25, 2018 5:09 PM
> > To: Uwe Schindler ; java-user@lucene.apache.org
> > Subject: Re: TermInSetQuer
eries coming first) and apply on a field without norms.
>
> TermInSetQuery is fast because it neglects scoring and is just good
> at intersecting the terms dict with the given terms set.
>
> Uwe
>
> -
> Uwe Schindler
> Achterdiek 19, D-28357 Bremen
> http://www.thet
It is probably more a sorting problem than a matter of scoring the single
document, and the order of the input terms is calculated at runtime, in case
someone is thinking about adding a sorting field at indexing time.
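
To illustrate the "sort problem" view, here is a minimal sketch in plain Java, outside Lucene. It assumes each hit can be reduced to the term it matched and that the preferred term order arrives as a list at query time. The class and method names are hypothetical and are not part of any Lucene API.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class TermOrderSort {
    /** Sorts hit terms by their position in the order computed at runtime. */
    static List<String> sortByInputOrder(List<String> orderedTerms, List<String> hitTerms) {
        // Rank of a term = index of its first occurrence in the runtime order.
        Map<String, Integer> rank = new HashMap<>();
        for (int i = 0; i < orderedTerms.size(); i++) {
            rank.putIfAbsent(orderedTerms.get(i), i);
        }
        // Terms missing from the runtime order sort last; the sort is stable.
        List<String> sorted = new ArrayList<>(hitTerms);
        sorted.sort(Comparator.comparingInt(t -> rank.getOrDefault(t, Integer.MAX_VALUE)));
        return sorted;
    }
}
```

The rank map is rebuilt per query, so nothing here depends on a sort field written at indexing time.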
Nicola
On Mon, 2018-06-25 at 12:23 +0100, Nicola Buso wrote:
> Hi,
>
> I need t
this score?
Cheers,
Nicola
--
Nicola Buso
EMBL-EBI
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
at runtime and from there start
adding my filters.
Has anyone already done something similar?
Nicola
--
Nicola Buso
EMBL-EBI
Any hint on how to calculate these values without requesting the whole
facet hierarchy and counting them?
Is there a specific point in the code where I can check for this
distinct count, and maybe modify the code?
Nicola
On Wed, 2016-07-06 at 13:42 +0100, Nicola Buso wrote:
> Hello everyone,
>
ching and
maybe have this "duplicated count" summed up in the parent nodes.
Do you have any hints on how to achieve it?
Regards,
Nicola
--
Nicola Buso
EMBL-EBI
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
> > -Original Message-
> > From: Nicola Buso [mailto:nb...@ebi.ac.uk]
> > Sent: Wednesday, August 12, 2015 10:57 AM
> > To: java-user@lucene.apache.org
> > Subject: Strang
a.nio framework should
behave differently?
We are on Redhat Enterprise 6: 2.6.32-504.8.1.el6.x86_64 #1 SMP
Nicola
--
Nicola Buso
Software Engineer - Web Production Team
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus
Hi
:
FieldType.indexOptions()
FieldInfo.getIndexOptions()
What about using the same naming convention?
Nicola
--
Nicola Buso
EMBL-EBI
extend one of the existing classes (IntTaxoFacets). If it's
> something completely different, e.g. RangeFacetCounts, you should be
> able to just extend Facets. And if it's not a "Facets" thing at all,
> i.e. you don't need its API, just write your own interfac
ends the behavior to achieve this?
Any help will be really appreciated,
Nicola.
--
Nicola Buso
Software Engineer - Web Production Team
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD
Unite
d some classes (e.g.
> > FacetSearchParams
> > or CountFacetRequest).
> >
> > Is there an updated version of that guide?
> > I tried this
> > http://lucene.apache.org/core/*4_8_1*/facet/org/apache/lucene/facet/doc-files/userguide.html
> > but it does not work :|
've
> said so far...
>
>
> Best,
> Erick
>
>
> On Fri, May 30, 2014 at 7:49 AM, Nicola Buso wrote:
> Hi,
>
> thanks Michael and Alan. It is enough to know that when re-opening the
> index
> there is no guarantee th
> > collection.
> >
> > Alan Woodward
> > www.flax.co.uk
> >
> >
> > On 30 May 2014, at 11:20, Nicola Buso wrote:
> >
> >> Hi Alan,
> >>
> >> just to make it more typical (yes there are not IndexWriters open on
> >>
ff trying to rely on this for more typical uses.
>
> Alan Woodward
> www.flax.co.uk
>
>
>
> On 30 May 2014, at 10:39, Nicola Buso wrote:
>
> > Hi Alan,
> >
> > thanks a lot for the reply.
> >
> > For what I understood from your reply i
chines, the docBase assigned to each reader context should
> be the same.
>
> Alan Woodward
> www.flax.co.uk
>
>
>
> On 29 May 2014, at 14:29, Nicola Buso wrote:
>
> > Hi,
> >
> > from the javadocs:
> >
> >
> > For efficiency,
ds in different machines (than
separated JVMs)?
nicola.
--
Nicola Buso
EMBL-EBI
lo to Jo McEntyre! :-)
> Uwe
>
> -
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
>
> > -Original Message-
> > From: Nicola Buso [mailto:nb...@ebi.ac.uk]
> > Sent: Thurs
Hi,
I was wondering if it's correct to call the method:
IndexWriter.addIndexes(...)
from different threads concurrently.
nicola.
On Thu, 2013-10-10 at 16:02 +0530, Devi pulaparti wrote:
> toString output by queryparser.parse() for query TEST && USAGE is "
> content:TEST content:\"amp amp\" content:USAGE " .
> and for query TEST AND USAGE is "+content:TEST +content:USAGE"
> any idea why is analyzer treating && as conten
res. As the counting
> collector does not request any scores, you just add a useless additional
> wrapper around the query's scorer.
>
> Uwe
>
> -
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: u...@thetaphi.de
>
>
Hello,
I was going to use the TotalHitCountCollector in cases where I'm
interested just in the number of results.
Obviously I was hoping to gain performance compared to a "scored"
query.
From my tests it seems it's not that performant compared to the "scored"
search. At this point I'm wondering i
e, but I'd need to understand better what sort of ranges you have in
> mind.
> Maybe we can discuss that on a separate thread?
>
> Shai
Nicola
>
>
> On Wed, Jul 3, 2013 at 7:34 PM, Nicola Buso wrote:
>
> > Hi Shai,
> >
> > if I'm correctly unde
ant your top-K to be computed by label too? Or first deduce
> the top-K facets, then sort them otherwise?
>
> Shai
>
>
>
> On Tue, Jul 2, 2013 at 6:36 PM, Nicola Buso wrote:
> Hi,
>
> I was looking to change the order of the facet
Hi,
I was looking to change the order of the facet results; in this case I
would like to order by the facet label instead of the facet value
(count).
An example is a facet on dates; suppose the facet is saved as
yyyy/MM/dd, I would like to obtain values for this date ordered by the
date; i.e. with d
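
A minimal sketch of ordering by label, in plain Java and independent of the Lucene facet classes. It assumes the facet values have already been collected into a label-to-count map (a hypothetical representation, not the FacetResult API) and that date labels are zero-padded in year/month/day form, so that plain string order matches date order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class FacetLabelOrder {
    /** Returns the collected facet values ordered by label rather than by count. */
    static List<Map.Entry<String, Integer>> sortByLabel(Map<String, Integer> topK) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(topK.entrySet());
        // Lexicographic order on the label; counts are carried along untouched.
        entries.sort(Map.Entry.comparingByKey());
        return entries;
    }
}
```

Note that this only reorders values that were already selected by count; choosing the top-K by label in the first place is a different computation.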
Thanks for the explanation, I'm not in this situation but it's helpful to
understand better lucene.
Nicola
Michael McCandless wrote:
>On Tue, May 14, 2013 at 10:02 AM, Nicola Buso wrote:
>
>> I know this may sound horrible/flexible/... but does this mean I can add two
>
I know this may sound horrible/flexible/... but does this mean I can add two
documents with the same field name, but different configurations, for
example different IndexOptions?
Nicola.
On Tue, 2013-05-14 at 12:52 +0100, Nicola Buso wrote:
> OK, thanks for the reply!
>
>
> Nicola.
> commit();
>
> And each AtomicReader will contain different fields. As
> getFieldInfos() documents "Get the {@link FieldInfos} describing all
> fields in this reader."
>
>
> Shai
>
>
>
> On Tue, May 14, 2013 at 1:35 PM, Nicola Buso wrote:
Hi,
I was looking for a way to obtain FieldInfo(s) from the IndexReader; we
need in some way to describe the index. Can I do this?
AtomicReader ar = indexReader.leaves().get(0).reader();
// then call
ar.getFieldInfos();
What I mean is, can I assume every AtomicReader in leaves() contains the
ess wrote:
> On Thu, May 9, 2013 at 5:53 AM, Nicola Buso wrote:
> > Hi,
> >
> > I want to hide some values for a facet, so I'm looking for a way to
> > collect facet values only for a subset of possible values.
> > Is there a way to obtain this?
> > I
Hi,
I want to hide some values for a facet, so I'm looking for a way to
collect facet values only for a subset of possible values.
Is there a way to obtain this?
If not, would extending TopK...Handler (or wrapping it) be the right way
to obtain this behaviour?
Nicola.
-
Hi all,
I think somebody already asked this question on the mailing list, but I
can't find it.
Is it possible to execute a search with facets in Luke, and then use the
$facets field in searches?
Nicola.
ot the
> > taxonomy. But you also incur search performance loss because counting a
> > certain dimension requires traversing multiple DV fields.
> >
> > To enable partitions you need to override FacetIndexingParams partition
> > size. You can try to play with
ou need to override FacetIndexingParams partition
> size. You can try to play with it.
>
> I am interested though to understand the general scenario. Perhaps this
> can be solved some other way...
>
> Shai
> On Apr 26, 2013 5:44 PM, "Nicola Buso" wrote:
>
Hi all,
I'm encountering a problem indexing a document with a large number of
values for one facet.
Caused by: java.lang.IllegalArgumentException: DocValuesField "$facets"
is too large, must be <= 32766
at
org.apache.lucene.index.BinaryDocValuesWriter.addValue(BinaryDocValuesWriter.java:5
Hi,
which version of Lucene?
Check the OrdinalPolicy you are using in FacetIndexingParams at indexing
time.
I think you should use: NonTopLevelOrdinalPolicy in lucene 3.6.1 or
OrdinalPolicy.ALL_BUT_DIMENSION in lucene 4.2.1
Nicola.
On Thu, 2013-04-25 at 08:32 +0200, Schimke, Danny wrote:
>
Hi all,
in lucene 4.1 there was this API
FacetResultNode.getResidue()
to find out whether there are more values for a facet than the top-K
returned.
Is there a similar API in 4.2? I can't find it!
Nicola.
ish all the refactoring work
> before 4.2.
>
>
> What you should do is extend the new FacetsAggregator and override
> FacetsAccumulator.getAggregator(). Can you try that and let us know if
> that resolves your problem?
>
>
> Shai
>
>
>
> On Thu, Apr 11, 2013 at 1
Hi all,
in Lucene 4.1, after some advice from the mailing list, I am merging
taxonomies (in memory because of the small size of the taxonomy indexes)
and collecting facet values from the merged taxonomy instead of the
single ones; the scenario is:
- you have a MultiReader pointing to multiple indexes
-
Hi,
I'm trying to use Lucene 4.2, but this merge of multiple taxonomy indexes
seems to no longer work.
Do you have any idea why it would not work in Lucene 4.2?
Normal faceted search on a single index is working correctly.
Nicola.
On Thu, 2013-01-24 at 16:53 +0000, Nicola Buso wrote:
>
if that's what you're looking for?
>
>
> And again, this might be something that you can do in the UI
> -- add the value that the user selected to the list of values
> that are displayed, regardless of what the top-10 are
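
The UI-side approach suggested above could be sketched as follows, in plain Java with no Lucene dependency. It assumes the UI holds the top-K labels as a list and knows which labels the user has selected; the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.LinkedHashSet;
import java.util.List;

final class FacetDisplay {
    /** Returns the top-K labels followed by any selected labels not already shown. */
    static List<String> withSelected(List<String> topLabels, Collection<String> selected) {
        // LinkedHashSet keeps the top-K order and ignores labels that are already present.
        LinkedHashSet<String> shown = new LinkedHashSet<>(topLabels);
        shown.addAll(selected);
        return new ArrayList<>(shown);
    }
}
```

This only decides which labels are displayed; the count for a selected value outside the top-K would still have to come from the search side.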
Hi,
I have a FacetRequest with numResults set to 10; how can I specify
additional facet values to add to the FacetResult?
I'll try to explain the use case:
- the user views 10 facet results
- the interface permits the user to choose a facet value not from the
top-10 results
- the user executes the qu
Hi Michael,
I'm looking into implementing a solution.
On Fri, 2013-01-25 at 16:23 -0500, Michael McCandless wrote:
> On Fri, Jan 25, 2013 at 3:48 PM, Nicola Buso wrote:
>
> > if you have experience with this use case, can you share solutions? What
> > is reusable from Lu
A/4 are not returned to the user, even though they belong to
> >> the result set, but did not make it to the top-K
> >> * Facets A/5, A/6 are not returned because they don't belong to the result
> >> set at all.
> >>
> >> If this makes sense to you, an
ur
> FR.setNumResults to Integer.MAX_VALUE. But note, it's going to be
> costly, i.e. you'd get a FacetResultNode per child of A, so depending
> how "wide" A is, this may have some impact on RAM consumption.
>
> If that's not what you meant, could you pleas
Hi all,
I'm introducing Lucene faceted search in our project and I need some
hints on how to achieve some functionality:
- I want facet filtering in OR; how can I do that?
- obtain facets for the filtered results but also for the non-filtered
ones, i.e. I have facet A with values A/V1, A/V2, A/V3 and these values
the
> > following scheme. Search replicas (machines where the index resides) build
> > FacetResults based on their index chunk (top N categories with document
> > counts). Later on the results are merged "by hand", summing relevant
> > categories from diff
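
The merge step described in this scheme can be sketched in plain Java. It assumes each replica reports its categories as a path-to-count map, which is a hypothetical stand-in for the real FacetResult objects; the class and method names are illustrative only.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class FacetCountMerge {
    /** Sums document counts per category across the per-replica results. */
    static Map<String, Integer> merge(List<Map<String, Integer>> perReplica) {
        Map<String, Integer> total = new HashMap<>();
        for (Map<String, Integer> counts : perReplica) {
            for (Map.Entry<String, Integer> e : counts.entrySet()) {
                // Add this replica's count to the running total for the category.
                total.merge(e.getKey(), e.getValue(), Integer::sum);
            }
        }
        return total;
    }
}
```

Because each replica returns only its own top N, a category that falls just below the cut on a replica contributes nothing from that replica, so the merged counts may be lower than the true totals.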
Thanks Shai,
I'm trying your solution and it's working; I need to check some numbers
to test it.
As I said, we are aware of having big indexes, so I use facets only on
subsets, but if that results in performance issues too then I'll for
sure take a look into facet sampling.
Nicola.
On Wed, 2013
have finished with the
> > following scheme. Search replicas (machines where the index resides) build
> > FacetResults based on their index chunk (top N categories with document
> > counts). Later on the results are merged "by hand", summing relevant
> > catego
Hi,
your proposal is not clear to me.
On Mon, 2013-01-21 at 18:21 +0200, Shai Erera wrote:
> Hi
>
>
> First, if it's a one time operation, you can merge the taxonomy
> indexes into one, without merging the content indexes too (but you'll
> need to re-map the ordinals in each
Reader
>
> 2) Execute search with MultiReader and FacetsCollector
>
>
>
> It doesn't get simpler than that ! :)
>
>
> Before I go into great length describing what you should do if you
> cannot share the taxonomy, let me know if that's not an option for
> you.
www.thetaphi.de
> eMail: u...@thetaphi.de
>
> > -Original Message-
> > From: Nicola Buso [mailto:nb...@ebi.ac.uk]
> > Sent: Monday, January 21, 2013 3:59 PM
> > To: java-user@lucene.apache.org
> > Subject: FacetedSearch and MultiReader
> >
> &
Hi all,
I'm trying to develop faceted search using lucene 4.0 faceting
framework.
In our project we are searching on multiple indexes using lucene
MultiReader. How should we use the faceted framework to obtain
FacetResults starting from a MultiReader? All the examples I see are
using a "single" Ind
Hi,
take a look at StandardAnalyzer sources for an example:
http://grepcode.com/file/repo1.maven.org/maven2/org.apache.lucene/lucene-analyzers-common/4.0.0/org/apache/lucene/analysis/standard/StandardAnalyzer.java#StandardAnalyzer
In your case you are case:
- remember your analyzer have to be re
On Tue, 2012-12-04 at 20:36 +0530, Mohammad Tariq wrote:
> Hello list,
>
> I am entirely new to Lucene and was trying to get myself familiar
> with it with the help of tutorial presented here :
> http://www.lucenetutorial.com/lucene-in-5-minutes.html
>
> I was trying to execute the examp
On Tue, 2012-12-04 at 17:30 +0530, madan mp wrote:
> I am working with Lucene 3.6. I am trying to use the highlighter 3.6 in my
> search engine code; can you please provide full code for highlighting the
> terms?
>
>
> regards
> madan
Hi,
not a complete example, but you can play