Re: TermInSetQuery keep terms order in results

2018-07-02 Thread Nicola Buso
red, why not index their ordinals, and > then sort by that? > > On Mon, Jul 2, 2018, 6:16 AM Nicola Buso wrote: > > Hi Uwe, > > > > as said the sorting is calculated elsewhere upfront and the terms > > are > > provided to Lucene in the order calculated (in an

Re: TermInSetQuery keep terms order in results

2018-07-02 Thread Nicola Buso
; Achterdiek 19, D-28357 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: Nicola Buso > > Sent: Monday, June 25, 2018 5:09 PM > > To: Uwe Schindler ; java-user@lucene.apache.org > > Subject: Re: TermInSetQuer

Re: TermInSetQuery keep terms order in results

2018-06-25 Thread Nicola Buso
eries coming first) and apply on a field without norms. > > TermInSetQuery is fast because it neglects scoring and is just good > at intersecting the terms dict with the given terms set. > > Uwe > > - > Uwe Schindler > Achterdiek 19, D-28357 Bremen > http://www.thet

Re: TermInSetQuery keep terms order in results

2018-06-25 Thread Nicola Buso
Probably is more a sort problem than scoring the single document and the order of the input terms is calculated at runtime, in case someone is thinking about adding a sorting field at indexing time. Nicola On Mon, 2018-06-25 at 12:23 +0100, Nicola Buso wrote: > Hi, > > I need t

TermInSetQuery keep terms order in results

2018-06-25 Thread Nicola Buso
this score? Cheers, Nicola -- Nicola Buso EMBL-EBI - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Extending Analyzer at runtime

2017-06-23 Thread Nicola Buso
at runtime and from there start adding my filters. Did someone already did something similar? Nicola -- Nicola Buso EMBL-EBI - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: j

Re: Hierarchical Facets need duplicated counts

2016-07-07 Thread Nicola Buso
Any hint on how to calculate these values without asking the whole facet hierarchy and count them? Is there a specific point in the code where I can check for this distinct count, and maybe modify the code? Nicola On Wed, 2016-07-06 at 13:42 +0100, Nicola Buso wrote: > Hello everyone, >

Hierarchical Facets need duplicated counts

2016-07-06 Thread Nicola Buso
ching and maybe have this "duplicated count" summed up in the parent nodes. Do you have any hints on how to achieve it? Regards, Nicola -- Nicola Buso EMBL-EBI - To unsubscribe, e-mail: java-user-unsubscr...@lu

Re: Strange Directory open exception

2015-08-12 Thread Nicola Buso
; H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: Nicola Buso [mailto:nb...@ebi.ac.uk] > > Sent: Wednesday, August 12, 2015 10:57 AM > > To: java-user@lucene.apache.org > > Subject: Strang

Strange Directory open exception

2015-08-12 Thread Nicola Buso
a.nio framework should behave differently? We are on Redhat Enterprise 6: 2.6.32-504.8.1.el6.x86_64 #1 SMP Nicola -- Nicola Buso Software Engineer - Web Production Team European Bioinformatics Institute (EMBL-EBI) European Molecular Biology Laboratory Wellcome Trust Genome Campus Hi

Lucene 5 field indexed

2015-06-03 Thread Nicola Buso
: FieldType.indexOptions() FieldInfo.getIndexOptions() What about to use the same naming conventions? Nicola -- Nicola Buso EMBL-EBI - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e

Re: Facet migration 4.6.1 to > 4.7.0

2014-06-23 Thread Nicola Buso
extend one of the existing classes (IntTaxoFacets). If it's > something completely different, e.g. RangeFacetCounts, you should be > able to just extend Facets. And if it's not a "Facets" thing at all, > i.e. you don't need its API, just write your own interfac

Facet migration 4.6.1 to > 4.7.0

2014-06-17 Thread Nicola Buso
ends the behavior to achieve this? Any help will be really apreciated, Nicola. -- Nicola Buso Software Engineer - Web Production Team European Bioinformatics Institute (EMBL-EBI) European Molecular Biology Laboratory Wellcome Trust Genome Campus Hinxton Cambridge CB10 1SD Unite

Re: Faceted Search User's Guide for Lucene 4.8.1

2014-06-16 Thread Nicola Buso
d some classes (e.g. > > FacetSearchParams > > or CountFacetRequest). > > > > Is there an updated version of that guide? > > I tried this > > http://lucene.apache.org/core/*4_8_1*/facet/org/apache/lucene/facet/doc-files/userguide.html > > but it does not work :| &

Re: MultiReader docid reliability

2014-06-02 Thread Nicola Buso
7;ve > said so far... > > > Best, > Erick > > > On Fri, May 30, 2014 at 7:49 AM, Nicola Buso wrote: > Hi, > > thanks Michael and Alan. Is enough to know that re-opening the > index > there is no guarantee th

Re: MultiReader docid reliability

2014-05-30 Thread Nicola Buso
> > collection. > > > > Alan Woodward > > www.flax.co.uk > > > > > > On 30 May 2014, at 11:20, Nicola Buso wrote: > > > >> Hi Alan, > >> > >> just to make it more typical (yes there are not IndexWriters open on > >>

Re: MultiReader docid reliability

2014-05-30 Thread Nicola Buso
ff trying to rely on this for more typical uses. > > Alan Woodward > www.flax.co.uk > > > > On 30 May 2014, at 10:39, Nicola Buso wrote: > > > Hi Alan, > > > > thanks a lot for the reply. > > > > For what I understood from your reply i

Re: MultiReader docid reliability

2014-05-30 Thread Nicola Buso
chines, the docBase assigned to each reader context should > be the same. > > Alan Woodward > www.flax.co.uk > > > > On 29 May 2014, at 14:29, Nicola Buso wrote: > > > Hi, > > > > from the javadocs: > > > > > > For efficiency,

MultiReader docid reliability

2014-05-29 Thread Nicola Buso
ds in different machines (than separated JVMs)? nicola. -- Nicola Buso EMBL-EBI - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: IndexWriter.addIndexes() multithread correct?

2014-05-22 Thread Nicola Buso
lo to Jo McEntyre! :-) > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: Nicola Buso [mailto:nb...@ebi.ac.uk] > > Sent: Thurs

IndexWriter.addIndexes() multithread correct?

2014-05-22 Thread Nicola Buso
Hi, I was wondering if it's correct to call the method: IndexWriter.addIndexes(...) from different threads concurrently. nicola. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mai

Re: queries with "&&" doesn't work but "AND" does

2013-10-10 Thread Nicola Buso
On Thu, 2013-10-10 at 16:02 +0530, Devi pulaparti wrote: > toString output by queryparser.parse() for query TEST && USAGE is " > content:TEST content:\"amp amp\" content:USAGE " . > and for query TEST AND USAGE is "+content:TEST +content:USAGE" > any idea why is analyzer treating && as conten

Re: TotalHitCountCollector performance

2013-09-19 Thread Nicola Buso
res. As the counting > collector does not request any scores, you just add a useless additional > wrapper around the query's scorer. > > Uwe > > - > Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > &g

TotalHitCountCollector performance

2013-09-18 Thread Nicola Buso
Hello, I was going to use the TotalHitCountCollector in cases where I'm interested just in the number of results. Obviously I was hoping to gain in performances compared to a "scored" query. >From my tests it seam it's not so performant compare to the "scored" search. At this point I'm wondering i

Re: Facets ordering

2013-07-04 Thread Nicola Buso
e, but I'd need to understand better what sort of ranges do u have in > mind. > Maybe we can discuss that on a separate thread? > > Shai Nicola > > > On Wed, Jul 3, 2013 at 7:34 PM, Nicola Buso wrote: > > > Hi Shai, > > > > if I'm correctly unde

Re: Facets ordering

2013-07-02 Thread Nicola Buso
ant your top-K to be computed by label too? Or first deduce > the top-K facets, then sort them otherwise? > > Shai > > > > On Tue, Jul 2, 2013 at 6:36 PM, Nicola Buso wrote: > Hi, > > I was looking to change the order of the facet

Facets ordering

2013-07-02 Thread Nicola Buso
Hi, I was looking to change the order of the facet results; in this case I would like to order by the facet label instead of the facet value (count). An example is a facet on dates; suppose the facet is saved as /MM/dd, I would like obtain values for this date ordered by the date; i.e. with d

Re: Retrieving FieldInfo

2013-05-14 Thread Nicola Buso
Thanks for the explanation, I'm not in this situation but it's helpful to understand better lucene. Nicola Michael McCandless wrote: >On Tue, May 14, 2013 at 10:02 AM, Nicola Buso wrote: > >> I now this can sound horrible/flexible/... but this mean I can add two >

Re: Retrieving FieldInfo

2013-05-14 Thread Nicola Buso
I now this can sound horrible/flexible/... but this mean I can add two documents with the same field name, but different configurations, for example different IndexOptions? Nicola. On Tue, 2013-05-14 at 12:52 +0100, Nicola Buso wrote: > OK, thanks for the reply! > > > Nicola.

Re: Retrieving FieldInfo

2013-05-14 Thread Nicola Buso
> commit(); > > And each AtomicReader will contain different fields. As > getFieldInfos() documents "Get the {@link FieldInfos} describing all > fields in this reader." > > > Shai > > > > On Tue, May 14, 2013 at 1:35 PM, Nicola Buso wrote:

Retrieving FieldInfo

2013-05-14 Thread Nicola Buso
Hi, I was looking to a way to obtain FieldInfo(s) from the IndexReader; we need in some way to describe the index. Can I do this? AtomicReader ar = .leaves().get(0).reader(); // than call ar.getFieldInfos(); What I mean is, can I suppose every AtomicReader in leaves() contain the

Re: Collect facet only on specific values

2013-05-09 Thread Nicola Buso
ess wrote: > On Thu, May 9, 2013 at 5:53 AM, Nicola Buso wrote: > > Hi, > > > > I want to hide some values for a facet; than I'm looking into a way to > > collect facet values only for a subset of possible values. > > Is there a way to obtain this? > > I

Collect facet only on specific values

2013-05-09 Thread Nicola Buso
Hi, I want to hide some values for a facet; than I'm looking into a way to collect facet values only for a subset of possible values. Is there a way to obtain this? If not, extending TopK...Handler (or wrapping it) would be the right way to obtain this behaviour? Nicola. -

Luke and Facet search

2013-05-01 Thread Nicola Buso
Hi all, I think somebody already did this question in the mailing list, but I can't find it. Is possible to execute a search with facets in luke? then use the $facets field in searches? Nicola. - To unsubscribe, e-mail: java-

Re: Big number of values for facets

2013-04-26 Thread Nicola Buso
ot the > > taxonomy. But you also incur search performance loss because counting a > > certain dimension requires traversing multiple DV fields. > > > > To enable partitions you need to override FacetIndexingParams partition > > size. You can try to play with

Re: Big number of values for facets

2013-04-26 Thread Nicola Buso
ou need to override FacetIndexingParams partition > size. You can try to play with it. > > In am intetested though to understand the general scenario. Perhaps this > can be solved some other way... > > Shai > On Apr 26, 2013 5:44 PM, "Nicola Buso" wrote: >

Big number of values for facets

2013-04-26 Thread Nicola Buso
Hi all, I'm encountering a problem to index a document with a large number of values for one facet. Caused by: java.lang.IllegalArgumentException: DocValuesField "$facets" is too large, must be <= 32766 at org.apache.lucene.index.BinaryDocValuesWriter.addValue(BinaryDocValuesWriter.java:5

Re: Faceted Search: count direct matches/member für result nodes

2013-04-25 Thread Nicola Buso
Hi, which version of Lucene? Check the OrdinalPolicy you are using in FacetIndexingParams at indexing time. I think you should use: NonTopLevelOrdinalPolicy in lucene 3.6.1 or OrdinalPolicy.ALL_BUT_DIMENSION in lucene 4.2.1 Nicola. On Thu, 2013-04-25 at 08:32 +0200, Schimke, Danny wrote: >

Lucene 4.2, where is facet residue

2013-04-16 Thread Nicola Buso
Hi all, in lucene 4.1 there was this API FacetResultNode.getResidue() to understand if there are more values for a facet, than the topk returned. Is there a similar API in 4.2? I can't find it! Nicola. - To unsubscribe, e-mai

Re: lucene 4.2 count on merged taxonomies

2013-04-11 Thread Nicola Buso
ish all the refactoring work > before 4.2. > > > What you should do is extend the new FacetsAggregator and override > FacetsAccumulator.getAggregator(). Can you try that and let us know if > that resolves your problem? > > > Shai > > > > On Thu, Apr 11, 2013 at 1

lucene 4.2 count on merged taxonomies

2013-04-11 Thread Nicola Buso
Hi all, in Lucene 4.1, after some advise from the mailing list I am merging taxonomies (in memory because of the small size of taxonomies indexes) and collecting facets values from the merged taxonomy instead of the single ones; the scenario is: - you have a Multireader pointing to more indexes -

Re: FacetedSearch and MultiReader

2013-04-09 Thread Nicola Buso
Hi, I'm trying to use Lucene 4.2, but this merge of more taxonomy indexes seam is no more working. Do you have any idea why it has not to work in Lucene 4.2? Normal faceted search on a single index is working correctly. Nicola. On Thu, 2013-01-24 at 16:53 +0000, Nicola Buso wrote: >

Re: FacetRequest include residue

2013-01-30 Thread Nicola Buso
if that's what you're looking for? > > > And again, this might be something that you can do in the UI > -- add the value that the user selected to the list of values > that are displayed, irregardless of what the top-10 are

FacetRequest include residue

2013-01-29 Thread Nicola Buso
Hi, I have a FacetRequest with numResults setted to 10, how can I specify additional facets value to add to the FacetResult? I try to explain the use-case: - the user view 10 facet result - the interface permit the user to choose a facet value not from the top-10 results - the user execute the qu

Re: Faceted search in OR

2013-01-29 Thread Nicola Buso
Hi Michael, I'm looking into implementing a solution. On Fri, 2013-01-25 at 16:23 -0500, Michael McCandless wrote: > On Fri, Jan 25, 2013 at 3:48 PM, Nicola Buso wrote: > > > if you have experiences in this use case can you share solutions? What > > is reusable from Lu

Re: Faceted search in OR

2013-01-25 Thread Nicola Buso
A/4 are not returned to the user, even though they belong to > >> the result set, but did not make it to the top-K > >> * Facets A/5, A/6 are not returned because they don't belong to the result > >> set at all. > >> > >> If this makes sense to you, an

Re: Faceted search in OR

2013-01-24 Thread Nicola Buso
ur > FR.setNumResults to Integer.MAX_VALUE. But note, it's going to be > costly, i.e. you'd get a FacetResultNode per child of A, so depending > how "wide" A is, this may have some impact on RAM consumption. > > If that's not what you meant, could you pleas

Faceted search in OR

2013-01-24 Thread Nicola Buso
Hi all, I'm introducing Lucene faceted search in our project and I need some hints to achieve some functionalities: - I want facet filtering in OR, how to? - obtain facets for the filtered results but also for the non filtered one. i.e. I have facet A with values A/V1, A/V2, A/V3 and these value

Re: FacetedSearch and MultiReader

2013-01-24 Thread Nicola Buso
the > > following scheme. Search replicas (machines where index resides) are build > > FacetResult's based on their index chunk (top N categories with document > > counts). Later on the results are merged "by hands" with summing relevant > > categories from diff

Re: FacetedSearch and MultiReader

2013-01-23 Thread Nicola Buso
Thanks Shai, I'm trying your solution and it's working, I need to check some number to test it. As I said we are aware having big indexes, than I use facets only on subsets, but if it will result in performances issues too than I'll for sure take a look into facet sampling. Nicola. On Wed, 2013

Re: FacetedSearch and MultiReader

2013-01-22 Thread Nicola Buso
have finished with the > > following scheme. Search replicas (machines where index resides) are build > > FacetResult's based on their index chunk (top N categories with document > > counts). Later on the results are merged "by hands" with summing relevant > > catego

[Fwd: Re: FacetedSearch and MultiReader]

2013-01-21 Thread Nicola Buso
--- Begin Message --- Hi, it's not clear your proposal. On Mon, 2013-01-21 at 18:21 +0200, Shai Erera wrote: > Hi > > > First, if it's a one time operation, you can merge the taxonomy > indexes into one, without merging the content indexes too (but you'll > need to re-map the ordinals in each

Re: FacetedSearch and MultiReader

2013-01-21 Thread Nicola Buso
Reader > > 2) Execute search with MultiReader and FacetsCollector > > > > It doesn't get simpler than that ! :) > > > Before I go into great length describing what you should do if you > cannot share the taxonomy, let me know if that's not an option for > you. &

Re: FacetedSearch and MultiReader

2013-01-21 Thread Nicola Buso
www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: Nicola Buso [mailto:nb...@ebi.ac.uk] > > Sent: Monday, January 21, 2013 3:59 PM > > To: java-user@lucene.apache.org > > Subject: FacetedSearch and MultiReader > > > &

FacetedSearch and MultiReader

2013-01-21 Thread Nicola Buso
Hi all, I'm trying to develop faceted search using lucene 4.0 faceting framework. In our project we are searching on multiple indexes using lucene MultiReader. How should we use the faceted framework to obtain FacetResults starting from a MultiReader? all the example I see are using a "single" Ind

Re: porting a cutsom Analyzer from 3.6 -> 4.0

2012-12-10 Thread Nicola Buso
Hi, take a look at StandardAnalyzer sources for an example: http://grepcode.com/file/repo1.maven.org/maven2/org.apache.lucene/lucene-analyzers-common/4.0.0/org/apache/lucene/analysis/standard/StandardAnalyzer.java#StandardAnalyzer In your case you are case: - remember your analyzer have to be re

Re: Beginning with Lucene

2012-12-04 Thread Nicola Buso
On Tue, 2012-12-04 at 20:36 +0530, Mohammad Tariq wrote: > Hello list, > > I am entirely new to Lucene and was trying yo get myself familiar > with it with the help of tutorial presented here : > http://www.lucenetutorial.com/lucene-in-5-minutes.html > > I was trying to execute the examp

Re: lucene highlighter

2012-12-04 Thread Nicola Buso
On Tue, 2012-12-04 at 17:30 +0530, madan mp wrote: > I am working with lucene 3.6 ,I am trying to use the highlighter 3.6 in my > search engine code, can you pls provide me full code for highlighting the > termspls..pls > > > regards > madan Hi, not a complete example, but you can play