Re: Should it be acceptable to have a "rows" parameter larger than Integer.MAX_VALUE?

2021-04-04 Thread Eric Pugh
This doesn’t answer your question, but it would be nice if the exception 
pointed you to the specific cause of the error?“Input string ‘30’ 
exceeds Integer.MAX_VALUE for rows parameter”, so that if you aren’t a java 
savvy person, you would better understand the issue.

> On Apr 3, 2021, at 5:35 PM, Shawn Heisey  wrote:
> 
> I've come across something and I wonder if it should be considered a bug.
> 
> If a value larger than Integer.MAX_VALUE is sent with the "rows" parameter, 
> Solr will immediately throw an exception:
> 
> org.apache.solr.common.SolrException: For input string: "30"
> 
> It would be perfectly valid (though probably have terrible performance) to 
> expect a value like that with distributed indexes.  The individual shard 
> subqueries of course could never go that high.
> 
> This error also occurs in the cloud example with a distributed index.
> 
> Should this be considered a bug, or are we OK with current behavior?
> 
> Thanks,
> Shawn

___
Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 | 
http://www.opensourceconnections.com  | 
My Free/Busy   
Co-Author: Apache Solr Enterprise Search Server, 3rd Ed 


This e-mail and all contents, including attachments, is considered to be 
Company Confidential unless explicitly stated otherwise, regardless of whether 
attachments are marked as such.



Re: Unified Highlighter and Fuzzy Searches

2021-04-04 Thread David Smiley
This was a bug that was fixed in 8.7:
https://issues.apache.org/jira/browse/LUCENE-9427

I thought perhaps hl.weightMatches=false might work but it doesn't.  So you
have to upgrade to get this.

~ David Smiley
Apache Lucene/Solr Search Developer
http://www.linkedin.com/in/davidwsmiley


On Thu, Apr 1, 2021 at 8:23 AM Srijan  wrote:

> I tried both Standard and DisMax query parsers and the issue is easily
> reproducible. And forgot to mention earlier, I am trying this on Solr 8.6.3
>
> Just to add more clarity this is what I am doing:
>
> I have say a field called File_Content_Field with the following values
> indexed and stored: "runnings, running, run, runs"
>
> My query is something like this:
>
> q=File_Content_Field:runnings~0&hl.fl=File_Content_Field&hl=on
>
> With original highlighter, I see the following response:
> "highlighting": {"document_id": {"File_Content_Field": ["\n \nTest Dataset
> 1 Running and Runnings and Runs and R\n \n "]
>
>
> With unified highlighter, no highlighting is returned:
>
>
> q=File_Content_Field:runnings~0&hl.fl=File_Content_Field&hl=on&hl.method=unified
>
> "highlighting": {"document_id": {"File_Content_Field": []}
>
>
> However, runnings~1 works as expected (highlights both running and
> runnings)
>
>
> q=File_Content_Field:runnings~0&hl.fl=File_Content_Field&hl=on&hl.method=unified
>  "highlighting": {
> "document_id": {"File_Content_Field": [ "\n \nTest Dataset 1
> Running and Runnings and Runs and R\n \n ]
> }
>
>
>
>
> On Thu, Apr 1, 2021 at 12:32 AM David Smiley  wrote:
>
> > I tried this in tests both at the Lucene layer and Solr layer and I'm not
> > seeing the failure to highlight for the UH.  What query parser are you
> > using?
> >
> > ~ David Smiley
> > Apache Lucene/Solr Search Developer
> > http://www.linkedin.com/in/davidwsmiley
> >
> >
> > On Wed, Mar 31, 2021 at 11:39 AM seez  wrote:
> >
> > > Hello,
> > >
> > > I have the following fuzzy search criteria:
> > >
> > > runnings~0
> > >
> > > Search itself returns expected results and I see documents that have
> the
> > > exact term "runnings". However the same query criteria is not honored
> by
> > > unified highlighter. It gives back no matching results. Although
> > > "runnings~1" works (with the added caveat of also honoring the "1" edit
> > > distance).
> > >
> > > So it appears unified highlighter only supports edit distance > 0 for
> > fuzzy
> > > searches. And this is not an issue with original or fastVector
> > > highlighters.
> > > Is this a real problem or am I missing something?
> > >
> > >
> > >
> > >
> > >
> > > --
> > > Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html
> > >
> >
>


Index Size of a tenant

2021-04-04 Thread Natarajan, Rajeswari
Hi,

We plan to store multiple tenants in a single collection (multiple shards)  
with a composite Id router with docId prefix as tenant id.
In this set up, how can a tenant’s index size be found. Solr metrics api gives 
the core’s index size .But in same core multiple tenants might be present.
Would like to know if there is any out of box solr api available for this case.


Thanks,
Rajeswari