[ 
https://issues.apache.org/jira/browse/SOLR-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15954662#comment-15954662
 ] 

Abhishek Kumar Singh commented on SOLR-10256:
---------------------------------------------

I agree with your argument that it may not be the best use case. 
Because it all depends on how we have configured our search to work. And this 
is why we have configurations like  _mm_ , for specifying the minimum match. 
The problem arises in cases wherein our _mm_ configuration guarantees *100% 
tokens-match*, but the spellcheck (due to WordBreak) shows the [suggestions 
wherein even one of the tokens in the broken words](Sugg A)  has a higher 
frequency than the ones with [reasonable frequency but very less Levenstein 
distance ] (Sugg B) . 

We would expect *Sugg B* to have higher weightage in spellcheck suggestions 
than *Sugg A*.  
But it's not happening due to the compulsory braces.

What I feel is, by default it should have braces on, but there should be a 
configuration to switch it off.  

> Parentheses in SpellCheckCollator
> ---------------------------------
>
>                 Key: SOLR-10256
>                 URL: https://issues.apache.org/jira/browse/SOLR-10256
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: spellchecker
>            Reporter: Abhishek Kumar Singh
>         Attachments: SOLR-10256.patch
>
>
> SpellCheckCollator adds parentheses ( *'('* and *')'* ) around tokens which 
> have space between them.  
> This should be configurable, because if *_WordBreakSpellCheckComponent_* is 
> being used, queries like : *applejuice* will be broken down to *apple juice*. 
> Such suggestions are being surrounded by braces by current 
> *SpellCheckCollator*. 
> And when surrounded by brackets, they represent the same position by 
> _EdismaxParser_ , which is not required. 
> https://github.com/apache/lucene-solr/blob/master/solr/core/src/java/org/apache/solr/spelling/SpellCheckCollator.java#L227
>   
> A solution to this will be to have a flag, which can help disable this 
> parenthesisation of spell check suggestions.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to