[jira] [Commented] (SOLR-4137) FastVectorHighlighter: StringIndexOutOfBoundsException in BaseFragmentsBuilder

Marcel (JIRA) Mon, 03 Dec 2012 06:35:59 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-4137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508764#comment-13508764
 ]


Marcel commented on SOLR-4137:
------------------------------

@Robert - sure. Posted an example in the bug description. Here is my analyzer 
chain:

{code}
Index Analyzer
org.apache.solr.analysis.WhitespaceTokenizerFactory 
{luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.HyphenatedWordsFilterFactory 
{luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.WordDelimiterFilterFactory {preserveOriginal=1, 
splitOnCaseChange=1, generateNumberParts=1, catenateWords=1, 
types=wdftypes.txt, luceneMatchVersion=LUCENE_36, generateWordParts=1, 
catenateAll=0, catenateNumbers=1}
org.apache.solr.analysis.LowerCaseFilterFactory {luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.StopFilterFactory {words=lang/stopwords_de.txt, 
ignoreCase=true, enablePositionIncrements=true, luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.DictionaryCompoundWordTokenFilterFactory 
{maxSubwordSize=15, onlyLongestMatch=true, minSubwordSize=4, minWordSize=5, 
dictionary=spellings.txt, luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.GermanNormalizationFilterFactory 
{luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.GermanStemFilterFactory {luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.SnowballPorterFilterFactory {language=German2, 
luceneMatchVersion=LUCENE_36}
org.apache.solr.analysis.RemoveDuplicatesTokenFilterFactory 
{luceneMatchVersion=LUCENE_36}
{code}
                
> FastVectorHighlighter: StringIndexOutOfBoundsException in BaseFragmentsBuilder
> ------------------------------------------------------------------------------
>
>                 Key: SOLR-4137
>                 URL: https://issues.apache.org/jira/browse/SOLR-4137
>             Project: Solr
>          Issue Type: Bug
>          Components: highlighter
>    Affects Versions: 3.6.1
>            Reporter: Marcel
>
> under some circumstances the BaseFragmentsBuilder genereates a 
> StringIndexOutOfBoundsException inside the makeFragment method.
> The starting offset is higher than the end offset.
> I did a small patch checking the offsets and posted it over there at 
> Stackoverflow: 
> http://stackoverflow.com/questions/12456448/solr-highlight-bug-with-usefastvectorhighlighter
> The code in 4.0 seems to be the same as in 3.6.1
> Example how to reproduce the behaviour:
> There is a word called "www.DAKgesundAktivBonus.de" inside the index. If you 
> search for "dak bonus" some offset calculations went wrong.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (SOLR-4137) FastVectorHighlighter: StringIndexOutOfBoundsException in BaseFragmentsBuilder

Reply via email to