Ibrahim created LUCENE-4216:
-------------------------------

             Summary: Token X exceeds length of provided text sized X
                 Key: LUCENE-4216
                 URL: https://issues.apache.org/jira/browse/LUCENE-4216
             Project: Lucene - Java
          Issue Type: Bug
          Components: modules/highlighter
    Affects Versions: 4.0-ALPHA
         Environment: Windows 7, jdk1.6.0_27
            Reporter: Ibrahim


I'm facing this exception:
org.apache.lucene.search.highlight.InvalidTokenOffsetsException: Token رأيكم 
exceeds length of provided text sized 170
        at 
org.apache.lucene.search.highlight.Highlighter.getBestTextFragments(Highlighter.java:233)
        at classes.myApp$16$1.run(myApp.java:1508)


I tried to find anything wrong in my code when i start migrating Lucene 3.6 to 
4.0 without successful. i found similar issues with HTMLStripCharFilter e.g. 
LUCENE-3690, LUCENE-2208 but not with SimpleHTMLFormatter so I'm triggering 
this here to see if there is really a bug or it is something wrong in my code 
with v4. The code that im using:

final Highlighter highlighter = new Highlighter(new SimpleHTMLFormatter("<font 
color=red>", "</font>"), new QueryScorer(query));
.......
final TokenStream tokenStream = 
TokenSources.getAnyTokenStream(defaultSearcher.getIndexReader(), j, "Line", 
analyzer);
final TextFragment[] frag = highlighter.getBestTextFragments(tokenStream, 
doc.get("Line"), false, 10);


Please note that this is working fine with v3.6

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to