Ibrahim created LUCENE-4216:
-------------------------------
Summary: Token X exceeds length of provided text sized X
Key: LUCENE-4216
URL: https://issues.apache.org/jira/browse/LUCENE-4216
Project: Lucene - Java
Issue Type: Bug
Components: modules/highlighter
Affects Versions: 4.0-ALPHA
Environment: Windows 7, jdk1.6.0_27
Reporter: Ibrahim
I'm facing this exception:
org.apache.lucene.search.highlight.InvalidTokenOffsetsException: Token رأيكم
exceeds length of provided text sized 170
at
org.apache.lucene.search.highlight.Highlighter.getBestTextFragments(Highlighter.java:233)
at classes.myApp$16$1.run(myApp.java:1508)
I tried to find anything wrong in my code when i start migrating Lucene 3.6 to
4.0 without successful. i found similar issues with HTMLStripCharFilter e.g.
LUCENE-3690, LUCENE-2208 but not with SimpleHTMLFormatter so I'm triggering
this here to see if there is really a bug or it is something wrong in my code
with v4. The code that im using:
final Highlighter highlighter = new Highlighter(new SimpleHTMLFormatter("<font
color=red>", "</font>"), new QueryScorer(query));
.......
final TokenStream tokenStream =
TokenSources.getAnyTokenStream(defaultSearcher.getIndexReader(), j, "Line",
analyzer);
final TextFragment[] frag = highlighter.getBestTextFragments(tokenStream,
doc.get("Line"), false, 10);
Please note that this is working fine with v3.6
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]