And of cource the link broke : https://drive.google.com/file/d/1wfZFQD6loTeA9_-eGrdwi9YGtJcNjKli/view?usp=sharing
> df2832368_...@amberoad.de df2832368_...@amberoad.de <j...@amberoad.de> > hat am 08.03.2021 11:05 geschrieben: > > > Hello, > > I am currently working on getting a custom BreakIterator for the Unified > Highlighter to work, and struggle a bit performance wise. > > I need a BreakIterator for getting nice highlights of passages. For this > I want the start of the highlight to be a sentence-start and the end to be a > word-end. There are also some weird edge cases. > > I already coded the BreakIterator and integrated it to our custom > UnifiedHighlighter class, but when I use this Iterator the qTime of all > requests rise from ~1000 to 12000+ which is not acceptable for this > application. > > Here is a link to my implementation. I can't really find where I am > horrible inefficient.(I know that these functions get called very often) > > Any suggestions are welcome, also other approaches. > > So are there some nice resources to learn more about BreakIterators and > stuff, since digging into the code is really hard here. > > Another approach I am considering next is to do this highlight > "trimming", when the final highlights are found. This would reduce the amount > of logic called, but I guess the scoring system of SOLR wouldn't be taken in > to account the right way. > > As I said all suggestions are welcome and thanks in advance. > > Jan Ulrich Robens >