Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-08 Thread Mark Miller
On Jul 8, 2011, at 5:43 AM, Jahangir Anwari wrote: > I don't think this is the best > solution, am open to other alternatives. Could also make it static public where it is? Either way. - Mark Miller lucidimagination.com

Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-08 Thread Jahangir Anwari
After applying the patch I was able to get the span positions for all the terms in the query. But now when I tried to access the positionSpans of each span term I cannot because they are stored in a package-private PositionSpan class in WeightedSpanTerm.java which prevents them from being visible o

Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-07 Thread Mark Miller
On Jul 7, 2011, at 5:14 PM, Jahangir Anwari wrote: > I did noticed a strange issue though. When the query is just a > PhraseQuery(e.g. "everlasting glory"), getWeightedSpanTerms() returns all > the span terms along with their span positions. But when the query is a > BooleanQuery containing phras

Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-07 Thread Jahangir Anwari
Thanks Mark. After setting maxDocCharsToAnalyze to a value greater than 0, I can now extract the span terms. I did noticed a strange issue though. When the query is just a PhraseQuery(e.g. "everlasting glory"), getWeightedSpanTerms() returns all the span terms along with their span positions. But

Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-06 Thread Mark Miller
Sorry - kind of my fault. When I fixed this to use maxDocCharsToAnalyze, I didn't set a default other than 0 because I didn't really count on this being used beyond how it is in the Highlighter - which always sets maxDocCharsToAnalyze with it's default. You've got to explicitly set it higher t

Re: Extracting span terms using WeightedSpanTermExtractor

2011-07-06 Thread Michael Sokolov
I tried something similar, and failed - I think the API is lacking there? My only advice is to vote for this: https://issues.apache.org/jira/browse/LUCENE-2878 which should provide an alternative better API, but it's not near completion. -Mike On 7/6/2011 5:34 PM, Jahangir Anwari wrote: I h