AUTO: Zhou Lin Dai is out of the office. (returning 2009-03-16)

2009-03-13 Thread Zhou Lin Dai
I am out of the office until 2009-03-16.. I will check emails at night. For anything emergent, you can call my cell phone (86) 131 6290 0375. Note: This is an automated response to your message Re: Lucene Highlighting and Dynamic Summaries sent on 13/3/09 22:20:15. This is the only notificatio

Re: What kind of performance to expect from a MultiTermQuery being used in BooleanQuery?

2009-03-13 Thread ArtemGr
> I wonder: does Lucene need to scan all the terms in the inverted index > and the collect all the document identifiers into the DocIdSet > in order to implement the MultiTermQuery which goes after some other > queries in a BooleanQuery? Like, collecting a thouthand of document > identifiers only t

Re: Lucene Highlighting and Dynamic Summaries

2009-03-13 Thread Amin Mohammed-Coleman
Ok. I tried to apply the patch(s) and completely messed it up (user error). Is there a full example of the highlighter that is available that I can apply and test? Cheers Amin On Fri, Mar 13, 2009 at 12:09 PM, Amin Mohammed-Coleman wrote: > Absolutely! I have received considerable help from

Re: StandardTokenizer issue ?

2009-03-13 Thread iMe
Grant Ingersoll-6 wrote: > > That does sound like an issue. Can you open a JIRA issue for it? > I don't know how to do that... Could somebody do it for me ? Thank you -- View this message in context: http://www.nabble.com/StandardTokenizer-issue---tp22471475p22495653.html Sent from the Lu

Re: Lucene Highlighting and Dynamic Summaries

2009-03-13 Thread Amin Mohammed-Coleman
Absolutely! I have received considerable help from the community and there are so many more stuff I want to ask! Cheers! Amin On Fri, Mar 13, 2009 at 10:41 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > > Well, it's not yet committed. > > You can use it now by pulling the patch a

Re: StandardTokenizer issue ?

2009-03-13 Thread Grant Ingersoll
That does sound like an issue. Can you open a JIRA issue for it? Thanks, Grant On Mar 12, 2009, at 5:55 AM, iMe wrote: I spotted an unexepcted behavior when using the StandardAnalyzer. This analyzer uses the StandardTokenizer which javadoc states: Splits words at hyphens, unless there's

Re: Lucene Highlighting and Dynamic Summaries

2009-03-13 Thread Michael McCandless
Well, it's not yet committed. You can use it now by pulling the patch attached to the issue & testing it yourself. If you do so, please report back! This is how Lucene improves. I'm hoping we can include it in 2.9... Mike On Mar 13, 2009, at 6:35 AM, Amin Mohammed-Coleman wrote: Swee

Re: search problem when indexed using Field.setOmitTf()

2009-03-13 Thread Michael McCandless
OK I opened https://issues.apache.org/jira/browse/LUCENE-1561. Mike Otis Gospodnetic wrote: I bet omitTf will be confusing to people. When I see omitTf I read that as "aha, don't store term frequency". I don't read that as "don't store term frequency and don't store positional inform

Re: search problem when indexed using Field.setOmitTf()

2009-03-13 Thread Michael McCandless
I agree, the name is cryptic, and we should also strengthen the javadocs to explain what searches will not work properly if you use it. I'll open an issue. Any suggestions for better name? omitTermPositions? Mike Otis Gospodnetic wrote: I bet omitTf will be confusing to people. When

Re: Lucene Highlighting and Dynamic Summaries

2009-03-13 Thread Amin Mohammed-Coleman
Sweet! When will this highlighter be available? Can I use this now? Cheers! On Fri, Mar 13, 2009 at 10:10 AM, Michael McCandless < luc...@mikemccandless.com> wrote: > > Amin Mohammed-Coleman wrote: > > I think that would be good. >> > > I'll open an issue. > > Probably a silly thing to ask

Re: Lucene Highlighting and Dynamic Summaries

2009-03-13 Thread Michael McCandless
Amin Mohammed-Coleman wrote: I think that would be good. I'll open an issue. Probably a silly thing to ask but I guess there is a performance implication by setting it to max value. Right. And it's tough choosing a default in situations like this -- performance vs losing stuff. Howe