Re: Search problems

2005-11-01 Thread Robert Watkins
, the approach you would want to take in your application, but at least it will allow you to use Luke as you have specified. -- Robert Watkins On Tue, 1 Nov 2005, Miles Barr wrote: On Thu, 2005-10-27 at 16:35 -0400, Sharma, Siddharth wrote: My index has 4 keyword fields and one unindexed field

Re: Search problems

2005-11-01 Thread Robert Watkins
, the approach you would want to take in your application, but at least it will allow you to use Luke as you have specified. -- Robert Watkins On Tue, 1 Nov 2005, Miles Barr wrote: On Thu, 2005-10-27 at 16:35 -0400, Sharma, Siddharth wrote: My index has 4 keyword fields and one unindexed field

html parsers and numers of terms

2005-12-13 Thread Robert Watkins
obert Robert Watkins [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: html parsers and numers of terms

2005-12-13 Thread Robert Watkins
Aha! I had, indeed, been fooled by Luke into thinking that the entities had been converted upon analysis, but you have set me straight. Thanks, -- Robert On Tue, 13 Dec 2005, J.J. Larrea wrote: Beware of HTML/XML entities in your input stream! The Lucene analyzers (including StandardAnalyzer

Re: html parsers and numers of terms

2005-12-13 Thread Robert Watkins
So obvious I missed it (at least that's my excuse). I'm on the road at the moment and -- can you believe it? -- didn't bring my copy of Lucene In Action with me! Looks like I'll have to get the source code from lucenebook.com to crib the analyzer demo code. Much obliged, -- Robert On Tue, 13 Dec

Re: exact match ..

2006-02-21 Thread Robert Watkins
here is, obviously, a fair amount of work involved, but the level of control is the payoff. -- Robert Robert Watkins [EMAIL PROTECTED] On Mon, 20 Feb 2006, Erik Hatcher wrote: Yes, this is what PerFieldAnalyzerWrapper provides for you, as described in

Looking for Lucene consultant (UK based)

2006-03-13 Thread Robert Watkins
anks! ---- Robert Watkins [EMAIL PROTECTED] [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

stemmed search and exact match on "same" field

2006-08-14 Thread Robert Watkins
I've been puzzling this one for a while now, and can't figure it out. The idea is to allow stemmed searches and exact matches (tokenized, but unstemmed phrase searches) on the same field. The subject of this email had "same" in quotes, because it's from the search-client perspective that the sa

Re: stemmed search and exact match on "same" field

2006-08-15 Thread Robert Watkins
Thank you, Chris. You have confirmed what I had all but resigned myself to (and you summarized my goal precisely). I am sticking with the two versions of the field and just accepting the fact that the search clients will need to use my custom query parser. Even if one doesn't get the answer one w

Re: stemmed search and exact match on "same" field

2006-08-15 Thread Robert Watkins
Thank you, Chris. You have confirmed what I had all but resigned myself to (and you summarized my goal precisely). I am sticking with the two versions of the field and just accepting the fact that the search clients will need to use my custom query parser. Even if one doesn't get the answer one w

Re: Test new query parser?

2006-08-22 Thread Robert Watkins
Mark -- Yes please! I'm very interested in the mixing of boolean and proximity operators. I have also worked on a parser (using JavaCC) but haven't managed to crack queries such as: ((a OR b) AND c) NEAR (d NOT e) I can get the parse tree okay, but haven't figured out how to translate that

Re: Test new query parser?

2006-08-23 Thread Robert Watkins
Mark -- Don't lose hope! We are migrating from Verity to Lucene, and I know for a fact that we will have to support the kind of complex queries you talk about; maybe not /quite/ as complex as your magnificent: cop | fowl & (fowl | priest & man) ! helicopter ~8 (hillary | tom) but certainly mo

Alert function (aka "profiled alerting")

2005-03-16 Thread Robert Watkins
posted by a Christian Ubbesen. I have tried to contact him, to see if he had managed to solve this, but have not heard back (hey, he could well have moved on in 3.5 years!) Thanks, -- Robert Robert Watkins [EMAIL PROTECTED]

Re: Alert function (aka "profiled alerting")

2005-03-16 Thread Robert Watkins
Okay, I only bought your book a few days ago, so I haven't read much yet! Also, this project is still at the exploratory level, so very little time has been dedicated to it (the irony is I am currently spending most of my time trying to work around K2 ... issues). As such, it will be some time befo

Re: Alert function (aka "profiled alerting")

2005-03-16 Thread Robert Watkins
The reason your suggestion is not practical is scalability. In a production environment you might have, for example, 10,000 stored queries and 10 new documents a minute. That's a fair bit of load on the system for only one aspect of a much larger search application. On Wed, 16 Mar 2005, Dan Funk

Re: Alert function (aka "profiled alerting")

2005-03-17 Thread Robert Watkins
Thank you, David, for your very interesting suggestions. As I said earlier in this thread somewhere, we are still at the exploratory stage (considering Lucene as a replacement for a commercial engine) so it will be some time before I can get my hands dirty, but you have certainly given me some good

wildcards within a phrase query

2005-10-11 Thread Robert Watkins
efixQuery fits into the slot left by the [gap] of the PhraseQuery. Does that sound like I'm on the right track? -- Robert Robert Watkins [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMA

Re: wildcards within a phrase query

2005-10-11 Thread Robert Watkins
the sequence of terms, such as: ( term | term as prefix )+ Am I missing something? -- Robert On Tue, 11 Oct 2005, Daniel Naber wrote: On Dienstag 11 Oktober 2005 15:32, Robert Watkins wrote: The only idea that comes to mind is to try to combine a PhraseQuery and a PrefixQuery Yes

Re: wildcards within a phrase query

2005-10-11 Thread Robert Watkins
Wonderful! But what about wildcards? I realised after I had sent the last message that my pattern should have been written: ( term | term as prefix | wildcard term )+ -- Robert On Tue, 11 Oct 2005, Daniel Naber wrote: On Dienstag 11 Oktober 2005 22:53, Robert Watkins wrote: I was under

Re: wildcards within a phrase query

2005-10-12 Thread Robert Watkins
lable -- to you? -- Robert PS -- Would it be possible to avoid going through _all_ the terms in the TermEnum (that are greater than prefix, of course) by doing something like: } while (te.next() && te.term().text().startsWith(prefix)); or would analysis possibly make that unwise?

Re: wildcards within a phrase query

2005-10-12 Thread Robert Watkins
Thank you, Daniel. Once I get an environment suitable enough I will certainly give this a whirl. -- Robert On Wed, 12 Oct 2005, Daniel Naber wrote: On Mittwoch 12 Oktober 2005 17:18, Robert Watkins wrote: Does that sound reasonable -- and scalable -- to you? I don't think you ne

[ off topic ] stnadard for field: suffix or prefix?

2005-10-18 Thread Robert Watkins
Robert Watkins [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]