Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-28 Thread Erik Hatcher
On Jul 28, 2005, at 12:37 PM, Chris May wrote: Works beautifully (at least on my 30K-document test index ). I'll need to do some fiddling if I want to allow partial URLs (i.e. http://www2.warwick.ac.uk/ab* to match http://www2.warwick.ac.uk/ about) but I can see how to do that, I think (and

Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-28 Thread Chris May
Works beautifully (at least on my 30K-document test index ). I'll need to do some fiddling if I want to allow partial URLs (i.e. http:// www2.warwick.ac.uk/ab* to match http://www2.warwick.ac.uk/about) but I can see how to do that, I think (and I'm not sure I need it anyway). Thanks Scott!

Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-27 Thread Scott Ganyo
Chris, How about indexing the domain as one field and each part of the path as separate terms in another field? I'm sure you've probably already thought of doing this... and maybe discarded the idea because you'd lose the position information. However, even though you can't just simply

Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-27 Thread Erik Hatcher
On Jul 27, 2005, at 4:56 PM, Chris May wrote: Always domain + part of a path e.g. url:http://blogs.warwick.ac.uk/chrismay/* or url:http://www2.warwick.ac.uk/fac/soc/law/ug/prospective/degrees/ modules/commonlaw/* or url:http://www2.warwick.ac.uk/services/its/* ... and so on. Part of th

Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-27 Thread Chris May
Always domain + part of a path e.g. url:http://blogs.warwick.ac.uk/chrismay/* or url:http://www2.warwick.ac.uk/fac/soc/law/ug/prospective/degrees/ modules/commonlaw/* or url:http://www2.warwick.ac.uk/services/its/* ... and so on. Part of the problem is that we may need to go an arbitrar

Re: Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-27 Thread Erik Hatcher
Could you give some examples of the types of PrefixQuery's you'd like to use? Is it always at a granularity of domain and path? Or are you wanting to do a prefix pieces of the domain and path? Erik On Jul 27, 2005, at 3:47 PM, Chris May wrote: First, apologies for what seems to be s

Searching a URL with a PrefixQuery / Too Many Clauses (again...)

2005-07-27 Thread Chris May
First, apologies for what seems to be something of an FAQ. However, I've not been able to find an answer either in LIA or in the relevant section of the FAQ (http://wiki.apache.org/jakarta-lucene/ LuceneFAQ#head-06fafb5d19e786a50fb3dfb8821a6af9f37aa831) My setup is as follows: I have an inde