On Jul 28, 2005, at 12:37 PM, Chris May wrote:
Works beautifully (at least on my 30K-document test index ). I'll
need to do some fiddling if I want to allow partial URLs (i.e.
http://www2.warwick.ac.uk/ab* to match http://www2.warwick.ac.uk/
about) but I can see how to do that, I think (and
Works beautifully (at least on my 30K-document test index ). I'll
need to do some fiddling if I want to allow partial URLs (i.e. http://
www2.warwick.ac.uk/ab* to match http://www2.warwick.ac.uk/about) but
I can see how to do that, I think (and I'm not sure I need it anyway).
Thanks Scott!
Chris,
How about indexing the domain as one field and each part of the path
as separate terms in another field? I'm sure you've probably already
thought of doing this... and maybe discarded the idea because you'd
lose the position information. However, even though you can't just
simply
On Jul 27, 2005, at 4:56 PM, Chris May wrote:
Always domain + part of a path e.g.
url:http://blogs.warwick.ac.uk/chrismay/*
or
url:http://www2.warwick.ac.uk/fac/soc/law/ug/prospective/degrees/
modules/commonlaw/*
or
url:http://www2.warwick.ac.uk/services/its/*
... and so on. Part of th
Always domain + part of a path e.g.
url:http://blogs.warwick.ac.uk/chrismay/*
or
url:http://www2.warwick.ac.uk/fac/soc/law/ug/prospective/degrees/
modules/commonlaw/*
or
url:http://www2.warwick.ac.uk/services/its/*
... and so on. Part of the problem is that we may need to go an
arbitrar
Could you give some examples of the types of PrefixQuery's you'd like
to use? Is it always at a granularity of domain and path? Or are
you wanting to do a prefix pieces of the domain and path?
Erik
On Jul 27, 2005, at 3:47 PM, Chris May wrote:
First, apologies for what seems to be s
First, apologies for what seems to be something of an FAQ.
However, I've not been able to find an answer either in LIA or in the
relevant section of the FAQ (http://wiki.apache.org/jakarta-lucene/
LuceneFAQ#head-06fafb5d19e786a50fb3dfb8821a6af9f37aa831)
My setup is as follows: I have an inde