Hi Florian,
You might run into issues if you use an n-gram tokenizer. As I see it, you
need tokenized URLs, and you need to run an exact search using a keyword
tokenizer on the search string.
You could try this; I am assuming it will work.
So, something like
en.wikipedia.org/wiki/production_code/test
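
To illustrate the exact-match idea, here is a minimal sketch in plain Java. It does not use Lucene: the class UrlBlacklist and its methods are hypothetical names made up for this example, and a HashSet stands in for the index. It assumes URLs are written without a scheme, as in the example above. Each blacklist entry is stored as one untokenized key, and the search URL is expanded into its path prefixes, each of which is looked up exactly. In Lucene terms this would roughly correspond to indexing each entry as a single unanalyzed term and running one exact term lookup per prefix; that mapping is an assumption, not something confirmed in this thread.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of Lucene: checks a URL against a blacklist
// by exact lookup of each of its path prefixes.
class UrlBlacklist {
    // Stand-in for the index: every entry is kept whole, never split.
    private final Set<String> entries = new HashSet<>();

    // Store one blacklist entry as a single key.
    void add(String entry) {
        entries.add(normalize(entry));
    }

    // Trim whitespace and trailing slashes so "a/b/" and "a/b" are one key.
    static String normalize(String url) {
        String s = url.trim();
        while (s.endsWith("/")) {
            s = s.substring(0, s.length() - 1);
        }
        return s;
    }

    // "host/a/b" -> ["host", "host/a", "host/a/b"]
    static List<String> prefixes(String url) {
        String s = normalize(url);
        List<String> out = new ArrayList<>();
        int slash = s.indexOf('/');
        while (slash >= 0) {
            out.add(s.substring(0, slash));
            slash = s.indexOf('/', slash + 1);
        }
        out.add(s);
        return out;
    }

    // A URL is blocked if it, or any of its parent paths, is an entry.
    boolean isBlocked(String url) {
        for (String prefix : prefixes(url)) {
            if (entries.contains(prefix)) {
                return true;
            }
        }
        return false;
    }
}
```

With "en.wikipedia.org/wiki/production_code" added as an entry, isBlocked returns true for "en.wikipedia.org/wiki/production_code/test" but false for "en.wikipedia.org/wiki/production_codes", because matching happens on whole path segments rather than on character n-grams.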
> Dear List,
>
> I'm working on a project where I have to check a blacklist
> of URLs with Lucene (about 500,000 entries).
> Is it possible to search for a URL in a hierarchical
> context?
>
> For example:
> Blacklist entry: "en.wikipedia.org/wiki/production_code"
>
> "en.wikipedia.org/wiki/production_
Dear List,
I'm working on a project where I have to check a blacklist of URLs with
Lucene (about 500,000 entries).
Is it possible to search for a URL in a hierarchical context?
For example:
Blacklist entry: "en.wikipedia.org/wiki/production_code"
"en.wikipedia.org/wiki/production_code/test" should mat
Hello Lucene users,
On behalf of the Lucene dev community (a growing community far larger
than just the committers) I would like to announce the fifth (and
hopefully last) release candidate for Lucene 2.9.
Please download and check it out - take it f