I guess the thundering silence is rooted in the problem statement. I have a
hard time understanding how this index is used. By storing things this way,
you'll force the user to know the *exact* format of anything she's looking
for. That is, it's hard to search for <option name="test" value="32"> and
get docs containing both <option name="test"> and <option name="test"
value="32"> as hits. You could do some fancy things with filters and regular
expressions (RegexTermEnum) for instance, but I'd hesitate to recommend that
until I understood your problem a little more.

Perhaps a better thing would be to give us a short statement of the problem
you're trying to solve and see what responses you get from that...

Not very helpful, I know, but it's a start.

Erick

On 10/2/06, John Bugger <[EMAIL PROTECTED]> wrote:

Hello!
I've indexed HTML pages and stored html codes as UN_TOKENIZED fields. So,
I
need to search for specific tags in those documents,
for example: <option name="test">
Do I need to write some custom analyzer or something like that?
Please help me!


Reply via email to