I was going to look at neatseeker.sourceforge.net first, but I will look at Lucene
at the same time.
The trouble with indexing also applies to the global search engines: Lycos, Google,
Excite, AltaVista. I was looking at Philip Greenspun's book "Web Publishing ..."
(Morgan Kaufmann, I think). He talks about the problems that search engines have
with non-static files (anything other than *.html, *.htm). Most don't index
AOLserver "*.tcl" or "*.apf" files. Which global search engines index JSP pages?
I know that some (e.g. Google) index "*.asp" and "*.pdf" pages.
Greenspun also said that global search engines won't index ring-fenced web sites,
because they do not perform authentication. His solution was to generate a massive
HTML file of hyperlinks that use remapped URLs. I think he means building an index
page that search engines can crawl.
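
To make the idea concrete, here is a minimal sketch of how such a link page
could be generated. It is my own illustration, not Greenspun's actual code: the
class name, the remapping rule (query string turned into path segments plus
".html") and the sample URLs are all assumptions, and the server would still
need a matching rule to map the static-looking URLs back to the real JSP pages.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: builds one big static HTML page of links so that a
// crawler which skips dynamic URLs still has something to follow.
public class StaticLinkIndex {

    // Assumed remapping rule: "/item.jsp?id=42" becomes "/item.jsp/id/42.html".
    // URLs without a query string are returned unchanged.
    static String remap(String dynamicUrl) {
        int q = dynamicUrl.indexOf('?');
        if (q < 0) {
            return dynamicUrl;
        }
        String path = dynamicUrl.substring(0, q);
        String query = dynamicUrl.substring(q + 1);
        return path + "/" + query.replace('&', '/').replace('=', '/') + ".html";
    }

    // Minimal HTML escaping for attribute values and link text.
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    // Emits one <li><a href="...">title</a></li> per page, in insertion order.
    static String buildIndex(Map<String, String> titlesByUrl) {
        StringBuilder html = new StringBuilder("<html><body><ul>\n");
        for (Map.Entry<String, String> e : titlesByUrl.entrySet()) {
            html.append("<li><a href=\"").append(escape(remap(e.getKey())))
                .append("\">").append(escape(e.getValue())).append("</a></li>\n");
        }
        return html.append("</ul></body></html>\n").toString();
    }

    public static void main(String[] args) {
        Map<String, String> pages = new LinkedHashMap<>();
        pages.put("/catalog/item.jsp?id=42", "Item 42");      // sample data only
        pages.put("/about.html", "About us");
        System.out.print(buildIndex(pages));
    }
}
```

The generated file would be published at a plain static URL and linked from the
home page, so a crawler only ever sees ordinary-looking ".html" links.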
--
Peter Pilgrim
"Putting PLEASE HELP ME! URGENT HELP! in the E-Mail Subject: Line,
will probably lead to the opposite intention." << The Windows Trash Can >>
---------------------------- Message History ----------------------------
From: Julian Doherty <[EMAIL PROTECTED]>@java.sun.com> on 18/05/2001 09:41
ZE12
Please respond to A mailing list about Java Server Pages specification and
reference <[EMAIL PROTECTED]>
DELEGATED - Sent by: A mailing list about Java Server Pages specification and
[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
cc:
Subject: Re: A Poor Man's JSP compatible Search Engine Implementation
We are using Lucene (http://www.lucene.com/) as a search engine on our
Intranet project with good results. It handles indexing terms, and running
searches on them. You would need to write/obtain a separate spider to parse
the pages and load the terms into it though, as it doesn't cover this part
of the search process.
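
As a rough illustration of the "separate spider" part, the sketch below pulls
the indexable terms and the outgoing links out of an HTML string. It is only an
assumption about what such a spider might do: the class and method names are
made up, the regex-based tag stripping is deliberately naive, and it makes no
Lucene calls at all. Fetching pages and handing the terms to the indexer are
left out.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical spider helper: turns rendered page output into terms and links.
public class PageTermExtractor {

    // Matches href="..." attributes, ignoring case (naive; double quotes only).
    private static final Pattern HREF =
            Pattern.compile("href\\s*=\\s*\"([^\"]*)\"", Pattern.CASE_INSENSITIVE);

    // Strips tags, lower-cases the text and splits it on non-alphanumeric runs.
    static List<String> extractTerms(String html) {
        String text = html.replaceAll("(?s)<[^>]*>", " ").toLowerCase();
        List<String> terms = new ArrayList<>();
        for (String word : text.split("[^\\p{L}\\p{N}]+")) {
            if (!word.isEmpty()) {
                terms.add(word);
            }
        }
        return terms;
    }

    // Collects link targets so the spider knows which pages to visit next.
    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    public static void main(String[] args) {
        String page = "<html><body><h1>Hello JSP</h1>"
                + "<a href=\"/next.jsp\">Next</a></body></html>";
        System.out.println(extractTerms(page));
        System.out.println(extractLinks(page));
    }
}
```

Because the spider requests pages over HTTP, it sees the rendered output of a
JSP rather than its source, which is what you want in the index.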
--<CUT>--