I was going to look at `neatseeker.sourceforge.net' first, but I will look at Lucene
at the same time.

The trouble with indexing also applies to the global search engines: Lycos, Google,
Excite and AltaVista. I was looking at Philip Greenspun's book "Web Publishing ..."
(Morgan Kaufmann, I think). He talks about the problems that search engines have
with anything other than static files (*.html, *.htm). Most don't index AOLserver
"*.tcl" or "*.apf" files. Which global search engines index JSP pages? I know that
some (e.g. Google) index "*.asp" and "*.pdf" pages.

Greenspun also said that global search engines won't index ring-fenced web sites,
because they do not perform authentication. His solution was to generate one massive
HTML file of hyperlinks that use remapped URLs. I think he means building an index
page that search engines can use.
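
To make that idea concrete, here is a rough sketch of how such a link page could
be generated. It is only my reading of the approach, not Greenspun's actual code.
The class name, the remapping rule (query parameters folded into the path, with a
".html" suffix) and the sample URLs are all assumptions made up for illustration.
The server would also need a mapping that turns the static-looking URLs back into
the real dynamic pages, and the sketch does no URL escaping.

```java
import java.util.List;

public class LinkIndexSketch {

    // Hypothetical remapping rule (an assumption, not from the book):
    // "/catalog.jsp?item=42" becomes "/catalog/item/42.html".
    public static String remap(String dynamicUrl) {
        int q = dynamicUrl.indexOf('?');
        String path = q < 0 ? dynamicUrl : dynamicUrl.substring(0, q);
        String query = q < 0 ? "" : dynamicUrl.substring(q + 1);

        // Drop the dynamic extension (".jsp", ".asp", ...) from the path.
        int dot = path.lastIndexOf('.');
        StringBuilder out = new StringBuilder(dot < 0 ? path : path.substring(0, dot));

        // Fold each name=value pair into the path as /name/value.
        if (!query.isEmpty()) {
            for (String pair : query.split("&")) {
                String[] kv = pair.split("=", 2);
                out.append('/').append(kv[0]);
                if (kv.length == 2) {
                    out.append('/').append(kv[1]);
                }
            }
        }
        return out.append(".html").toString();
    }

    // Build one plain HTML page with a hyperlink per remapped URL.
    public static String buildIndex(List<String> dynamicUrls) {
        StringBuilder html = new StringBuilder("<html><body>\n");
        for (String url : dynamicUrls) {
            String staticUrl = remap(url);
            html.append("<a href=\"").append(staticUrl).append("\">")
                .append(staticUrl).append("</a><br>\n");
        }
        return html.append("</body></html>\n").toString();
    }

    public static void main(String[] args) {
        System.out.print(buildIndex(List.of("/catalog.jsp?item=42", "/news.jsp")));
    }
}
```

The point of the remapping is that a crawler which skips URLs with query strings or
unfamiliar extensions sees only ordinary-looking ".html" links.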

--
Peter Pilgrim

"Putting PLEASE HELP ME! URGENT HELP! in the E-Mail Subject: Line,
will probably lead to the opposite intention."             << The  Windows Trash Can >>



---------------------------------------- Message History ----------------------------------------


From: Julian Doherty <[EMAIL PROTECTED]>@java.sun.com> on 18/05/2001 09:41 ZE12

Please respond to A mailing list about Java Server Pages specification and
reference <[EMAIL PROTECTED]>

DELEGATED - Sent by: A mailing list about Java Server Pages specification and
reference <[EMAIL PROTECTED]>


To:   [EMAIL PROTECTED]
cc:
Subject:  Re: A Poor Man's JSP compatible Search Engine Implementation


We are using Lucene (http://www.lucene.com/) as a search engine on our
Intranet project with good results. It handles indexing terms, and running
searches on them. You would need to write/obtain a separate spider to parse
the pages and load the terms into it though, as it doesn't cover this part
of the search process.
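
As a rough illustration of the spider part described above, the sketch below pulls
the links and the plain-text terms out of a page and loads the terms into a small
in-memory map. It deliberately does not use Lucene's API: the map is only a
stand-in for the real index, and the class and method names are hypothetical.
Fetching pages over HTTP and following the extracted links is left out, and the
regex-based HTML handling is an assumption that only holds for simple markup.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SpiderSketch {
    // Matches double-quoted href attributes only; good enough for a sketch.
    private static final Pattern HREF =
        Pattern.compile("href\\s*=\\s*\"([^\"]+)\"", Pattern.CASE_INSENSITIVE);
    private static final Pattern TAG = Pattern.compile("<[^>]*>");

    // term -> URLs of the pages containing it (stand-in for the real index).
    private final Map<String, Set<String>> index = new TreeMap<>();

    // Collect the link targets a spider would visit next.
    public static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    // Strip the tags, lower-case the text and split it into terms.
    public static List<String> extractTerms(String html) {
        String text = TAG.matcher(html).replaceAll(" ").toLowerCase(Locale.ROOT);
        List<String> terms = new ArrayList<>();
        for (String term : text.split("[^a-z0-9]+")) {
            if (!term.isEmpty()) {
                terms.add(term);
            }
        }
        return terms;
    }

    // Load the terms of one page into the index.
    public void addPage(String url, String html) {
        for (String term : extractTerms(html)) {
            index.computeIfAbsent(term, k -> new TreeSet<>()).add(url);
        }
    }

    // Look up the pages that contain a single term.
    public Set<String> search(String term) {
        return index.getOrDefault(term.toLowerCase(Locale.ROOT), new TreeSet<>());
    }

    public static void main(String[] args) {
        SpiderSketch spider = new SpiderSketch();
        String html = "<html><body><h1>JSP Search</h1>"
            + "<a href=\"/faq.jsp\">FAQ</a></body></html>";
        spider.addPage("/index.jsp", html);
        System.out.println(extractLinks(html));
        System.out.println(spider.search("jsp"));
    }
}
```

In a real setup the addPage step would hand the extracted text to the search
engine's own indexing calls instead of the map used here.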

--<CUT>--

