a complete solution for building a website search with lucene

2010-01-07 Thread jyzhou817
Hi, I am new to Lucene. To build a web search function, I need a backend indexing function. But before that, should I run a crawler? Lucene builds its index from HTML documents, while a crawler can turn the website's pages into HTML documents. Am I right? If so, please anyone suggest

Re: a complete solution for building a website search with lucene

2010-01-08 Thread jyzhou817
Hi Paul, Thanks. So I use Nutch to do the crawling and integrate Lucene into the web application so that it can search online. BTW, Nutch seems to have only a Linux version, while my development is on Windows. Am I right? Zhou --- On Fri, 8/1/10, Paul Libbrecht wrote: From: Paul Libbrecht Subject: Re

Re: a complete solution for building a website search with lucene

2010-01-10 Thread jyzhou817
Thanks. --- On Sat, 9/1/10, Simon Willnauer wrote: From: Simon Willnauer Subject: Re: a complete solution for building a website search with lucene To: java-user@lucene.apache.org Date: Saturday, 9 January, 2010, 6:16 PM I don't know that much about nutch but hadoop shouldn't really run under

Re: a complete solution for building a website search with lucene

2010-01-10 Thread jyzhou817
Hi, Have you implemented such a web search in your own web application development? Please be as detailed as possible. For example: 1) index: ? 2) search: Lucene. Please advise. Thanks. --- On Sat, 9/1/10, Simon Willnauer wrote: From: Simon Willnauer Subject: Re: a complete solution for building a websit

how to follow intranet: configuration in nutch website

2010-01-12 Thread jyzhou817
Hi, I am trying to follow the instructions at http://lucene.apache.org/nutch/tutorial8.html . Intranet: Configuration To configure things for intranet crawling you must: 1. Create a directory with a flat file of root urls. For example, to crawl the nutch site you might start with a file named

Re: how to follow intranet: configuration in nutch website

2010-01-12 Thread jyzhou817
Thanks. --- On Wed, 13/1/10, Otis Gospodnetic wrote: From: Otis Gospodnetic Subject: Re: how to follow intranet: configuration in nutch website To: java-user@lucene.apache.org Date: Wednesday, 13 January, 2010, 12:07 PM Zhou, Your question will get more attention if you send it to nutch-u...

Re: Applying LUCENE-1606 -- which version

2010-01-24 Thread jyzhou817
Hi, I am planning to integrate Lucene into my web application. Can you please give me the high-level steps to follow? Thanks Zhou --- On Fri, 22/1/10, Sriram Muthuswamy Chittathoor wrote: From: Sriram Muthuswamy Chittathoor Subject: Applying LUCENE-1606 -- which version To: java-user@lucene.apach

How to implement Web Search by integrating Lucene with web application and Nutch

2010-01-25 Thread jyzhou817
Hi, Can anyone tell me the high-level steps to follow to build a search function in a web application? Any suggestions for a tool to crawl the website? Nutch? Suggested steps? Thanks Zhou
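
For orientation, the steps asked about here usually come down to three stages: fetch the pages (the crawler's job, e.g. Nutch), reduce each page to plain text, and then index and query that text (Lucene's job). Below is a minimal toy sketch of the last two stages only. It is not Lucene or Nutch code: the class `SiteSearchSketch`, its methods, and the sample page paths are hypothetical names made up for illustration, the tag stripping is a crude regex, and the "index" is a plain in-memory map that stands in for what Lucene would do.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy stand-in for the "extract text, index, search" stages.
// Hypothetical illustration only; a real setup would use Lucene for the index.
public class SiteSearchSketch {
    // Inverted index: term -> sorted set of page URLs that contain the term.
    private final Map<String, Set<String>> index = new HashMap<>();

    // Crude tag stripping (assumption: good enough for a sketch;
    // real pages need a proper HTML parser).
    public static String htmlToText(String html) {
        return html.replaceAll("<[^>]*>", " ");
    }

    // "Index" one already-fetched page: lowercase the text, split it into
    // terms, and record the page URL under each term.
    public void addPage(String url, String html) {
        for (String term : htmlToText(html).toLowerCase().split("[^a-z0-9]+")) {
            if (!term.isEmpty()) {
                index.computeIfAbsent(term, k -> new TreeSet<>()).add(url);
            }
        }
    }

    // Return the pages that contain every term of the query (AND semantics).
    public Set<String> search(String query) {
        Set<String> result = null;
        for (String term : query.toLowerCase().split("[^a-z0-9]+")) {
            if (term.isEmpty()) {
                continue;
            }
            Set<String> hits = index.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(hits);
            } else {
                result.retainAll(hits);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    public static void main(String[] args) {
        SiteSearchSketch site = new SiteSearchSketch();
        // In a real setup these strings would come from the crawler.
        site.addPage("/index.html", "<html><body><h1>Welcome</h1><p>Lucene search demo</p></body></html>");
        site.addPage("/about.html", "<html><body><p>About this demo site</p></body></html>");
        System.out.println(site.search("demo"));
        System.out.println(site.search("lucene demo"));
    }
}
```

In a web application, the indexing step would typically run as a batch job after each crawl, and the search method would sit behind the page that handles the user's query.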