Re: Indexing the spider content

2008-06-25 Thread Grant Ingersoll
If it has an API that let's you get the content that needs to be indexed, then, sure, you can index from the spider. If it doesn't have an API, presumably, you would need to somehow extract the docs from the files it builds. This is, of course, assuming it stores the crawled files in some

Re: Indexing the spider content

2008-06-25 Thread yugana
We are using the VSpider... Yug John Wang wrote: > > Maybe building a Lucene gateway to hook in with VSpider. > Are you using VSpider or K2Spider? > > -John > > On Tue, Jun 24, 2008 at 8:35 PM, yugana <[EMAIL PROTECTED]> wrote: > >> >> Hi Otis, >> >> Thanks for the reply. So you mean it is

Re: Indexing the spider content

2008-06-24 Thread John Wang
Maybe building a Lucene gateway to hook in with VSpider. Are you using VSpider or K2Spider? -John On Tue, Jun 24, 2008 at 8:35 PM, yugana <[EMAIL PROTECTED]> wrote: > > Hi Otis, > > Thanks for the reply. So you mean it is not possible to use Lucene to index > the fetched (Verity Spider Content)

Re: Indexing the spider content

2008-06-24 Thread yugana
Hi Otis, Thanks for the reply. So you mean it is not possible to use Lucene to index the fetched (Verity Spider Content) content. Yug Otis Gospodnetic wrote: > > It sounds like you want to check out Nutch - fetched, indexer, searcher, > etc. in one lovely package. > > > Otis > -- > Sematext

Re: Indexing the spider content

2008-06-24 Thread Otis Gospodnetic
It sounds like you want to check out Nutch - fetched, indexer, searcher, etc. in one lovely package. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: yugana <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Tuesday, June 24, 2008