RE: [ANNOUNCE] Web Crawler

2011-05-15 Thread karl.wright
You might want to look at ManifoldCF also. Karl -Original Message- From: ext abhayd [mailto:ajdabhol...@hotmail.com] Sent: Saturday, May 14, 2011 9:29 AM To: java-user@lucene.apache.org Subject: Re: [ANNOUNCE] Web Crawler

Re: [ANNOUNCE] Web Crawler

2011-05-15 Thread abhayd
Hi Dominique, I am looking for a crawler to feed a Solr index. After looking at various posts, I have settled on two: Nutch and Crawl Anywhere. I don't see any activity on the Nutch wiki, so I am wondering whether it is no longer being developed. But most forums say Nutch is the standard for Solr. Crawl Anywhere

RE: Lucene 3.3 in Eclipse

2011-05-15 Thread Steven A Rowe
(Resending to the list - didn't notice that my reply went to Cheng directly) There is an Ant target "get-db-jar" that can do the downloading for you - you can see the URL it uses here: There is

RE: Lucene 3.3 in Eclipse

2011-05-15 Thread Uwe Schindler
Hi, Those JARs are not shipped with Lucene (different license). They are downloaded by the Ant build, so it is easiest to run the standard Ant build at least once. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de

RE: Lucene 3.3 in Eclipse

2011-05-15 Thread cheng
Steve, thanks for the correction. You are right. The version is 3.0.3, released last October. I did place an Ant jar in Eclipse, and it removed some of the compile errors. However, it seems that I still need some jar file to handle DbHandleExtractor.java and the org.apache.lucene.store.db pac

RE: Lucene 3.3 in Eclipse

2011-05-15 Thread Steven A Rowe
Hi Cheng, Lucene 3.3 does not exist - do you mean branches/branch_3x? FYI, as of Lucene 3.1, there is an Ant target you can use to set up an Eclipse project for Lucene/Solr - run this from the top-level directory of a full source tree (including the dev-tools/ directory) checked out from Subversio

Lucene 3.3 in Eclipse

2011-05-15 Thread cheng
Hi, I created a Java project for Lucene 3.3 in Eclipse and found that in the DbHandleExtractor.java file, the class com.sleepycat.db.internal.Db is not resolved. How can I fix this? I have tried to download a .jar for it, but I don't know which one I need or where to get it. Thanks