Lucene Index dump into Solr Index folder

2009-03-30 Thread Allahbaksh Mohammedali Asadullah
Hi All, I have a large Lucene index. I want to dump it into Solr and search it. I tried doing this in the following way but was unsuccessful. Can anyone let me know the proper way to do it? I copied the index created by Lucene 2.4.1 into apache-solr-1.3.0/example/solr/data/index fol

Re: Minimum HD usage during an optimize() call

2009-03-30 Thread Lebiram
Hi Mike, Yeah the constructor used for the IndexWriter is one with the implicit autoCommit=false property set. Thanks for the prompt reply. I'll keep this in mind. M From: Michael McCandless To: java-user@lucene.apache.org Sent: Monday, March 30, 2009 8:01

Re: Minimum HD usage during an optimize() call

2009-03-30 Thread Michael McCandless
In fact the documentation is wrong. With autoCommit=false (I assume you used that?) the usage will peak at 3X, because the writer keeps the initial index "alive" exactly as an open reader would do. I'll fix the docs. Mike On Mon, Mar 30, 2009 at 1:09 PM, Lebiram wrote: > Hi all, > > I was tryi
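As a rough illustration of the arithmetic in this reply, the sketch below estimates peak disk usage during optimize() as a multiple of the starting index size. Only the 3X peak for autoCommit=false comes from the message; the function names, the multiplier parameter, and the 10 GB example size are assumptions made for illustration.

```python
def peak_disk_usage(index_size_bytes, peak_multiplier=3):
    """Estimate peak disk usage during optimize().

    peak_multiplier=3 reflects the 3X figure quoted in the reply for
    autoCommit=false; treat the result as an estimate, not a guarantee.
    """
    if index_size_bytes < 0:
        raise ValueError("index size must be non-negative")
    return index_size_bytes * peak_multiplier


def extra_free_space_needed(index_size_bytes, peak_multiplier=3):
    """Space needed on top of what the starting index already occupies."""
    return peak_disk_usage(index_size_bytes, peak_multiplier) - index_size_bytes


if __name__ == "__main__":
    ten_gb = 10 * 1024**3  # hypothetical starting index size
    print("peak usage (GB):", peak_disk_usage(ten_gb) / 1024**3)
    print("extra free space (GB):", extra_free_space_needed(ten_gb) / 1024**3)
```

Under these assumptions, an index that starts at size X needs roughly 2X of additional free space before optimize() is called.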

Minimum HD usage during an optimize() call

2009-03-30 Thread Lebiram
Hi all, I was trying to determine if the documentation for optimize() is true: http://lucene.apache.org/java/2_4_0/api/core/org/apache/lucene/index/IndexWriter.html#optimize() Testing was done using Lucene 2.4. I basically have two Lucene indexes: Index A) one with no Searcher open during optimize

Re: What is an optimal approach?

2009-03-30 Thread mark harwood
If it is only a performance benchmark you need (as opposed to ongoing synching) then it would probably make life easier to read the original XML files from the file system (or first export them from MarkLogic to the file system if they were created in MarkLogic). From there it is a matter of i

RE: What is an optimal approach?

2009-03-30 Thread Shah, Yagnesh
Hello Mr. Harwood, I am aware of the in-built search capabilities, but I would like to get a performance benchmark. One way I can do this is to retrieve the content and index it, but I was looking for an optimal approach in case someone has already faced a similar situation. -Original Message- From: m

Re: Empty SinkTokenizer

2009-03-30 Thread Raymond Balmès
lucene 2.4.0 On Mon, Mar 30, 2009 at 2:18 PM, Grant Ingersoll wrote: > > On Mar 30, 2009, at 4:42 AM, Raymond Balmès wrote: > >> >> >> I found out that the fields are processed in alpha order... and not in >> creation order. Is there any reason for that ? >> > > Hmm, that doesn't sound right (in

Re: What is an optimal approach?

2009-03-30 Thread Mindaugas Žakšauskas
As someone who earns a living writing a CMS integrated with Lucene, I can tell you this is not that simple. You can of course index your data, but be aware that all your subsequent content repository operations should be kept in sync. Say, what if a piece of content is deleted from the CR? You

Re: What is an optimal approach?

2009-03-30 Thread mark harwood
That's probably more a question about MarkLogic APIs than it is about Lucene. What APIs does MarkLogic provide for getting at the content, e.g. does it provide a JSR-170 standard interface ( http://www.slideshare.net/uncled/introduction-to-jcr )? I presume you have already ruled out the in-built M

What is an optimal approach?

2009-03-30 Thread Shah, Yagnesh
Hello Lucene users, We have all our xml documents stored in a content management system from MarkLogic. Is there any best approach to index these documents via lucene?

Re: Empty SinkTokenizer

2009-03-30 Thread Grant Ingersoll
On Mar 30, 2009, at 4:42 AM, Raymond Balmès wrote: I found out that the fields are processed in alpha order... and not in creation order. Is there any reason for that ? Hmm, that doesn't sound right (in other words, something must have changed). What version of Lucene are you using? -Gr

Re: Empty SinkTokenizer

2009-03-30 Thread Raymond Balmès
Yes indeed confusing code... I was also very confused. In the meantime I solved my problem by checking in the tokenStream method of myAnalyzer which field was being looked at and applying the right stream to the right field. No idea if this is how it is intended to be done, but it works perfect in
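The workaround described in this message (checking the field name inside the analyzer and returning the matching stream) can be sketched roughly as follows. This is a plain Python illustration of the dispatch idea, not Lucene's API: the class name, the tokenizer functions, and the field names "id" and "body" are all hypothetical.

```python
import re


def whitespace_tokens(text):
    """Split on whitespace only, keeping case and punctuation."""
    return text.split()


def lowercase_word_tokens(text):
    """Extract word characters and lowercase them."""
    return [token.lower() for token in re.findall(r"\w+", text)]


class PerFieldAnalyzer:
    """Pick a tokenizer by field name, falling back to a default."""

    def __init__(self, default, per_field=None):
        self._default = default
        self._per_field = dict(per_field or {})

    def token_stream(self, field_name, text):
        # Look up the tokenizer registered for this field, if any.
        tokenizer = self._per_field.get(field_name, self._default)
        return tokenizer(text)


if __name__ == "__main__":
    analyzer = PerFieldAnalyzer(lowercase_word_tokens, {"id": whitespace_tokens})
    print(analyzer.token_stream("id", "AB-1 CD-2"))
    print(analyzer.token_stream("body", "Hello, World"))
```

Keying the choice on the field name means the result does not depend on the order in which fields are processed.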

Re: newbie question again

2009-03-30 Thread Michael McCandless
My guess is you're not using a 2.9-dev (nightly) version of Lucene's jar? That constructor for FSDirectory is new in 2.9-dev, and the book's sources are actively tracking Lucene's. However, you don't need to create your own Makefile: there's an ant build.xml script. You should be able to just ru

newbie question again

2009-03-30 Thread Timon Roth
Hello list, sorry, this is maybe a stupid question, but I can't resolve it, so maybe you can help me: I am trying to compile the indexer example from the book Lucene in Action, 2nd edition (http://www.manning.com/hatcher3/hatcher_meapch1.pdf), but I get the following error: -