Hi Fayyaz,
I recommend to use SAX or, maybe, a custom parser for large xml files .It 
should be faster than using Digester. The main difference between those xml 
parsers is that Digester needs to load the entire xml document in memory when 
it creates those objects, meanwhile you can parse the document and add its 
content in Lucene index on the fly using SAX. On the other hand with Digester 
the documents are parsed twice: once to transform the xml to Digester object 
and second you should use this object to add its content to Lucene index.
Digester is very good for small documents and if you don't want to worry about 
the xml parsing problems.
A custom parser maybe is the best solution if you want to have best 
performances. I chose this solution.

Regards,
Catalin

----- Original Message ----
From: syedfa <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Monday, November 19, 2007 5:43:28 AM
Subject: XML parsing using Lucene in Java


Dear Fellow Lucene Developers:

I am a java/jsp developer and have started learning lucene for the
 purpose
of creating a search engine for some books that I have in xml format.
  The
XML document is actually quite large, and would like to provide as
 accurate
results as possible to the user searching through these books.  My
 question
is, which xml parser do you recommend using, SAX or Digester?  Is there
 a
difference?  Does one parser provide better results than the other?
  What
about performance issues?

Any help that you can provide is greatly appreciated.  I look forward
 to
hearing from you soon.

Take care.
Sincerely;
Fayyaz

-- 
View this message in context:
 http://www.nabble.com/XML-parsing-using-Lucene-in-Java-tf4833124.html#a13827336
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]







      
____________________________________________________________________________________
Never miss a thing.  Make Yahoo your home page. 
http://www.yahoo.com/r/hs

Reply via email to