Re: Help with jump from 1.4.3 to 2.0.0

Gopikrishnan Subramani Wed, 20 Dec 2006 05:39:57 -0800

All I could suspect is perhaps you are trying to add documents to an index
that was originally created using Lucene 1.4.3.


If trying to create a fresh index doesn't work, you could send me your
indexer code so I can take a look.

-Gopi


On 12/19/06, JT Kimbell <[EMAIL PROTECTED]> wrote:



Hi,

I'm working on learning Lucene for my job, and the book one of my
professors
purchased for myself and her is Lucene In Action, which is a good book but
it is based on version 1.4.3 (I believe).  I am beginning to grasp a lot
of
the basic concepts behind Lucene and have a basic searching and indexing
program written on the said professor's server (which is running 1.4.3).
However, on my server for work I am using 2.0.0 and it was agreed that it
would be best that I use the newer version.  My program ran fine using
1.4.3, but once I made a few changes to make it compatible with 2.0.0 it
now
returns a Null Pointer Exception about 80% of the way through.

For some background on the files, they are all .txt files stored in a
directory that has folders representing different years (e.g. 2005),
within
that there are month folders (August 2005) and those folders contain all
the
documents.  When I catch the exception and print while File f my program
is
currently on, it says it is that August 2005 folder.  My program is
exactly
the same except for updating Field to be compatible with 2.0.0 and the
data
is an exact copy of the other data.

So I suppose I have two questions:

1)  The relevant methods from the two programs are below, does anyone have
any ideas why this isn't working, am I doing something wrong or assuming
something I shouldn't?  (If you need to see the full code with all
comments
for either program, let me know).

2)  Is there a good tutorial or something online for version 2.0.0 just to
help me understand it better?  Do you have any tips?

Version 1.4.3 Code
       //This method recursively calls itself when it finds a directory
       public void indexDirectory(IndexWriter writer, File dir) throws
IOException{
               File[] files = dir.listFiles();

               for(int i = 0; i < files.length; i++){
                       File f = files[i];
                       if (f.isDirectory()){
                               indexDirectory(writer, f);
                       }else if (f.getName().endsWith(".txt")){
                               indexFile(writer, f);
}
               }
       }

       //This method indexes each individual file
       public void indexFile(IndexWriter writer, File f) throws
IOException{

               if(f.isHidden() || !f.exists() || !f.canRead()){
                       return;
               }

               Document doc = new Document();
               doc.add(Field.Text("contents", new FileReader(f)));
               doc.add(Field.Keyword("filename", f.getCanonicalPath()));
               writer.addDocument(doc);
       }

Version 2.0.0 Code
       //This method recursively calls itself when it finds a directory
       public void indexDirectory(IndexWriter writer, File dir) throws
IOException{
               File[] files = dir.listFiles();

               for(int i = 0; i < files.length; i++){
                       File f = files[i];
                       try{
                            if (f.isDirectory()){
                                  indexDirectory(writer, f);
                      }else if (f.getName().endsWith(".txt")){  //Seems
this is where it is first thrown...
                                  indexFile(writer, f);
                                  System.out.println(f);
                            }
                       }catch(NullPointerException npe){
                               npe.printStackTrace(System.out);
                               System.out.println("File is: " + f);
                       }
               }
       }

       //This method indexes each individual file
       public void indexFile(IndexWriter writer, File f) throws
IOException{

               if(f.isHidden() || !f.exists() || !f.canRead()){
                       return;
               }

               Document doc = new Document();
               doc.add(new Field("contents", new FileReader(f)));
               doc.add(new Field("filename", f.getCanonicalPath(),
Field.Store.YES, Field.Index.UN_TOKENIZED));
               writer.addDocument(doc);
       }

Thanks so much for any help you can give me.  It seems strange to me that
when I print File f, it prints out a directory name (August 2005), but got
past the isDirectory statement and is now checking to see if it has a .txt
extension.

Thanks,

JT
--
View this message in context:
http://www.nabble.com/Help-with-jump-from-1.4.3-to-2.0.0-tf2846591.html#a7949145
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Help with jump from 1.4.3 to 2.0.0

Reply via email to