Re: Unintelligent implementation of IndexWriter locking?

2009-08-30 Thread Jan Peter Stotz
Mark Miller wrote: > Have you tried using a native lock factory? No - I did not even know of it's existence as it is nowhere "visible" from the IndexWriter class (not directly used and nowhere mentioned in the JavDoc. Could you please add a small hint after the description about the "locking file

Unintelligent implementation of IndexWriter locking?

2009-08-30 Thread Jan Peter Stotz
Hi Lucene users, at the moment I have some problems with the locking mechanism of IndexWriter. Some times my application quits/terminates before I can close the IndexWriter. Then the "write.lock" file remains and prohibits every write access to my index. Of course I could simply unlock the directo

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Jan Peter Stotz
Grant Ingersoll wrote: Note: ENCODING is whatever encoding the file is in, as in "UTF-8", if that is what your files are in. I think there is a misunderstanding, the WordExtractor extracts text from MS Word (.doc) files. Those files are binary and therefore does not have an encoding. I wou

Re: Using Lucene 2.3.0 with PDFBox

2008-02-13 Thread Jan Peter Stotz
Naman Gupta schrieb: PDF Box uses a particular function of the Object 'Field' which is only there in the lucene 1.4.3. *Field.UnIndexed("path", file.getPath() ) This statement should be a good replacement: new Field("path", file.getPath(), Field.Store.YES, Field.Index.UN_TOK

Re: Supported File Formats - PDF, MHT

2008-02-12 Thread Jan Peter Stotz
Naman Gupta wrote: Does lucene support the files in pdf and mht file formats. I wasnt able to retrieve any results after creating an index of such files. Well, the answer is simple: Lucene itself does not support any file format. You need a file parser that converts your files to a plain text

Re: Using a QueryParser with an untokenized field?

2008-02-01 Thread Jan Peter Stotz
Hi Eleanor. In my Lucene index there's a field that contains the local names of XML elements, one name per document. Users can enter arbitrary queries for this field, so I'm using a QueryParser. From reading around it looks as if the field needs to be tokenized, but since the field's conten

Re: Creating an alias for a field name?

2008-01-19 Thread Jan Peter Stotz
Hi Erick, thanks for your response. Not that I know of. I presume that you want this to reduct typing or some such. Your app could simply massage the query that was typed, doing the appropriate substitutions before parsing the query Well I found a much better solution which avoids "double

Creating an alias for a field name?

2008-01-19 Thread Jan Peter Stotz
Hi, I would like to provide multiple field-names that are all mapped to the same field in background (e.g. a long field-name and a short field-name). Is there any mechanism for creating such field-aliases, may be in the IndexWriter or an QueryParser? Jan