Re: rename bug

2005-07-25 Thread Hu Pan
I have downloaded the source code and use File renameTo to do the rename instead of input/output stream copying. Things are working under Windows. Is it possible to check OS type and call the correct methods? Anyway, I would put up a blog when having time so that people can search for the solu

Re: rename bug

2005-07-25 Thread Chris Lu
This is actually a known problem, particular to windows OS. I remember Doug Cuttings has talked about it before. Inside Lucene, the get-around is to try to rename the file until successful. The IOException you saw may not affect your indexes' integrity. But I said so just based on experience, not

rename bug

2005-07-25 Thread Thomas Pan
Run a small indexer within Eclipse 3.1 under Windows and encounter rename bug all the time: java.io.IOException: Cannot rename deleteable.new to deletable at org.apache.lucene.store.FSDirectory.renameFile(FSDirectory.java:239) at org.apache.lucene.index.IndexWriter.writeDeleteableFiles(I

[Fwd: rename bug]

2005-07-25 Thread Thomas Pan
--- Begin Message --- Run a small indexer within Eclipse 3.1 under Windows and encounter rename bug all the time: java.io.IOException: Cannot rename deleteable.new to deletable at org.apache.lucene.store.FSDirectory.renameFile(FSDirectory.java:239) at org.apache.lucene.index.IndexWrite

Re: Index with more than one analyzer?

2005-07-25 Thread Otis Gospodnetic
If you go with one index and distinct fields for the 3 variations you mentioned, then you can perhaps make use of PerFieldAnalyzerWrapper. Otis --- Andrew Boyd <[EMAIL PROTECTED]> wrote: > Hi All, > When I first started my project I was creating 3 indexes. > Standard, Synonym and SoundsLike.

Index with more than one analyzer?

2005-07-25 Thread Andrew Boyd
Hi All, When I first started my project I was creating 3 indexes. Standard, Synonym and SoundsLike. Now that the QueryParser has the ability to put multiple tokens in one position I no longer have to inject the synonyms at index creation time. So I really don't have to have a seperate index

Re: filtering on searching or after through hits?

2005-07-25 Thread Riccardo Daviddi
ok:) so I do this "filtering" on the hits. thx On 7/25/05, Erik Hatcher <[EMAIL PROTECTED]> wrote: > > On Jul 25, 2005, at 7:05 AM, Riccardo Daviddi wrote: > > Hi all, > > I have to filter my searching in two ways: > > 1) by score, I want that only docs with score higher than a threshold > > are

Re: Relations between documents

2005-07-25 Thread Chris Lu
Then your lucene Document can be: Survey + n Variables, The idea is just to put everything related to a combined Document, and search on it. -- Chris Lu - Full-Text Search on Any Database http://www.dbsight.net On 7/25/05, Magne Skjeret <[EMAIL PROTECTED]> wrote: > Chris L

RE: Relations between documents

2005-07-25 Thread Rajesh Munavalli
Hi Magne, Assuming number of variables will not be large, build 2 indexes Index 1: contents and documents(variable1 + variable2 + variable3...) Index 2: documents and variables Example Query: Get "Survey" where "Variable.label" contains "gender" Step 1) First query would be to "Index 1

Re: Relations between documents

2005-07-25 Thread Magne Skjeret
Chris Lu wrote: >Maybe you can have one Document = Survey+Variable, >and have fields like "abstract", "label". > >Just search on the combined documents. > > Hi A Survey can have 100's of variables. That means that the Survey part is duplicated N variable times. Or did I misunderstand you? Magn

Re: Relations between documents

2005-07-25 Thread Chris Lu
Maybe you can have one Document = Survey+Variable, and have fields like "abstract", "label". Just search on the combined documents. -- Chris Lu - Full-Text Search on Any Database http://www.dbsight.net On 7/25/05, Magne Skjeret <[EMAIL PROTECTED]> wrote: > Hi > > I have be

Re: OutOfMemory errors while indexing large documents

2005-07-25 Thread Chris Lu
Also, be careful with MySql JDBC driver. Depends on how you use MySql, you could have OutOfMemory errors, which may not be Lucene or parsers' problem. -- Chris Lu - Full-Text Search on Any Database http://www.dbsight.net On 7/25/05, Harini Raghavan <[EMAIL PROTECTED]> wrote:

Relations between documents

2005-07-25 Thread Magne Skjeret
Hi I have been reading around a bit on the net to find a solution for my problem, but no luck yet. I am pretty sure it can be done, but I haven't found out how yet. My problem is that I have relations between documents. E.g Survey --> Variables Each survey can have loads of variables, and

Re: OutOfMemory errors while indexing large documents

2005-07-25 Thread Harini Raghavan
I am using org.htmlparser.parserapplications.StringExtractor to parse the html pages, I guess the OutOfMemory occurs while parsing the large HTML pages and not while indexing. Sorry about the confusion. - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: Sent: Monday,

Re: Searching in Multiple Index

2005-07-25 Thread Erik Hatcher
What is the point of having empty indexes? I don't know for sure without trying it myself, but perhaps you need to write at least one document to the index for it to be a valid index? Don't continue until Luke is happy with your index. Erik On Jul 25, 2005, at 7:39 AM, Daniel Cor

Re: OutOfMemory errors while indexing large documents

2005-07-25 Thread Erik Hatcher
Could you be more specific about where the OutOfMemory error is happening? Do you have a complete stack trace? As for maxFieldLength - in my use of Lucene, it is necessary to index the entire document and not just the first 10,000 or so terms - I set maxFieldLength to Integer.MAX_VALUE.

Re: filtering on searching or after through hits?

2005-07-25 Thread Erik Hatcher
On Jul 25, 2005, at 7:05 AM, Riccardo Daviddi wrote: Hi all, I have to filter my searching in two ways: 1) by score, I want that only docs with score higher than a threshold are returned. 2) by num of docs, I want that only, for example, the first 20 docs are returned. It's better do it with a

Searching in Multiple Index

2005-07-25 Thread Daniel Cortes
I have a problem searching in 6 index lucene at same time, i founded the class MultiSearcher but I think I don't use corretly because it show me this error: java.io.IOException: Bad file descriptor at java.io.RandomAccessFile.seek(Native Method) at org.apache.lucene.store.FSInputSt

OutOfMemory errors while indexing large documents

2005-07-25 Thread Harini Raghavan
Hi All, I am using lucene to index large documents(HTML pages). The application is running on JBoss and MySQL on UNIX. The indexing is throwing OutOfMemory errors beyond a certain point. I am not sure why this is happening. I am using the default IndexWriter properties, but the lucene documenta

filtering on searching or after through hits?

2005-07-25 Thread Riccardo Daviddi
Hi all, I have to filter my searching in two ways: 1) by score, I want that only docs with score higher than a threshold are returned. 2) by num of docs, I want that only, for example, the first 20 docs are returned. It's better do it with a queryfilter or search for all and after look through the

Re: Index locked exception while updating index

2005-07-25 Thread Erik Hatcher
On Jul 24, 2005, at 12:17 PM, Harini Raghavan wrote: Hi All, I am trying to add paging functionality while using lucene search. I have created a PageFilter what takes in the current page num and the number of records as input and invoking the IndexSearcher passing the a Boolean Query obj