addIndexes causing IndexOutOfBoundsException

2006-02-07 Thread Chris Fraschetti
:92)
at org.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:487)
at org.apache.lucene.index.IndexWriter.optimize(IndexWriter.java:366)
at org.apache.lucene.index.IndexWriter.addIndexes(IndexWriter.java:399)

post-normalization score filter

2005-08-13 Thread Chris Fraschetti
e. thanks as always! -- _______ Chris Fraschetti e [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

max number of documents

2005-08-10 Thread Chris Fraschetti
this? watch my count and switch to a new index when the time comes, then search across both indexes?

search caching

2005-08-03 Thread Chris Fraschetti
earch?

Re: indexed document id

2005-08-01 Thread Chris Fraschetti
If I'm using multiple threads to add documents to the index, can it be assumed that they will be added to the index in the order they are presented to the IndexWriter, so that keeping my own local doc id count would hold true?
-Chris Fraschetti
On 7/29/05, Erik Hatcher <[EMAIL PROTECTED
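A minimal sketch of the "local doc id count" idea from the question above, in plain Java. It assumes, and the thread does not confirm, that the underlying writer assigns ids in the order add calls complete. The class name SequencedAdder and the Consumer-based sink are hypothetical stand-ins for the real add call, not part of Lucene. The point is only that the add and the counter increment happen under one lock, so threads cannot interleave between them.

```java
import java.util.function.Consumer;

/** Hands out sequential local ids while serializing the underlying add call. */
final class SequencedAdder<D> {
    // Hypothetical stand-in for the real "add document" call on the index writer.
    private final Consumer<D> sink;
    private int nextId = 0;

    SequencedAdder(Consumer<D> sink) {
        this.sink = sink;
    }

    /** Adds the document and returns the local id assigned to it. */
    synchronized int add(D doc) {
        sink.accept(doc);   // add and id assignment share one lock
        return nextId++;    // ids start at 0 and grow by one per add
    }

    /** Number of documents added so far through this wrapper. */
    synchronized int count() {
        return nextId;
    }
}
```

Every indexing thread would call add() on one shared instance. The trade-off is that adds are fully serialized, so this only fits if the local id matters more than parallel add throughput.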

indexed document id

2005-07-29 Thread Chris Fraschetti
I've got an index which I rebuild each time and don't do any deletes until the end, so doc ids shouldn't change... at index time, is there a better way to discover the id of the document I just added than docCount()?

Index merge and java heap space

2005-07-28 Thread Chris Fraschetti
ith using merge on a large scale? Thanks in advance

URL Stemmer

2005-07-27 Thread Chris Fraschetti
nt data to help me eliminate duplicates... are there any good methods that are commonly used for URL stemming?

Re: Lucene - PDFBox

2005-05-25 Thread Chris Fraschetti
> > Hits hits = searcher.search(query);
> > System.out.println(hits.length() + " total matching documents");
> >
> > final int HITS_PER_PAGE = 10;
> > for (int start = 0; start < hits.length(); start += HITS_PER_PAGE) {
> >   int end = Math.min(hits.length(), start + HITS_PER_PAGE);
> >   for (int i = start; i < end; i++) {
> >     Document doc = hits.doc(i);
> >     String path = doc.get("path");
> >     if (path != null) {
> >       System.out.println(i + ". " + path);
> >     } else {
> >       String url = doc.get("url");
> >       if (url != null) {
> >         System.out.println(i + ". " + url);
> >         System.out.println(" - " + doc.get("title"));
> >       } else {
> >         System.out.println(i + ". " + "No path nor URL for this document");
> >       }
> >     }
> >   }
> >
> >   if (hits.length() > end) {
> >     System.out.print("more (y/n) ? ");
> >     line = in.readLine();
> >     if (line.length() == 0 || line.charAt(0) == 'n')
> >       break;
> >   }
> > }
> > }
> > searcher.close();
> >
> > } catch (Exception e) {
> >   System.out.println(" caught a " + e.getClass() +
> >       "\n with message: " + e.getMessage());
> > }
> > }
> > }