Re: "docMap" array in SegmentMergeInfo

2005-10-12 Thread Yonik Seeley
Here's the patch: http://issues.apache.org/jira/browse/LUCENE-454 It resulted in quite a performance boost indeed! On 10/12/05, Yonik Seeley <[EMAIL PROTECTED]> wrote: > > Thanks for the trace Peter, and great catch! > It certainly does look like avoiding the construction of the docMap for a > Mu

Need help with having lucene search result pipe to a file or memory

2005-10-12 Thread Truc Truong
I've been trying to figure out how to set up an OutputStream to write search results to a file. Now I'm wondering whether it would be more efficient to pipe them to memory instead. Any suggestions? Bucky
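A minimal sketch of the file-versus-memory choice, assuming the search results have already been turned into plain strings (the class name `ResultSink`, its methods, and the sample result lines are hypothetical, not part of Lucene). Writing against the `OutputStream` abstraction lets the same code target a `FileOutputStream` or a `ByteArrayOutputStream`:

```java
import java.io.ByteArrayOutputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

public class ResultSink {
    // Writes one result per line to whatever stream the caller supplies.
    public static void writeResults(List<String> results, OutputStream out) {
        PrintWriter writer =
            new PrintWriter(new OutputStreamWriter(out, StandardCharsets.UTF_8));
        for (String line : results) {
            writer.print(line);
            writer.print('\n');
        }
        writer.flush(); // flush only; closing the stream is left to the caller
    }

    // In-memory variant: collects the output in a byte buffer.
    public static String toMemory(List<String> results) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        writeResults(results, buffer);
        return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
    }

    // File variant: same writer logic, different stream.
    public static void toFile(List<String> results, String path) throws IOException {
        try (OutputStream out = new FileOutputStream(path)) {
            writeResults(results, out);
        }
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical result lines standing in for real search hits.
        List<String> results = Arrays.asList("doc1\t0.92", "doc7\t0.41");
        System.out.print(toMemory(results));
        toFile(results, "results.txt");
    }
}
```

Which target is more efficient depends on the size of the result set: the in-memory buffer avoids disk I/O but holds everything on the heap at once.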

Re: RemoteSearchable woes

2005-10-12 Thread Jeff Rodenburg
I'll take the no-response as a "no". :-) On 10/11/05, Jeff Rodenburg <[EMAIL PROTECTED]> wrote: > > Anyone running RemoteSearchable? I'm on v1.4.3 and am using it just fine, > until I need to: > > 1) use a custom sort, or > 2) use something that extends HitCollector > > I've got an idea as to the

Re: Lucene and remote index and java applet, with no java app server

2005-10-12 Thread Dan Armbrust
So here comes the next part of my applet ignorance. Can I embed the Lucene, etc., jar files in my applet so that when the user starts up the applet, they can be used on the local machine? This alone probably stops me from using an applet, I guess. Anyone have any idea where the definitive rules

Re: wildcards within a phrase query

2005-10-12 Thread Robert Watkins
Thank you, Daniel. Once I get an environment suitable enough I will certainly give this a whirl. -- Robert On Wed, 12 Oct 2005, Daniel Naber wrote: On Wednesday 12 October 2005 17:18, Robert Watkins wrote: Does that sound reasonable -- and scalable -- to you? I don't think you need to itera

RemoteSearchable woes

2005-10-12 Thread Jeff Rodenburg
Anyone running RemoteSearchable? I'm on v1.4.3 and am using it just fine, until I need to: 1) use a custom sort, or 2) use something that extends HitCollector I've got an idea as to the reasons why (serialization and remoteness), but how do I get around these? Anyone run into issues like these an

Re: wildcards within a phrase query

2005-10-12 Thread Daniel Naber
On Wednesday 12 October 2005 17:18, Robert Watkins wrote: > Does that sound reasonable -- and scalable -- to you? I don't think you need to iterate at all; you can easily expand the terms of a query: QueryParser qp = new QueryParser("f", new StandardAnalyzer()); Query q = qp.parse("e*");

Re: maximum number of documents

2005-10-12 Thread jian chen
Hi, Koji, I think you are right, the max num of documents should be Integer.MAX_VALUE. Some more points below: 1) I double checked the Lucene documentation. The file format section says that SegSize is UInt32. I don't think this is accurate, as UInt32 is around 4 billion, but Integer.MAX_VAL
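The arithmetic behind this exchange can be sketched as follows, assuming (as suggested in the thread) that a single index holds at most Integer.MAX_VALUE documents. The class and method names are hypothetical, and the sketch ignores any practical limits such as memory or disk:

```java
public class DocLimits {
    // Largest value an unsigned 32-bit field can hold: 4,294,967,295.
    public static final long UINT32_MAX = 0xFFFFFFFFL;

    // Number of indices needed if each holds at most Integer.MAX_VALUE docs
    // (ceiling division done in long arithmetic to avoid overflow).
    public static long indicesNeeded(long totalDocs) {
        long perIndex = Integer.MAX_VALUE; // 2,147,483,647
        return (totalDocs + perIndex - 1) / perIndex;
    }

    public static void main(String[] args) {
        System.out.println("Integer.MAX_VALUE = " + Integer.MAX_VALUE);
        System.out.println("UInt32 max        = " + UINT32_MAX);
        System.out.println("Indices for 8 billion docs = "
            + indicesNeeded(8_000_000_000L));
    }
}
```

Under that assumption, 8 billion documents need four indices, since three indices cover only about 6.4 billion.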

Re: wildcards within a phrase query

2005-10-12 Thread Robert Watkins
Having now looked at the test cases in SVN (specifically, TestMultiPhraseQuery.java), I cannot see any tests using simple wildcards, only terms ending with *, and thus suitable for a PrefixQuery. The examples do reveal how it could be done for wildcards, but my concern turns to scalability. I am
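To make the scalability concern concrete, here is a rough sketch of expanding a simple wildcard into the matching terms for one phrase position. It does not use the Lucene API: a plain sorted set stands in for the index's term dictionary, and the class name, method names, and the `maxExpansions` cap are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;
import java.util.regex.Pattern;

public class WildcardExpansion {
    // Translates '*' (any run of characters) and '?' (any single character)
    // into a regular expression; every other character is matched literally.
    static Pattern toPattern(String wildcard) {
        StringBuilder regex = new StringBuilder();
        for (char c : wildcard.toCharArray()) {
            if (c == '*') {
                regex.append(".*");
            } else if (c == '?') {
                regex.append('.');
            } else {
                regex.append(Pattern.quote(String.valueOf(c)));
            }
        }
        return Pattern.compile(regex.toString());
    }

    // Scans every term and collects the matches, refusing to go past the cap.
    public static List<String> expand(String wildcard, SortedSet<String> terms,
                                      int maxExpansions) {
        Pattern pattern = toPattern(wildcard);
        List<String> matches = new ArrayList<>();
        for (String term : terms) {
            if (pattern.matcher(term).matches()) {
                if (matches.size() == maxExpansions) {
                    throw new IllegalStateException(
                        "more than " + maxExpansions + " expansions for " + wildcard);
                }
                matches.add(term);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Hypothetical term dictionary.
        SortedSet<String> terms =
            new TreeSet<>(Arrays.asList("tent", "test", "text", "toast"));
        System.out.println(expand("te?t", terms, 10));
    }
}
```

Unlike a prefix, a general wildcard cannot be answered by seeking to one contiguous range of the sorted terms, so this sketch visits every term. Both that scan and the number of resulting expansions grow with the size of the vocabulary.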

Re: Lucene and remote index and java applet, with no java app server

2005-10-12 Thread J. David Boyd
Jon Schuster wrote: > Sorry about that, "download" was a poor word choice. > > By download, I meant that after the applet opens an input stream to the > URL, it will need to read from the stream to get all the index data from > the web server to the user's machine so the applet can perform the > s

Re: "docMap" array in SegmentMergeInfo

2005-10-12 Thread Yonik Seeley
Thanks for the trace Peter, and great catch! It certainly does look like avoiding the construction of the docMap for a MultiTermEnum will be a significant optimization. -Yonik Now hiring -- http://tinyurl.com/7m67g On 10/12/05, Peter Keegan <[EMAIL PROTECTED]> wrote: > > Here is one stack trace:

Re: "docMap" array in SegmentMergeInfo

2005-10-12 Thread Peter Keegan
Here is one stack trace: Full thread dump Java HotSpot(TM) Client VM (1.5.0_03-b07 mixed mode): "Thread-6" prio=5 tid=0x6cf7a7f0 nid=0x59e50 waiting for monitor entry [0x6d2cf000..0x6d2cfd6c] at org.apache.lucene.index.SegmentReader.isDeleted(SegmentReader.java:241) - waiting to lock <0x04e40278>

maximum number of documents

2005-10-12 Thread Koji Sekiguchi
Hello, Is the maximum number of documents in an index Integer.MAX_VALUE? (approx 2 billion) If so, if I want to have 8 billion docs indexed, like Google, can I do it with having four indices, theoretically? Koji