Re: Lucene search benchmark/stress test tool

2006-04-26 Thread Sunil Kumar PK
Hi, I have added some code in the Lucene 1.9 - source code for Lucene RemoteParallelMultisearcher performance benchmark. I have recorded the time to execute the 'searchables[i].docFreq(term)' (in MultiSearcher.java) method in both client and server, and for ' searchable.search' (in ParallelMul

Re: DateTools question

2006-04-26 Thread Bill Snyder
Makes sense. Thanks for the response! --Bill On 4/26/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: > > > : Why does DateTools.dateToString() return a String representation of my > Date, > : but in a different TimeZone. Does it use its own Calendar/TimeZone > settings? > > Yes, DateTime is hardco

Re: performance differences between 1.4.3 and 1.9.1

2006-04-26 Thread karl wettin
27 apr 2006 kl. 02.18 skrev Andy Goodell: For my application we have several hundred indexes, different subsets of which are searched depending on the situation. Aside from not upgrading to lucene 1.9, or making a big index for every possible subset, do you have any ideas for how can we mainta

Re: performance differences between 1.4.3 and 1.9.1

2006-04-26 Thread Andy Goodell
For my application we have several hundred indexes, different subsets of which are searched depending on the situation. Aside from not upgrading to lucene 1.9, or making a big index for every possible subset, do you have any ideas for how can we maintain fast performance? - andy g On 4/26/06, Da

Re: How to serach in sentence and dispaly the whole sentence

2006-04-26 Thread anton feldmann
Are the names of a field in a document unique or can i make a field with the name "sentence" for each sentence in an text document? Grant Ingersoll schrieb: Anton, I think there are at least a couple of ways of doing this. I assume you have a program that does sentence detection already, as

Re: performance differences between 1.4.3 and 1.9.1

2006-04-26 Thread Daniel Naber
On Mittwoch 26 April 2006 01:22, RONALD MANTAY wrote: > However when searching muliple indexes with multiSearcher and with a > FuzzyQuery with a prefixLength of 1. The search against 3.7m documents > spread over 23 indexes (due to the natural grouping of the data) the > time changed from 800ms t

Re: Dealing with acronyms

2006-04-26 Thread Rajesh Munavalli
> > > So I guess its done by writing or extending an anylzer? > Yes...thats correct. --Rajesh Munavalli Blog: http://munavalli.blogspot.com

Re: Dealing with acronyms

2006-04-26 Thread Hannes Carl Meyer
Rajesh Munavalli schrieb: On 4/26/06, Hannes Carl Meyer <[EMAIL PROTECTED]> wrote: Hi All, I would like enable users to do an acronym search on my index. My idea is the following: 1.) Extract acronyms (ABS, ESP, VCG etc.) from the given document (which is going to be indexed) In cas

Re: Dealing with acronyms

2006-04-26 Thread Rajesh Munavalli
On 4/26/06, Hannes Carl Meyer <[EMAIL PROTECTED]> wrote: > > Hi All, > > I would like enable users to do an acronym search on my index. > My idea is the following: > > 1.) Extract acronyms (ABS, ESP, VCG etc.) from the given document (which > is going to be indexed) In case you havent already loo

Re: RAM Directory / querying Performance issue

2006-04-26 Thread Doug Cutting
Is this markedly faster than using an MMapDirectory? Copying all this data into the Java heap (as RAMDirectory does) puts a tremendous burden on the garbage collector. MMapDirectory should be nearly as fast, but keeps the index out of the Java heap. Doug z shalev wrote: I've rewritten

Re: Dealing with acronyms

2006-04-26 Thread Stefan Will
This makes perfect sense to me. Of course the hard part will be how to extract the acronyms. -- Stefan Hannes Carl Meyer wrote: Hi All, I would like enable users to do an acronym search on my index. My idea is the following: 1.) Extract acronyms (ABS, ESP, VCG etc.) from the given document

How to display a field value

2006-04-26 Thread anton feldmann
Hi how do i display the whole field value of an document the query string is found? cheers anton - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Dealing with acronyms

2006-04-26 Thread Hannes Carl Meyer
Hi All, I would like enable users to do an acronym search on my index. My idea is the following: 1.) Extract acronyms (ABS, ESP, VCG etc.) from the given document (which is going to be indexed) 2.) Store the extracted acronyms in a field, for example called "case" 3.) On search, asking the u

Filter operation

2006-04-26 Thread Tom Emerson
Greetings, If I write a filter, does this run over the documents in the index *before* a search is made (i.e., every document in the index is touched) or on the result set after the search? If it is run over all of the documents, doesn't this become a performance bottleneck on any non-trivial filt

Re: Partial token matches

2006-04-26 Thread Chris Hostetter
: I'm sure the guys will chime in, but I think you're in significant danger of : getting a "too many clauses" exception thrown. Try searching on, say, "an". : Under the covers, Lucene expands your query to have a clause for *every* : item in your index that starts with "an", so there's a clause fo

Re: DateTools question

2006-04-26 Thread Chris Hostetter
: Why does DateTools.dateToString() return a String representation of my Date, : but in a different TimeZone. Does it use its own Calendar/TimeZone settings? Yes, DateTime is hardcoded to use GMT for it's string representations. It wouldn't be safe for DateTools to use your current TimeZone/Loca

DateTools question

2006-04-26 Thread Bill Snyder
Hello, Why does DateTools.dateToString() return a String representation of my Date, but in a different TimeZone. Does it use its own Calendar/TimeZone settings? F.I. DateFormat format = new SimpleDateFormat("-MM-dd hh:mm:ss.SSS"); System.out.println(DateTools.dateToString(format.parse("2006-

Re: Partial token matches

2006-04-26 Thread Erick Erickson
I'm sure the guys will chime in, but I think you're in significant danger of getting a "too many clauses" exception thrown. Try searching on, say, "an". Under the covers, Lucene expands your query to have a clause for *every* item in your index that starts with "an", so there's a clause for "an" "a

Lucene search benchmark/stress test tool

2006-04-26 Thread Otis Gospodnetic
Hi, I'm about to write a little command-line Lucene search benchmark tool. I'm interested in benchmarking search performance and the ability to specify concurrency level (# of parallel search threads) and response timing, so I can calculate min, max, average, and mean times. Something like 'a

Re: java.io.IOException: Stale NFS file handle

2006-04-26 Thread Otis Gospodnetic
Steve, There are some locks involved in search, like the one that gets written to the FS before the readers reads all the segment/index files listed in segments file. Once they are all read, the lock is released. Setting lock dir to the local /tmp doesn't sound good, as locks have to be in th

Partial token matches

2006-04-26 Thread Eric Isakson
Hi All, Just wanted to throw out something I'm working on. It is working well for me, but I wanted to see if anyone can suggest any other alternatives that might perform better than what I'm doing now. I have a field in my index that contains keywords (back of the book index terms) and a UI fe

Re: MatchAllDocsQuery, MultiSearcher and a custom HitCollector throwing exception

2006-04-26 Thread jm
ok, thanks for letting me know. I entered a bug, 556. javi On 4/26/06, Yonik Seeley <[EMAIL PROTECTED]> wrote: > Hi Jim, > > This went to the old mailing list... > Could you email this to java-user@lucene.apache.org > and maybe open a JIRA bug for it? > > -Yonik > > On 4/26/06, jm <[EMAIL PROTECT

Re: MatchAllDocsQuery, MultiSearcher and a custom HitCollector throwing exception

2006-04-26 Thread jm
On 4/26/06, Yonik Seeley <[EMAIL PROTECTED]> wrote: > Hi Jim, > > This went to the old mailing list... > Could you email this to java-user@lucene.apache.org > and maybe open a JIRA bug for it? > > -Yonik > > On 4/26/06, jm <[EMAIL PROTECTED]> wrote: > > Hi, > > > > I have encountered an issue with

Re: MatchAllDocsQuery, MultiSearcher and a custom HitCollector throwing exception

2006-04-26 Thread Yonik Seeley
Hi Jim, This went to the old mailing list... Could you email this to java-user@lucene.apache.org and maybe open a JIRA bug for it? -Yonik On 4/26/06, jm <[EMAIL PROTECTED]> wrote: > Hi, > > I have encountered an issue with lucene1.9.1. It involves > MatchAllDocsQuery, MultiSearcher and a custom

MatchAllDocsQuery, MultiSearcher and a custom HitCollector throwing exception

2006-04-26 Thread jm
Hi, I have encountered an issue with lucene1.9.1. It involves MatchAllDocsQuery, MultiSearcher and a custom HitCollector. The following code throws java.lang.UnsupportedOperationException. If I remove the MatchAllDocsQuery condition (comment whole //1 block), or if I dont use the custom hitcoll

RAM Directory / querying Performance issue

2006-04-26 Thread zzzzz shalev
I've rewritten the RAM DIR to supprt 64 bit (still havent had time to add this to lucene, hopefully in the coming months when i have a free second) My question: i have a machine with 4 GB RAM i have a 3GB index file, i successfully load the 3GB index into memory, the

Highlight

2006-04-26 Thread anton feldmann
Hi I wrote a program that make a pdf document to an Lucene document. The field ate "contents", "sentence", : How do i display the sentence the query String is in? and how do I Highlight the String? cheers anton feldmann package de.coli.seek.lucene; import java.io.File; import java.io

java.io.IOException: Stale NFS file handle

2006-04-26 Thread Schwenker, Stephen
Hey, I'm running into this exception with my lucene searching. We have a cluster of 2 servers that execute searches and one server in the back end that writes to the index. I thought that setting up the external boxes on nfs would be alright since searching doesn't require locking. Can anyo

Cannot save index to 'index' directory, please delete it first

2006-04-26 Thread 一只小蚂蚁
I have met a error like this:"Cannot save index to 'index' directory, please delete it first" when I run the demo in lucene1.9.1. please tell me why? i hava set classpath! -- 『忙忙碌碌 ★ 碌碌无为』 一只小蚂蚁 http://blog.csdn.net/qixiang_nj

Re: search problem

2006-04-26 Thread April06
I guess that fixes the problem. Thanx -- View this message in context: http://www.nabble.com/search-problem-t1506294.html#a4096490 Sent from the Lucene - Java Users forum at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTE