Re: Poor performance "race condition" in FieldSortedHitQueue

2006-10-20 Thread kalpesh patel
Hi, Sometimes when users search on the same term at the same time, the search is delayed and memory usage rises considerably. I was going through the Lucene user list and saw this discussion about a race condition. http://www.gossamer-threads.com/lists/lucene

RE: Wildcard Search and "Note: You cannot use a * or ? symbol as the first character of a search"

2006-10-20 Thread Steven Parkes
You can go to Jira and get the patch and/or vote for it: https://issues.apache.org/jira/browse/LUCENE-489 [Not that this issue needs much voting, I just like the idea of encouraging voting. Get Out the Vote (if that's TM'd, I take it back.)] -Original Message- From: Otis Gospodnetic

Re: "Catalog" backend for document stored fields?

2006-10-20 Thread Mike Klaas
On 10/20/06, Robichaud, Jean-Philippe <[EMAIL PROTECTED]> wrote: 3- Any ideas on how else I could do this? I'm fully open to discussion! How about not storing the fields at all, but storing term vectors, and reconstructing the data from termpositions + terminfo? -Mike -
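Mike's suggestion of rebuilding field data from term positions can be sketched roughly as below. This is plain Java, not Lucene code: the `positionsByTerm` map is a hypothetical stand-in for whatever a term vector with positions would provide, and the class and method names are made up for illustration. Only the analyzed tokens come back; anything the analyzer dropped or changed (case, punctuation, stop words) cannot be recovered this way.

```java
import java.util.Map;
import java.util.TreeMap;

// Illustrative sketch only; names are hypothetical and nothing here is Lucene API.
final class TermVectorReconstructionSketch {

    /**
     * Rebuilds a token sequence from a term -> positions mapping
     * (an assumed, simplified view of term-vector data).
     */
    public static String reconstruct(Map<String, int[]> positionsByTerm) {
        // Invert the mapping: position -> term, kept sorted by position.
        TreeMap<Integer, String> byPosition = new TreeMap<>();
        for (Map.Entry<String, int[]> entry : positionsByTerm.entrySet()) {
            for (int position : entry.getValue()) {
                byPosition.put(position, entry.getKey());
            }
        }
        // Join the terms in position order to approximate the original text.
        return String.join(" ", byPosition.values());
    }
}
```

The trade-off is extra work at retrieval time in exchange for not keeping a second, stored copy of each field.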

Re: "Catalog" backend for document stored fields?

2006-10-20 Thread eks dev
1- is there someone out there that already wrote an extension to Lucene so that 'stored' string for each document/field is in fact stored in a centralized repository? Meaning, only an 'index' is actually stored in the document and the real data is put somewhere else. 2- If not, how ha

Re: index architectures

2006-10-20 Thread Paul Waite
Doron wrote: > Not sure if this is the case, but you said "searchers", so might be it - > you can (and should) reuse searchers for multiple/concurrent queries. > IndexSearcher is thread-safe, so no need to have a different searcher for > each query. Keep using this searcher until you decide to ope
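The reuse pattern Doron describes might look roughly like this. The sketch deliberately avoids Lucene classes so that it runs on its own: `StubSearcher` is a hypothetical stand-in for a thread-safe searcher, and its `search` method is invented for illustration. The point is only that one instance is created once and shared by every concurrent query.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

// Illustrative sketch only; StubSearcher is hypothetical, not Lucene's IndexSearcher.
final class SharedSearcherSketch {

    static final class StubSearcher {
        static final AtomicInteger INSTANCES_CREATED = new AtomicInteger();

        StubSearcher() {
            INSTANCES_CREATED.incrementAndGet();
        }

        // Stateless lookup, so concurrent calls do not interfere with each other.
        int search(String term) {
            return term.length();
        }
    }

    // Created once and shared by all queries instead of one searcher per query.
    static final StubSearcher SHARED = new StubSearcher();

    public static List<Integer> runConcurrently(List<String> terms) {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Future<Integer>> futures = new ArrayList<>();
            for (String term : terms) {
                futures.add(pool.submit(() -> SHARED.search(term)));
            }
            List<Integer> results = new ArrayList<>();
            for (Future<Integer> future : futures) {
                results.add(future.get()); // results come back in submission order
            }
            return results;
        } catch (Exception e) {
            throw new RuntimeException(e);
        } finally {
            pool.shutdown();
        }
    }
}
```

With a real searcher the saving comes from not reopening the index and rebuilding per-searcher state for every query.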

"Catalog" backend for document stored fields?

2006-10-20 Thread Robichaud, Jean-Philippe
Hello to all of you! I'm using Lucene to index millions of relatively small documents. In fact, I'm indexing logs from a transaction-based application. Each document represents what happened inside during a 'transaction'. Each of them is composed of 5-6 main 'states' which are themselves compos

RE: Wildcard Search and "Note: You cannot use a * or ? symbol as the first character of a search"

2006-10-20 Thread Vladimir Olenin
Don't know Lucene internals, but I'd say you'd have to create your own 'reverse' B-Tree of some kind (Lucene gurus will probably advise you on the place where this can be changed in the Lucene). Even if this functionality can't be redefined in Lucene itself, you can easily implement it by yourself
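One way to read Vladimir's "reverse" idea, sketched in plain Java: keep each term reversed in a sorted structure, so that a leading-wildcard query such as `*ing` becomes an ordinary prefix scan for `gni`. A `TreeSet` is swapped in here for the B-Tree he mentions, and all names are hypothetical; this is not how Lucene or the LUCENE-489 patch does it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Illustrative sketch only; a TreeSet stands in for the "reverse B-Tree".
final class ReversedTermIndexSketch {

    // Terms are stored reversed, in sorted order.
    private final TreeSet<String> reversedTerms = new TreeSet<>();

    public static String reverse(String s) {
        return new StringBuilder(s).reverse().toString();
    }

    public void add(String term) {
        reversedTerms.add(reverse(term));
    }

    /** Returns the terms matching "*suffix" by scanning a prefix range of reversed terms. */
    public List<String> endingWith(String suffix) {
        String prefix = reverse(suffix);
        List<String> matches = new ArrayList<>();
        // tailSet starts at the first reversed term >= prefix; stop once the prefix no longer matches.
        for (String reversed : reversedTerms.tailSet(prefix)) {
            if (!reversed.startsWith(prefix)) {
                break;
            }
            matches.add(reverse(reversed));
        }
        return matches;
    }
}
```

The cost is a second copy of the term dictionary; results come back in the sort order of the reversed terms rather than of the terms themselves.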

Re: Wildcard Search and "Note: You cannot use a * or ? symbol as the first character of a search"

2006-10-20 Thread Otis Gospodnetic
Larry, the patch for that is already in JIRA, the issue-tracking system, and might be committed soon. Otis - Original Message From: "Fillion, Larry" <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, October 20, 2006 1:58:04 PM Subject: Wildcard Search and "Note: You cann

Wildcard Search and "Note: You cannot use a * or ? symbol as the first character of a search"

2006-10-20 Thread Fillion, Larry
Documentation: http://lucene.apache.org/java/docs/queryparsersyntax.html WildCard Searches Lucene supports single and multiple character wildcard searches. To perform a single character wildcard search use the "?" symbol. To perform a multiple character wildcard search use the "

Re: part of speech tagger

2006-10-20 Thread Breck Baldwin
LingPipe has one, see a tutorial at: http://alias-i.com/lingpipe/demos/tutorial/posTags/read-me.html Also look at the competition for academic packages that may or may not have POS taggers. See: http://alias-i.com/lingpipe/web/competition.html breck (disclosure--we make LingPipe) z shal

Re: part of speech tagger

2006-10-20 Thread Fredrik Hedberg
http://opennlp.sf.net/ - Fredrik 2006/10/20, z shalev <[EMAIL PROTECTED]>: hello all, i would like to retrieve during query time, the part of speech of each word in a query, does anyone know of an implementation of a java part of speech api? thanks in advance, ---

Re: part of speech tagger

2006-10-20 Thread Grant Ingersoll
Google Brill tagger or Brill part of speech tagger. I believe there is a Java API. It is trainable, as well. -Grant On Oct 20, 2006, at 6:25 AM, z shalev wrote: hello all, i would like to retrieve during query time, the part of speech of each word in a query, does anyone know

Re: part of speech tagger

2006-10-20 Thread Pierrick Brihaye
Hi, z shalev wrote: hello all, i would like to retrieve during query time, the part of speech of each word in a query, does anyone know of an implementation of a java part of speech api? thanks in advance, Aramorph for Java, which is an Arabic Analyzer that provides a Lucene interfac

part of speech tagger

2006-10-20 Thread zzzzz shalev
hello all, i would like to retrieve during query time, the part of speech of each word in a query, does anyone know of an implementation of a java part of speech api? thanks in advance,

Re: Scalability Questions

2006-10-20 Thread Otis Gospodnetic
Hi Guerre, The reason you haven't received any answers yet is that this is pretty much impossible to answer... and so I'll try to answer your questions now, at 3:40 AM. ;) - Original Message From: Guerre Bear <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday, October