How to do Multiple-Cluster Query?

2011-03-31 Thread 袁武 [GMail]
Hi, dear experts: Assume that a tuple is composed of <,,...,, Value>. IF AK is stored in Field A, BK is stored in Field B, CK is stored in Field C (1<=K<=N), so there are N A fields,N B fields and N C fields. The question is if I want to query whith the condition of , how can i get the exact v

Re: a faster way to addDocument and get the ID just added?

2011-03-31 Thread Simon Willnauer
Hey Ian, On Thu, Mar 31, 2011 at 11:32 AM, Ian Lea wrote: >>> Subject: a faster way to addDocument and get the ID just added? > > Might it be possible to come up with a version of > IndexWriter.addDocument() that returns the docid rather than void? > Answering that question is way out of my leagu

Apache Lucene 3.1.0

2011-03-31 Thread Grant Ingersoll
March 2011, Apache Lucene 3.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 3.1. This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. The release is available for immediate download at http://www.apache.org

Apache Lucene 3.1.0 is available

2011-03-31 Thread Grant Ingersoll
March 2011, Apache Lucene 3.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 3.1. This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. The release is available for immediate download at http://www.apache.org

Storing Semantic Role labeling information with Lucene

2011-03-31 Thread Aenima
Hi to everyone, I'm new with Lucene and I'm trying to store SRL information in Lucene indexes. Since it is a metadata I thought that payloads could be of some use, but I need to store String not a numerical value. I need to be able to search by word as well as by SRL information. Is it possible to

Re: Performance and index size (rephrased question)

2011-03-31 Thread Erick Erickson
5-10 G indexes are pretty small by Lucene/Solr standards, so given reasonable hardware resources this should be no problem. That said, only measurement will nail this down. But an often-used rule of thumb is that you need to consider some better strategies in the 40G range. CAUTION: you haven't sp

Performance and index size (rephrased question)

2011-03-31 Thread sol myr
Hi, I'm rephrasing a previous performance question, in light of new data... I have a Lucene index of about 0.5 GB. Currently performance is good - up to 200 milliseconds per search (with complex boolean queries, but never retrieving more than 200 top results). The question: how much can the index

Re: a faster way to addDocument and get the ID just added?

2011-03-31 Thread Ian Lea
>> Subject: a faster way to addDocument and get the ID just added? Might it be possible to come up with a version of IndexWriter.addDocument() that returns the docid rather than void? Answering that question is way out of my league, but it would presumably be quick. -- Ian. On Thu, Mar 31, 201

Re: Distributing a Lucene application?

2011-03-31 Thread sol myr
Thanks very much, sounds great :) On Thu, Mar 24, 2011 at 9:13 PM, Chris Lu wrote: > It's great that the requirement is loose... > But I suppose users would ask for more later. > > Well, I worked on DBSight, which covers more than just search. It also > includes scheduling indexing, reindexing,

AW: minimum string length for fuzzy search

2011-03-31 Thread Biedermann,S.,Fa. Post Direkt
This is how I understand the process (in 3.0.2): No, there is no length requirement. What happens in a fuzzy query is that it first rewrites to a Boolean query that contains all the terms as term queries that fuzzy match to the given pattern. In this rewrite all index terms must be fuzzy match