Major Bioinformatics center adopts Lucene to help biologists search "everything"

2006-12-28 Thread Don Gilbert
from European Bioinformatics Institute (EBI) http://www.ebi.ac.uk/ - Better, faster, easier EMBL-EBI launches its new website with powerful search engine Behind this new web interface lies the "EB-eye", a powerful search engine allowing instant searches of all the

newbie lucene indexing/search question

2006-12-28 Thread moraleslos
I currently have a book containing content that is stored in the database by paragraph. For example, a book contains content with 5 paragraphs. Therefore each paragraph is stored as a distinct record in a database. In the object domain, I have a Book object which holds a java.util.List of Paragrap

Re: Clustering Lucene with 40 Servers

2006-12-28 Thread markharw00d
Not quite yet gone up to this scale but here are some points for consideration based on a smaller scale system I have in production that may be of interest: By clustering I presume you are only talking about replication. When we talk about scaling and using multiple machines we need to think

Re: Paging Lucene Results

2006-12-28 Thread Peter W.
Thanks. When you are trying to determine how many items to show on a results page and you have: 1. number of hits you want to display (hpp) 2. total hitcount returned by Lucene from a query (hc) 3. the results page you are currently on (ipg) there's some math involved and I was looking for t

Re: help finding docs, creating analyzer objects

2006-12-28 Thread Haszlakiewicz, Eric
>>> Grant Ingersoll 12/26/2006 10:12 PM >>> >Hi Eric, > >Take a look at http://lucene.apache.org/java/docs/releases.html > >Clicking through to the appropriate mirror, download/unzip >lucene-2.0.0.zip. In it you'll find the jars, contrib, README, demo, >etc. It really should contain everyth

Re: Nested Queries

2006-12-28 Thread Steven Rowe
Hi Kapil, Kapil Chhabra wrote: > Hi Steve, > Thanks for the response. > Actually I am not looking for a query language. My question is, whether > Lucene supports Nested Queries or self joins? > As per > http://lucene.apache.org/java/docs/api/org/apache/lucene/queryParser/QueryParser.html > > In B

Re: Clustering Lucene with 40 Servers

2006-12-28 Thread Yonik Seeley
On 12/28/06, Adam Fleming <[EMAIL PROTECTED]> wrote: I saw that Doug Cutting had an interesting solution for his Technorati website: http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg12709.html It sounds like it's a single-writer, many readers type of system, but quite robust and ef

Re: Nested Queries

2006-12-28 Thread Erick Erickson
No. Lucene is a text search engine, NOT a RDBMS. Whenever you think of joins (self or otherwise), you're thinking in RDMBS terms, which Lucene is not. At best, you'll have to use one of the DB integrations that Steve mentioned (assuming they work). But I wouldn't keep looking for any magic interna

Re[2]: Clustering Lucene with 40 Servers

2006-12-28 Thread Sven Duzont
Hi, Me (& my comp) have also implemented the Doug's index replication solution : - One Node responsible of the indexing. - X Nodes handling the search requests. The difference is that it is the role of the indexer Node to notify (via Jgroups) that an indexation process is over. For now we dont ha