Re: lucene functionality

2006-12-14 Thread Patrick Turcotte
On 12/14/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: On Dec 13, 2006, at 1:51 PM, Patrick Turcotte wrote: > I would suggest you take a look at exist-db (http://exist-db.org/). I really doubt eXist can handle 10M XML files. Last time I tried it, it choked on 20k of them. It is true I don't

Re: lucene functionality

2006-12-14 Thread Erik Hatcher
On Dec 13, 2006, at 1:51 PM, Patrick Turcotte wrote: I would suggest you take a look at exist-db (http://exist-db.org/). I really doubt eXist can handle 10M XML files. Last time I tried it, it choked on 20k of them. Erik A database for XML documents that support XQuery. We a

Re: lucene functionality

2006-12-13 Thread Marcelo Ochoa
Hi Chris: > (1) Each field is searchable and indexable. ...and I assumed hte real problem is being ableto address use cases like "find all documents where the DRECONTENT contains the words "Action" and the words "News" near eachother -- using stemming and other Text Analysys tricks i may wnat

Re: lucene functionality

2006-12-13 Thread Chris Hostetter
: > : For 10 million records We recommend an strong database such as Oracle. : > : > eh ... who is "We" in that statement? : We are independent consultants working for many years with Oracle databases ;) And that's a perfectly acceptible answer, i just don't want any first time Lucene users

Re: lucene functionality

2006-12-13 Thread Marcelo Ochoa
Hi Chris: On 12/13/06, Chris Hostetter <[EMAIL PROTECTED]> wrote: : For 10 million records We recommend an strong database such as Oracle. eh ... who is "We" in that statement? We are independent consultants working for many years with Oracle databases ;) I Suspect you'll find other peop

Re: lucene functionality

2006-12-13 Thread Chris Hostetter
: For 10 million records We recommend an strong database such as Oracle. eh ... who is "We" in that statement? I Suspect you'll find other people on this list who have no problems running Lucene indexes containing 10 million documents. If you want a database, then by all means use a database,

Re: lucene functionality

2006-12-13 Thread Doron Cohen
Lucene RangeQuery would do for the "time" and "numeric" reqs. "Mark Mei" <[EMAIL PROTECTED]> wrote: > At the bottom of this email is the sample xml file that we are using today. > We have about 10 million of these. > > We need to know whether Lucene can support the following functionalities. > (1)

Re: lucene functionality

2006-12-13 Thread Marcelo Ochoa
Hi Mark: For 10 million records We recommend an strong database such as Oracle. You can annotate the Schema (.xsd) which describes your XML record to store some field in traditional VARCHAR2 or NUMBER columns to query it faster, and in a CLOB column. You can find more information at: http://ww

Re: lucene functionality

2006-12-13 Thread Patrick Turcotte
I would suggest you take a look at exist-db (http://exist-db.org/). A database for XML documents that support XQuery. We are using both products here (lucene and exist-db), and for what you are looking for, exist-db seems better. Our documents are far more complex than yours (about 500 differen