Re: Riak and SEC Filings

2011-11-08 Thread Ryan Zezeski
On Tue, Nov 8, 2011 at 7:08 AM, Hector Castro wrote: > > >* In going through the search querying documentation, I haven't > found a way to extract a section of a result containing matches. Something > similar to Google's search results page where you see an excerpt of the > webpage conten

Re: Riak and SEC Filings

2011-11-08 Thread Elias Levy
On Tue, Nov 8, 2011 at 7:15 AM, wrote: > Date: Tue, 8 Nov 2011 07:08:59 -0500 > From: Hector Castro > > I'm currently in the process of evaluating solutions to index the contents > of ~1TB of SEC (Securities and Exchange Commission) documents. File sizes > vary between a few KB to a couple hund

Re: Riak and SEC Filings

2011-11-08 Thread Andres Jaan Tack
> > * Given that the documents total ~1TB of storage (not including the > generated indexes), does something like decreasing the n_val make sense? > Mostly the documents are bulk inserted on a daily or weekly basis – other > than that all of the operations are read-only. The N replication factor