Our experience of Cassandra+Hadoop is good.

We have a 16 node Cassandra cluster storing 110m users plus a 5 node
Hadoop cluster. We can scan through all rows in about 2.5 hours.

Dave


On Thursday, 20 January 2011, David G. Boney
<dbon...@semanticartifacts.com> wrote:
> I don't think the below statement accurately describes data mining or using 
> Cassandra for data mining. All the techniques I am familiar with for either 
> data mining or machine learning, which data mining is a subset, make one or 
> more sequential scans through the data to abstract statistics or build 
> models. The question is how well does Cassandra perform with sequential scans 
> through the data? The Hadoop model works very well for many machine learning 
> problems because it is oriented toward sequential scans through the data. The 
> speed of the Hadoop interface to Cassandra would have a lot of bearing on the 
> application of Cassandra to data mining or machine learning problems.
>
> -------------Sincerely,David G. 
> Boneydboney1@semanticartifacts.comhttp://www.semanticartifacts.com
>
>
>
>
> On Jan 20, 2011, at 6:35 AM, David Boxenhorn wrote:
> Cassandra is not a good solution for data mining type problems, since it 
> doesn't have ad-hoc queries. Cassandra is designed to maximize throughput, 
> which is not usually a problem for data mining.
>
> On Thu, Jan 20, 2011 at 2:07 PM, Surender Singh <suriait2...@gmail.com> wrote:
>
> Hi All
>
> I want to use Apache Cassandra to store information (like first name, last
> name, gender, address)  about 2 million people.  Then need to perform
> analytic and reporting on that data.
> is need to store information about 2 million people in Mysql and then
> transfer that information into Cassandra.?
>
> Please help me as i m new to Apache Cassandra.
>
> if you have any use case like that, please share.
>
> Thanks and regards
>
> Surender Singh
>
>
>
>
>

-- 
*Dave Gardner*
Technical Architect

[image: imagini_58mmX15mm.png]   [image: VisualDNA-Logo-small.png]

*Imagini Europe Limited*
7 Moor Street, London W1D 5NB

[image: phone_icon.png] +44 20 7734 7033
[image: skype_icon.png] daveg79
[image: emailIcon.png] dave.gard...@imagini.net
[image: icon-web.png] http://www.visualdna.com

Imagini Europe Limited, Company number 5565112 (England
and Wales), Registered address: c/o Bird & Bird,
90 Fetter Lane, London, EC4A 1EQ, United Kingdom

Reply via email to