Re: Spark on wikipedia dataset

2014-04-23 Thread Mayur Rustagi
Huge joins would be interesting. I do all my demos on wikipedia dataset for Shark. Joins are typical pain to showcase & show off :) Mayur Rustagi Ph: +1 (760) 203 3257 http://www.sigmoidanalytics.com @mayur_rustagi On Wed, Apr 23, 2014 at 10:33 AM, Ajay Nair

Spark on wikipedia dataset

2014-04-22 Thread Ajay Nair
I am going to perform some test experiments on the wikipedia dataset using the spark framework. I know wikipedia data set might already have been analyzed, but what are the potential explored/unexplored aspects of spark that can be tested and benchmarked on wikipedia dataset? Thanks AJ