Re: Matrix Multiplication and mllib.recommendation

2015-06-28 Thread Ilya Ganelin
o LSH with 30,000 items. Sent from Mailbox <https://www.dropbox.com/mailbox> On Thu, Jun 18, 2015 at 6:01 AM, Ganelin, Ilya <

Re: Matrix Multiplication and mllib.recommendation

2015-06-28 Thread Ayman Farahat
g-with-apache-spark-or-how-i-learned-to-stop-worrying-and-love-the-shuffle/. Keep in mind, you're actually doing a ton of math. Even with proper caching and use of broadcast variables this will take a while depending on the size of your cluster. To get real

Re: Matrix Multiplication and mllib.recommendation

2015-06-28 Thread Ilya Ganelin
Sent from Mailbox <https://www.dropbox.com/mailbox> On Thu, Jun 18, 2015 at 6:01 AM, Ganelin, Ilya <ilya.gane...@capitalone.com> wrote:

Re: Matrix Multiplication and mllib.recommendation

2015-06-28 Thread Ilya Ganelin
I actually talk about this exact thing in a blog post here: http://blog.cloudera.com/blog/2015/05/working-with-apache-spark-or-how-i-learned-to-stop-worrying-and-love-the-shuffle/. Keep in mind, you're ac

Re: Matrix Multiplication and mllib.recommendation

2015-06-27 Thread Ayman Farahat
sensitive hashing to limit your search space and definitely look into spinning up multiple threads to process your product features in parallel to increase resource utilization on the cluster. Thank you, Ilya Ganelin

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Nick Pentreath
Sent from Mailbox On Thu, Jun 18, 2015 at 6:01 AM, Ganelin, Ilya wrote: I actually talk about this exact thing in a blog post here

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Ayman Farahat
earch space and definitely look into spinning up multiple threads to process your product features in parallel to increase resource utilization on the cluster. Thank you, Ilya Ganelin -Origi

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Debasish Das
cloudera.com/blog/2015/05/working-with-apache-spark-or-how-i-learned-to-stop-worrying-and-love-the-shuffle/. Keep in mind, you're actually doing a ton of math. Even with proper caching and use of broadcast variables this will take a whil

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Debasish Das
g and use of broadcast variables this will take a while depending on the size of your cluster. To get real results you may want to look into locality-sensitive hashing to limit your search space and definitely look into spinning up m

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Debasish Das
space and definitely look into spinning up multiple threads to process your product features in parallel to increase resource utilization on the cluster. Thank you, Ilya Ganelin

Re: Matrix Multiplication and mllib.recommendation

2015-06-18 Thread Ayman Farahat
Thank you, Ilya Ganelin -----Original Message----- From: afarahat [ayman.fara...@yahoo.com] Sent: Wednesday, June 17, 2015 11:16 PM Eastern Standard Time To: user@spark.apache.org Subject: Matrix Multiplication and mllib.recommendation

Re: Matrix Multiplication and mllib.recommendation

2015-06-17 Thread Sabarish Sasidharan
look into spinning up multiple threads to process your product features in parallel to increase resource utilization on the cluster. Thank you, Ilya Ganelin -----Original Message----- *Fr

RE: Matrix Multiplication and mllib.recommendation

2015-06-17 Thread Nick Pentreath
increase resource utilization on the cluster. Thank you, Ilya Ganelin -----Original Message----- From: afarahat [ayman.fara...@yahoo.com] Sent: Wednesday, June 17, 2015 11:16 PM Eastern Standard Time To: user@spark.apache.org

RE: Matrix Multiplication and mllib.recommendation

2015-06-17 Thread Ganelin, Ilya
n the cluster. Thank you, Ilya Ganelin -----Original Message----- From: afarahat [ayman.fara...@yahoo.com] Sent: Wednesday, June 17, 2015 11:16 PM Eastern Standard Time To: user@spark.apache.org Subject: Matrix Multiplication and mllib.recommendation Hello;

Matrix Multiplication and mllib.recommendation

2015-06-17 Thread afarahat
Hello; I am trying to get predictions after running the ALS model. The model works fine. In the prediction/recommendation step, I have about 30,000 products and 90 million users. When I try predictAll it fails. I have been trying to formulate the problem as a matrix multiplication where I fi
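
The approach discussed in this thread (keep the comparatively small product-factor matrix in memory and score users a block at a time, keeping only the top few products per user) can be sketched in plain Python as below. This is a minimal illustration, not the poster's code and not the MLlib API: the names `dot`, `top_k_for_user`, and `recommend_in_blocks`, the `block_size` parameter, and the toy factor values are all hypothetical. In a Spark job the product factors would presumably play the role of a broadcast variable and each block would correspond to a partition of the user factors.

```python
import heapq
from typing import Dict, Iterable, Iterator, List, Sequence, Tuple

Vector = Sequence[float]
Scored = Tuple[int, float]  # (product_id, predicted score)


def dot(u: Vector, v: Vector) -> float:
    """Inner product of a user factor vector and a product factor vector."""
    return sum(a * b for a, b in zip(u, v))


def top_k_for_user(user_vec: Vector,
                   product_factors: Dict[int, Vector],
                   k: int) -> List[Scored]:
    """Score every product for one user and keep only the k best."""
    scored = ((pid, dot(user_vec, pvec)) for pid, pvec in product_factors.items())
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


def recommend_in_blocks(user_factors: Iterable[Tuple[int, Vector]],
                        product_factors: Dict[int, Vector],
                        k: int,
                        block_size: int) -> Iterator[List[Tuple[int, List[Scored]]]]:
    """Yield top-k recommendations one block of users at a time.

    Only one block of results is held in memory at once, so the full
    users-by-products score matrix is never materialized.
    """
    block: List[Tuple[int, Vector]] = []
    for uid, uvec in user_factors:
        block.append((uid, uvec))
        if len(block) == block_size:
            yield [(u, top_k_for_user(v, product_factors, k)) for u, v in block]
            block = []
    if block:  # flush the final, possibly smaller, block
        yield [(u, top_k_for_user(v, product_factors, k)) for u, v in block]


# Toy factors (hypothetical values, rank 2) standing in for ALS output.
user_factors = [(0, [1.0, 0.0]), (1, [0.0, 1.0]), (2, [1.0, 1.0])]
product_factors = {10: [2.0, 0.0], 11: [0.0, 3.0], 12: [0.5, 0.5]}

blocks = list(recommend_in_blocks(user_factors, product_factors, k=2, block_size=2))
```

With 30,000 products and 90 million users the dense score matrix would have 2.7 trillion entries, which is why the sketch keeps only `k` results per user and never stores a full row of scores beyond the user currently being processed.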