Here's a good blog post by Sandy Ryza @ Cloudera on Spark + Time Series
Data:
http://blog.cloudera.com/blog/2015/12/spark-ts-a-new-library-for-analyzing-time-series-data-with-apache-spark/
Might give you some things to try.
On Thu, Jan 7, 2016 at 11:40 PM, dEEPU wrote:
> Maybe u want to convert
Maybe u want to convert the date to a duration in form of number of hours/days
and then do calculation on it
On Jan 8, 2016 12:39 AM, Jorge Machado wrote:
Hello all,
I'm new to machine learning. I'm trying to predict some electric usage with a
decision Free
The data is :
2015-12-10-10:00, 120
First extracting year, month, day, time from the datetime.
Then you should decide which variables can be treated as category features
such as year/month/day and encode them to boolean form using OneHotEncoder.
At last using VectorAssembler to assemble the encoded output vector and the
other raw inp
Or he can also transform the whole date into a string
On Thursday, January 7, 2016 2:25 PM, Sujit Pal
wrote:
Hi Jorge,
Maybe extract things like dd, mm, day of week, time of day from the datetime
string and use them as features?
-sujit
On Thu, Jan 7, 2016 at 11:09 AM, Jorge Machado