I would suggest study spark ,flink,strom and based on your understanding and finding prepare your research paper.
May be you will invented new spark ☺ Regards, Vaquar khan On 16 Jul 2015 00:47, "Michael Segel" <msegel_had...@hotmail.com> wrote: > Silly question… > > When thinking about a PhD thesis… do you want to tie it to a specific > technology or do you want to investigate an idea but then use a specific > technology. > Or is this an outdated way of thinking? > > "I am doing my PHD thesis on large scale machine learning e.g Online > learning, batch and mini batch learning.” > > So before we look at technologies like Spark… could the OP break down a > more specific concept or idea that he wants to pursue? > > Looking at what Jorn said… > > Using machine learning to better predict workloads in terms of managing > clusters… This could be interesting… but is it enough for a PhD thesis, or > of interest to the OP? > > > On Jul 15, 2015, at 9:43 AM, Jörn Franke <jornfra...@gmail.com> wrote: > > Well one of the strength of spark is standardized general distributed > processing allowing many different types of processing, such as graph > processing, stream processing etc. The limitation is that it is less > performant than one system focusing only on one type of processing (eg > graph processing). I miss - and this may not be spark specific - some > artificial intelligence to manage a cluster, e.g. Predicting workloads, how > long a job may run based on previously executed similar jobs etc. > Furthermore, many optimizations you have do to manually, e.g. Bloom > filters, partitioning etc - if you find here as well some intelligence that > does this automatically based on previously executed jobs taking into > account that optimizations themselves change over time would be great... > You may also explore feature interaction > > Le mar. 14 juil. 2015 à 7:19, Shashidhar Rao <raoshashidhar...@gmail.com> > a écrit : > >> Hi, >> >> I am doing my PHD thesis on large scale machine learning e.g Online >> learning, batch and mini batch learning. >> >> Could somebody help me with ideas especially in the context of Spark and >> to the above learning methods. >> >> Some ideas like improvement to existing algorithms, implementing new >> features especially the above learning methods and algorithms that have not >> been implemented etc. >> >> If somebody could help me with some ideas it would really accelerate my >> work. >> >> Plus few ideas on research papers regarding Spark or Mahout. >> >> Thanks in advance. >> >> Regards >> > > >