I would recommend using HDFS. For that, you need to specify the paths like this: hdfs:///path/to/data.
On Tue, Jun 2, 2015 at 2:48 PM, Pa Rö <paul.roewer1...@googlemail.com> wrote: > nice, > > which file system i must use for the cluster? java.io or hadoop.fs or > flink? > > 2015-06-02 14:29 GMT+02:00 Robert Metzger <rmetz...@apache.org>: > >> Hi, >> you can start Flink on YARN on the Cloudera distribution. >> >> See here for more: >> http://ci.apache.org/projects/flink/flink-docs-master/setup/yarn_setup.html >> >> These are the commands you need to execute >> >> wget >> http://stratosphere-bin.s3-website-us-east-1.amazonaws.com/flink-0.9-SNAPSHOT-bin-hadoop2.tgz >> tar xvzf flink-0.9-SNAPSHOT-bin-hadoop2.tgzcd flink-0.9-SNAPSHOT/ >> ./bin/yarn-session.sh -n 4 -jm 1024 -tm 4096 >> >> >> >> >> >> On Tue, Jun 2, 2015 at 2:03 PM, Pa Rö <paul.roewer1...@googlemail.com> >> wrote: >> >>> hi community, >>> >>> i want test my flink k-means on a hadoop cluster. i use the cloudera >>> live distribution. how i can run flink on this cluster? maybe only the java >>> dependencies are engouth? >>> >>> best regards, >>> paul >>> >> >> >