[GitHub] flink pull request #3828: [FLINK-6447] update aws/emr docs

greghogan Thu, 04 May 2017 12:04:05 -0700

Github user greghogan commented on a diff in the pull request:

    https://github.com/apache/flink/pull/3828#discussion_r114861184
  
    --- Diff: docs/setup/aws.md ---
    @@ -32,17 +32,23 @@ Amazon Web Services offers cloud computing services on which you can run Flink.
     
     [Amazon Elastic MapReduce](https://aws.amazon.com/elasticmapreduce/) (Amazon EMR) is a web service that makes it easy to  quickly setup a Hadoop cluster. This is the **recommended way** to run Flink on AWS as it takes care of setting up everything.
     
    -### Create EMR Cluster
    +### Standard EMR Installation
     
    -The EMR documentation contains [examples showing how to start an EMR cluster](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-gs-launch-sample-cluster.html). You can follow that guide and install any EMR release. You don't need to install *All Applications* part of the EMR release, but can stick to *Core Hadoop*:
    +Flink is a supported application on Amazon EMR. Basically all you have to do is choose Flink as an application, along with whatever
    +else you need, and configure the instances and roles. [Amazon's documentation](http://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-flink.html) gives all the details.
     
    -<img src="{{ site.baseurl }}/fig/flink-on-emr.png" class="img-responsive">
    +### Custom EMR Installation
     
    -When creating your cluster, make sure to setup [IAM roles](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-iam-roles.html) allowing you to access your S3 buckets if required.
    +The standard installation (above) is easier, but if you need to use a version of Flink that Amazon doesn't support,
    +then you can setup a stock EMR cluster and install Flink yourself.
     
    -{% top %}
    +**Create EMR Cluster**
    +
    +The EMR documentation contains [examples showing how to start an EMR cluster](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-gs-launch-sample-cluster.html). You can follow that guide and install any EMR release. You don't need to install the *All Applications* part of the EMR release, but can stick to *Core Hadoop*.
    +
    +When creating your cluster, make sure to setup [IAM roles](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-iam-roles.html) allowing you to access your S3 buckets if required.
    --- End diff --
    
    How about something like "When creating a cluster, access to S3 buckets requires configuration of [IAM roles](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-iam-roles.html)."? And could we prefix it with our "note" warning?



