Alex Byrd created ZEPPELIN-3659:
-----------------------------------

             Summary: 'Using Pig for querying data' tutorial is outdated
                 Key: ZEPPELIN-3659
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3659
             Project: Zeppelin
          Issue Type: Bug
          Components: documentation
            Reporter: Alex Byrd


The third paragraph (first that is not a description) calls hadoop.

hadoop fs -put bank.csv .

 

This hadoop call infers hadoop is already installed, which is not mentioned in 
the previous paragraphs as a dependency or included/mentioned in the 
installation files and quickstart. When a user has a fresh install and just 
hits 'run all paragraphs' it'll error out here.

 

While this is not terribly difficult to overcome it creates friction in getting 
up and running without issues. The next question is what version of Hadoop to 
use, I've tested with Hadoop 2.7.7 and it appears to work just fine but I 
haven't fully vetted it yet.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to