Cheolsoo Park created PIG-3400:
----------------------------------
Summary: FS commands do not work with S3 paths
Key: PIG-3400
URL: https://issues.apache.org/jira/browse/PIG-3400
Project: Pig
Issue Type: Bug
Components: grunt
Reporter: Cheolsoo Park
Assignee: Cheolsoo Park
Fix For: 0.12
To reproduce the issue, run the following commands with S3 paths:
{code}
pig -e 'ls s3://<path>'
pig -e 'mkdir s3://<path>'
pig -e 'cp s3://<path1> s3://<path2>'
pig -e 'mv s3://<path1> s3://<path2>'
pig -e 'copyToLocal s3://<path>/<file> .'
pig -e 'copyFromLocal <file> s3://<path>'
{code}
As of now, none of these commands work in Apache Pig, although they do work in
EMR Pig. The problem is that GruntParser constructs DataStorage from the default
file system specified in the configuration, so S3 paths are not recognized.
Instead, we should construct DataStorage based on the URI of the given path.
For example,
{code}
DataStorage dfs = new HDataStorage(new Path(path).toUri(), mConf);
{code}
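To illustrate the selection logic only, here is a minimal standalone sketch using plain java.net.URI rather than Hadoop's Path or Pig's HDataStorage. The class name FsUriResolver, the method fsUriFor, and the example bucket and namenode addresses are hypothetical and are not part of Pig. The assumption is that a path carrying its own scheme should select its own file system, and that a scheme-less path should fall back to the configured default.

```java
import java.net.URI;

// Hypothetical helper, not part of Pig: shows how the file system could be
// chosen from the path itself instead of always using the default.
public class FsUriResolver {

    // Returns the URI that should select the file system for the given path:
    // the path's own URI when it has a scheme (e.g. s3://...), otherwise the
    // default file system URI taken from configuration.
    static URI fsUriFor(String path, URI defaultFs) {
        URI uri = URI.create(path);
        return uri.getScheme() != null ? uri : defaultFs;
    }

    public static void main(String[] args) {
        // Assumed default file system, standing in for the configured value.
        URI defaultFs = URI.create("hdfs://namenode:8020");

        // A fully qualified S3 path keeps its own scheme.
        System.out.println(
            fsUriFor("s3://example-bucket/logs/part-00000", defaultFs).getScheme());

        // A scheme-less path falls back to the default file system.
        System.out.println(fsUriFor("/user/pig/input", defaultFs).getScheme());
    }
}
```

In the actual fix, the resulting URI would be what gets passed to the HDataStorage constructor, as in the snippet above.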