LinJi created HADOOP-15838: ------------------------------ Summary: Copy files from SFTP to HDFS using DistCp failed with error Key: HADOOP-15838 URL: https://issues.apache.org/jira/browse/HADOOP-15838 Project: Hadoop Common Issue Type: Bug Components: tools/distcp Affects Versions: 2.7.2, 2.5.0 Environment: Hadoop 2.5.0 + kerberos Reporter: LinJi Fix For: 2.7.5 Attachments: 微信截图_20181010224316.png, 微信截图_20181010224330.png
1. When I run command: {code:java} hadoop distcp sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts /tmp/JOY{code} I got error like: {noformat} 2018-10-10 22:31:37,799 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST 2018-10-10 22:31:39,055 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false} 2018-10-10 22:31:39,365 ERROR tools.DistCp: Exception encountered java.io.IOException: Invalid host specified at org.apache.hadoop.fs.sftp.SFTPFileSystem.initialize(SFTPFileSystem.java:67) at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2591) at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89) at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2643) at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2625) at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368) at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296) at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:76) at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84) at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353) at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160) at org.apache.hadoop.tools.DistCp.run(DistCp.java:121) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.tools.DistCp.main(DistCp.java:401) {noformat} 2. When I run command: {code:java} hadoop distcp sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts /tmp/JOY{code} I got error like: {noformat} 2018-10-10 22:31:59,909 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST 2018-10-10 22:32:01,286 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false} 2018-10-10 22:32:02,190 ERROR tools.DistCp: Exception encountered java.io.IOException: SSH_MSG_DISCONNECT: 2 Too many authentication failures for mysftp at org.apache.hadoop.fs.sftp.SFTPFileSystem.connect(SFTPFileSystem.java:143) at org.apache.hadoop.fs.sftp.SFTPFileSystem.getFileStatus(SFTPFileSystem.java:371) at org.apache.hadoop.fs.Globber.getFileStatus(Globber.java:57) at org.apache.hadoop.fs.Globber.glob(Globber.java:252) at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:1623) at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:77) at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84) at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353) at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160) at org.apache.hadoop.tools.DistCp.run(DistCp.java:121) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.tools.DistCp.main(DistCp.java:401){noformat} The SFTP username is mysftp password is 1qaz_@WSX -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-dev-h...@hadoop.apache.org