Kriti Jha created SQOOP-3455: -------------------------------- Summary: Sqoop job fails while importing to S3 as Parquet Key: SQOOP-3455 URL: https://issues.apache.org/jira/browse/SQOOP-3455 Project: Sqoop Issue Type: Bug Components: sqoop2-kite-connector Affects Versions: 1.4.7 Reporter: Kriti Jha
A Sqoop job to import data from a MySQL database into S3 fails on using --as-parquetfile with the error as shown below: ---- {{ERROR sqoop.Sqoop: Got exception running Sqoop: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:s3://sqoop-trial-bucket/sqoop-trial/trial Check that JARs for s3 datasets are on the classpath org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:s3://}}{{sqoop-trial-bucket}}{{/sqoop-trial/trial Check that JARs for s3 datasets are on the classpath at org.kitesdk.data.spi.Registration.lookupDatasetUri(Registration.java:128) at org.kitesdk.data.Datasets.exists(Datasets.java:624) at org.kitesdk.data.Datasets.exists(Datasets.java:646) at org.apache.sqoop.mapreduce.ParquetJob.configureImportJob(ParquetJob.java:118) at org.apache.sqoop.mapreduce.DataDrivenImportJob.configureMapper(DataDrivenImportJob.java:132) at org.apache.sqoop.mapreduce.ImportJobBase.runImport(ImportJobBase.java:264) at org.apache.sqoop.manager.SqlManager.importTable(SqlManager.java:692) at org.apache.sqoop.manager.MySQLManager.importTable(MySQLManager.java:127) at org.apache.sqoop.tool.ImportTool.importTable(ImportTool.java:520) at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:628) at org.apache.sqoop.Sqoop.run(Sqoop.java:147) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76) at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:183) at org.apache.sqoop.Sqoop.runTool(Sqoop.java:234) at org.apache.sqoop.Sqoop.runTool(Sqoop.java:243) at org.apache.sqoop.Sqoop.main(Sqoop.java:252)}} ---- {{}} {{All the JARs for S3 are present in the classpath. Further, the same works on simply removing the argument --as-parquetfile, i.e. with any other format.}} {{}} -- This message was sent by Atlassian Jira (v8.3.4#803005)