Re: Incremental import from PostgreSQL to Hive having issues

2012-04-15 Thread Nitin Pawar
best way to solve this is load the data in different partition each time you load the data. (depending on the data you can put the data partitioned by date or data-hour combination frequency on which you load the data) I am not sure how you are installing sqoop. If you are using yum on redhat, the

Re: Incremental import from PostgreSQL to Hive having issues

2012-04-15 Thread Roshan Pradeep
Hi Nitin Thanks for your reply. I am using sqoop *1.4.1-incubating* version. In the sqoop releases download page the is no such version you are referring. Please correct me if I am wrong. Delete the warehouse folder and import is working fine, but my tables having GB of data, so every time delet

Re: Incremental import from PostgreSQL to Hive having issues

2012-04-13 Thread Nitin Pawar
Hi Roshan, I guess you are using sqoop version older than 17. You are facing similar issue mentioned in SQOOP-216 You can try to delete the directory already existing. Thanks, Nitin On Fri, Apr 13, 2012 at 6:12 PM, Roshan Pradeep wrote: > Hadoop

Re: Incremental import from PostgreSQL to Hive having issues

2012-04-13 Thread Roshan Pradeep
Hadoop - 0.20.2 Hive - 0.8.1 Thanks. On Fri, Apr 13, 2012 at 5:03 PM, Nitin Pawar wrote: > can you tell us what is > 1) hive version > 2) hadoop version that you are using? > > > > > > On Fri, Apr 13, 2012 at 12:23 PM, Roshan Pradeep wrote: > >> Hi >> >> I want to import the updated data from my

Re: Incremental import from PostgreSQL to Hive having issues

2012-04-13 Thread Nitin Pawar
can you tell us what is 1) hive version 2) hadoop version that you are using? On Fri, Apr 13, 2012 at 12:23 PM, Roshan Pradeep wrote: > Hi > > I want to import the updated data from my source (PostgreSQL) to hive > based on a column (lastmodifiedtime) in postgreSQL > > *The command I am using

Incremental import from PostgreSQL to Hive having issues

2012-04-12 Thread Roshan Pradeep
Hi I want to import the updated data from my source (PostgreSQL) to hive based on a column (lastmodifiedtime) in postgreSQL *The command I am using* /app/sqoop/bin/sqoop import --hive-table users --connect jdbc:postgresql:// --table users --username XXX --password YY --hive-home /app/hiv