Re: data transfer from rdbms to hive

2014-05-02 Thread CRAIG LIU
I am new in hive and here is my idea? 1. Use mysqldump to dump your data to csv file. 2. Load csv to hive temp table. 3. Create partition table. 4. Use dynamic partition, select from temp table to insert to partition table. You can use udf to get the date from the timestamp. Regards, Craig 2014-5-

Re: data transfer from rdbms to hive

2014-05-02 Thread Shushant Arora
for that do i need to load files first in non partitioned table and then in from there to partitioned table use insert from unpartitioned table to partitioned one. On Fri, May 2, 2014 at 4:04 PM, Hamza Asad wrote: > Sqoop also support dynamic partitioning. I have done that. For that you > have

Re: data transfer from rdbms to hive

2014-05-02 Thread Matt Tucker
It sounds like you might need to export. Via sqoop using a query or view, as the date granularity in your MySQL table is different from the desired Hive table. The overall performance may be lower as MySQL must do more than just read rows from disk, but you may still find ways to get the data in pa

Re: data transfer from rdbms to hive

2014-05-02 Thread Hamza Asad
Sqoop also support dynamic partitioning. I have done that. For that you have to enable dynamic partition i.e dynamic partition = true, in hive. On Fri, May 2, 2014 at 12:57 PM, unmesha sreeveni wrote: > > On Fri, May 2, 2014 at 9:41 AM, Shushant Arora > wrote: > >> Sqoop > > > ​Hi Shushant >

Re: data transfer from rdbms to hive

2014-05-02 Thread unmesha sreeveni
On Fri, May 2, 2014 at 9:41 AM, Shushant Arora wrote: > Sqoop ​Hi Shushant I dont think other ecosystem projects can help you.The only way to import data from relational DB is SQOOP. http://my.safaribooksonline.com/book/databases/9781449364618/6dot-hadoop-ecosystem-integration/integration_hiv

Re: data transfer from rdbms to hive

2014-05-01 Thread Shushant Arora
But how to achieve dynamic partitioning. For each row in mysql date from column get partition name and insert in corresponding partition in hive. Sqoop requires partition t be told beforehand. On Fri, May 2, 2014 at 8:36 AM, unmesha sreeveni wrote: > I suggest you to go for sqoop - They import

Re: data transfer from rdbms to hive

2014-05-01 Thread unmesha sreeveni
I suggest you to go for sqoop - They imports data from RDBMS. On Thu, May 1, 2014 at 7:13 PM, Shushant Arora wrote: > Hi > > I have a requirement to transfer data from RDBMS mysql to partitioned hive > table > Partitioned on Year and month. > Each record in mysql data contains timestamp of user