I wrote a shell script to do that.

2011/5/4 Marcos Ortiz <mlor...@uci.cu>
> On 05/04/2011 04:14 PM, Alexandre "TAZ" dos Santos Andrade wrote:
> > Hi Marcos,
> >
> > I'm doing exactly the same migration. First of all, you have to
> > remember that Hive will run a MapReduce job for every query whose
> > result you don't write to a table. Second, migrating the data is a
> > little bit annoying: there's no direct connector, so I used a simple
> > dump, removed the header and footer, and loaded it into the Hive
> > structure.
> >
> > I hope this helps.
> >
> > Alexandre dos Santos Andrade
> >
> > 2011/5/4 Marcos Ortiz <mlor...@uci.cu>
> >
> >> We are planning a migration from a large PostgreSQL-based DWH to
> >> Hadoop/Hive. The principal reason for this migration is the massive
> >> growth of the data to analyze (5.6 TB and growing), where PostgreSQL,
> >> as an MVCC-based RDBMS, has its pitfalls with heavy updates and with
> >> query execution over large quantities of data. (We have done a lot of
> >> query tuning and optimization on the server, with only a minor effect
> >> on query latency.)
> >>
> >> So we have looked at Hadoop and done some tests combined with Hive
> >> and HBase, and the performance we obtained is awesome.
> >>
> >> Can you give us some advice on developing a good plan for this?
> >>
> >> Environment:
> >> - O.S.: CentOS 5.5, 64 bits
> >> - Java version: 1.6 Update 20
> >> - Hardware: 8 nodes - AMD Opteron QuadCore 4130
> >>             8 GB RAM
> >>             1 TB HDD
> >>
> >> Regards
> >>
> >> --
> >> Marcos Luís Ortíz Valmaseda
> >> Software Engineer (Large-Scaled Distributed Systems)
> >> University of Information Sciences,
> >> La Habana, Cuba
> >> Linux User # 418229
> >> http://about.me/marcosortiz
>
> Thanks a lot, Alexandre.
> Did you use Sqoop to load the data from PostgreSQL to Hive?
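
For readers following the thread, the dump / strip header and footer / load
steps described above could look roughly like the sketch below. This is not
the script mentioned in the reply, only a minimal illustration. The function
name strip_frame, the database name dwh, the table name sales, and the '|'
delimiter are all assumptions made for the example. It also assumes the dump
comes from psql, whose default aligned output typically has two header lines
and a row-count footer followed by a blank line, while unaligned output (-A)
has one header line and one footer line.

```shell
#!/bin/sh
# strip_frame HEADER_LINES FOOTER_LINES
# Hypothetical helper: reads a text dump on stdin and prints it without
# the first HEADER_LINES lines and the last FOOTER_LINES lines.
strip_frame() {
    awk -v h="$1" -v f="$2" '
        # Buffer every line that comes after the header.
        NR > h { buf[NR] = $0 }
        # Once f newer lines are buffered, the oldest one cannot be part
        # of the footer, so it is safe to print and forget it.
        NR > h + f { print buf[NR - f]; delete buf[NR - f] }
    '
}

# Possible usage (assumed names; commented out so the sketch has no
# external dependencies):
#
#   psql -A -F '|' -d dwh -c 'SELECT * FROM sales' \
#       | strip_frame 1 1 > sales.txt
#   hive -e "LOAD DATA LOCAL INPATH 'sales.txt' INTO TABLE sales"
#
# The Hive table would have to be declared with a matching delimiter,
# for example ROW FORMAT DELIMITED FIELDS TERMINATED BY '|'.
```

The awk buffer keeps at most FOOTER_LINES lines in memory, so the helper
should cope with large dumps, and it avoids `head -n -N`, which is a GNU
extension. psql's -t (tuples only) option would skip the header and footer
at the source and may make the helper unnecessary.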