java code - parsing string -outofindex error -File on HDFS

2009-11-12 Thread dgoker
When i try to read a file from HDFS, i want to substring the record by java code (javada line.substring(1,9)) but i am getting outofindex error. When i do the same operation on same file by PIG code there is no problem. I took following error, What shoul i do in Java. java.lang.StringIndexOutO

ETL using Hadoop ???

2009-11-12 Thread Palikala, Rajendra (CCL)
Hi All, I am an experienced Informatica and Java Developer. I am very new to Hadoop. I have one basic question for the group. Please advise. Can we do ETL (Example: load data from an oracle table to another oracle table) using Hadoop? If so, how can we implement that. >From my understanding, I

Re: ETL using Hadoop ???

2009-11-12 Thread Dhruba Borthakur
Hi Rajendra, We use Hive for a large data warehouse, details here: http://wiki.apache.org/hadoop/Hive thanks, dhruba On Thu, Nov 12, 2009 at 9:55 AM, Palikala, Rajendra (CCL) < rpalik...@carnival.com> wrote: > Hi All, > > I am an experienced Informatica and Java Developer. I am very new to > H

[jira] Created: (HADOOP-6370) Contrib project ivy dependencies are not included in binary target

2009-11-12 Thread Aaron Kimball (JIRA)
Contrib project ivy dependencies are not included in binary target -- Key: HADOOP-6370 URL: https://issues.apache.org/jira/browse/HADOOP-6370 Project: Hadoop Common Issue Type:

Re: ETL using Hadoop ???

2009-11-12 Thread Jeff Hammerbacher
Hey Rajendra, For the "E" part of your task, you may want to check out Sqoop: http://www.cloudera.com/hadoop-sqoop. Regards, Jeff On Thu, Nov 12, 2009 at 10:02 AM, Dhruba Borthakur wrote: > Hi Rajendra, > > We use Hive for a large data warehouse, details here: > http://wiki.apache.org/hadoop/H

RE: ETL using Hadoop ???

2009-11-12 Thread Palikala, Rajendra (CCL)
Jeff, This is great info. Thanks -Original Message- From: Jeff Hammerbacher [mailto:ham...@cloudera.com] Sent: Thursday, November 12, 2009 4:16 PM To: common-dev@hadoop.apache.org Subject: Re: ETL using Hadoop ??? Hey Rajendra, For the "E" part of your task, you may want to check out

Re: ETL using Hadoop ???

2009-11-12 Thread Steve Morin
Does sqoop also allow you to export data from hadoop to a db? I didn't see that in the documentation -Steve On Thu, Nov 12, 2009 at 1:16 PM, Jeff Hammerbacher wrote: > Hey Rajendra, > > For the "E" part of your task, you may want to check out Sqoop: > http://www.cloudera.com/hadoop-sqoop. > > R

Re: ETL using Hadoop ???

2009-11-12 Thread Jeff Hammerbacher
Hey Steve, See https://issues.apache.org/jira/browse/MAPREDUCE-1168. Regards, Jeff On Thu, Nov 12, 2009 at 1:35 PM, Steve Morin wrote: > Does sqoop also allow you to export data from hadoop to a db? I > didn't see that in the documentation > > -Steve > > On Thu, Nov 12, 2009 at 1:16 PM, Jeff

Re: ETL using Hadoop ???

2009-11-12 Thread Steve Morin
Thanks Jeff On Thu, Nov 12, 2009 at 1:36 PM, Jeff Hammerbacher wrote: > Hey Steve, > > See https://issues.apache.org/jira/browse/MAPREDUCE-1168. > > Regards, > Jeff > > On Thu, Nov 12, 2009 at 1:35 PM, Steve Morin wrote: > >> Does sqoop also allow you to export data from hadoop to a db?  I >> di

running hadoop pipes locally (debugging / unit testing / gdb / c++)

2009-11-12 Thread Erez Katz
Greetings, I have written a simple yet pretty handy framework for debugging hadoop pipes programs locally. It is called GaDooB  ... combination of GDB and Hadoop :) . It helps debugging/unit testing c++ hadoop map-reduce programs that were built using hadoop pipes. It is basically a sequencer

[jira] Created: (HADOOP-6371) Misleading information in documentation - Directories don't use host file system space and don't count against the space quota.

2009-11-12 Thread Ravi Phulari (JIRA)
Misleading information in documentation - Directories don't use host file system space and don't count against the space quota. --- Key: HADOOP-6371