Hadoop-based product recommendations

2013-05-28 Thread Sai Sai
Just wondering if anyone has any suggestions. We are a group of developers who have been on the bench for a few months. We are trained on Hadoop but do not have any projects to work on. We would like to develop a Hadoop/Hive/Pig based product for our company so we can be of value to the company and not be scared of lay

Deadline Extension: 2013 Workshop on Middleware for HPC and Big Data Systems (MHPC'13)

2013-05-28 Thread MHPC 2013
We apologize if you receive multiple copies of this message. CALL FOR PAPERS: 2013 Workshop on Middleware for HPC and Big Data Systems (MHPC '13), as part of Euro-Par 2013, Aachen, Germany

RE: how does hive find where is MR job tracker

2013-05-28 Thread Frank Luo
Thanks for the reply. Yes, I had the old server name in mapred-site.xml. Oddly enough, I couldn't find a way to update the file through CM.

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Edward Capriolo
ORC, Parquet, and "new ones" are ... new. They do not constitute a huge portion of the user base, if they constitute any at all. I do see a case for what you are describing; currently there are input formats that pass properties via the configuration to the task. Also I feel like some of the confusio

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Owen O'Malley
On Tue, May 28, 2013 at 9:27 AM, Edward Capriolo wrote:
> The question we are diving into is how much of hive is going to be designed around edge cases? Hive really was not made for columnar formats, or self describing data-types. For the most part it handles them fairly well.
I don't vie

Re: how does hive find where is MR job tracker

2013-05-28 Thread Sanjay Subramanian
In Cloudera Manager, there is a Safety Valve feature (it's a multiline text widget) that you can use to input the XML properties that you would use for mapred-site.xml. Possibly, since you changed the JobTracker machine, you have to modify mapred-site.xml to specify the machine name and port. Regards

Re: how does hive find where is MR job tracker

2013-05-28 Thread bejoy_ks
Hive gets the JobTracker from the mapred-site.xml specified within your $HADOOP_HOME/conf. Does the $HADOOP_HOME/conf/mapred-site.xml on the node that runs Hive have the correct value for the JobTracker? If not, changing it to the right one might resolve your issue. Regards, Bejoy KS Sent from re
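
As a quick way to check which JobTracker a given configuration file points at, the lookup described above can be sketched in Python. This is only an illustration, not how Hive itself reads the file: the sample XML, the host name `jt-host.example.com`, the port `8021`, and the helper `get_property` are all hypothetical placeholders. The property name `mapred.job.tracker` is the one mentioned in this thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for $HADOOP_HOME/conf/mapred-site.xml.
# The host and port below are placeholders, not values from the thread.
SAMPLE_MAPRED_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <value>jt-host.example.com:8021</value>
  </property>
</configuration>
"""


def get_property(xml_text, name):
    """Return the value of a Hadoop-style <property>, or None if absent."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        # Each property is a <name>/<value> pair inside <configuration>.
        if prop.findtext("name", default="").strip() == name:
            return prop.findtext("value", default="").strip()
    return None


if __name__ == "__main__":
    print(get_property(SAMPLE_MAPRED_SITE, "mapred.job.tracker"))
```

To inspect a real file, the text of the mapred-site.xml on the node that runs Hive could be read and passed in instead of the sample string.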

how does hive find where is MR job tracker

2013-05-28 Thread Frank Luo
I have a Cloudera cluster, version 4.2.0. In the Hive configuration, I have "MapReduce Service" set to "mapreduce1", which is my MR service. However, unless I set "mapred.job.tracker", whenever I run a hive command, it always sends the job to the wrong job tracker. Here is the error: java.net.

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Edward Capriolo
Right, hive discarding the key is rather annoying. I have a series of key+value input formats, key-only input formats, etc. Having hive return both the key and the value would be a breaking change, but would not be very difficult. The question we are diving into is how much of hive is going to be designed

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Owen O'Malley
On Tue, May 28, 2013 at 8:45 AM, Edward Capriolo wrote:
> That does not really make sense. You're breaking the layered approach. InputFormats read/write data, serdes interpret data based on the table definition. It's like asking "Why can't my input format run assembly code?"
The current model

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Edward Capriolo
That does not really make sense. You're breaking the layered approach. InputFormats read/write data, serdes interpret data based on the table definition. It's like asking "Why can't my input format run assembly code?"
On Tue, May 28, 2013 at 11:42 AM, Owen O'Malley wrote:
> On Tue, May 28,

Re: Accessing Table Properties from InputFormat

2013-05-28 Thread Owen O'Malley
On Tue, May 28, 2013 at 7:59 AM, Peter Marron <peter.mar...@trilliumsoftware.com> wrote:
> Hi,
> Hive 0.10.0 over Hadoop 1.0.4.
> Further to my filtering questions of before.
> I would like to be able to access the table properties from inside my custom In

Accessing Table Properties from InputFormat

2013-05-28 Thread Peter Marron
Hi, Hive 0.10.0 over Hadoop 1.0.4. Further to my filtering questions of before. I would like to be able to access the table properties from inside my custom InputFormat. I've done searches and there seem to be some other people who have had a similar problem. The closest I can see to a solution

Combining 2 JSON objects in Hive

2013-05-28 Thread Ike Walker
Hello, I have two JSON objects stored as strings in a Hive table. I would like to combine them into a single JSON object in Hive. I'm running Hive 0.7, but am planning to upgrade soon so a solution that works in Hive 0.8 could be acceptable as well. For example, here's the data now: +-
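
For illustration only, the intended result (one JSON object containing the keys of both inputs) can be sketched in Python. This is not a Hive-native answer and not necessarily the solution reached in the thread. The helper name `merge_json_objects` is hypothetical, and the collision rule (the second object wins when both contain the same key) is an assumption, since the message does not say how duplicates should be handled.

```python
import json


def merge_json_objects(left, right):
    """Merge two JSON object strings into a single JSON object string.

    Assumption: on a key collision, the value from `right` overwrites
    the value from `left`. Both inputs must be JSON objects.
    """
    merged = json.loads(left)
    merged.update(json.loads(right))
    # sort_keys gives a stable, predictable key order in the output.
    return json.dumps(merged, sort_keys=True)


if __name__ == "__main__":
    print(merge_json_objects('{"a": 1}', '{"b": 2}'))
```

One possible way to apply logic like this to table rows is a streaming script invoked through Hive's TRANSFORM clause, though whether that fits this use case is not something the message settles.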

IndexOutOfBoundsException with Snappy compressed SequenceFile from Flume

2013-05-28 Thread Keith Wright
Hi all, This is my first post to the hive mailing list and I was hoping to get some help with the exception I am getting below. I am using CDH4.2 (hive 0.10.0) to query Snappy-compressed SequenceFiles that are built using Flume (relevant portion of the Flume conf below as well). Note that I'

Re: Where should Hive Process Installed??

2013-05-28 Thread yuchen xie
I have just run into this. You can try adding this configuration to hive-site.xml: <property><name>mapreduce.jobtracker.address</name><value>ignorethis</value></property>. Or you can apply the patch from https://issues.apache.org/jira/browse/HIVE-3029 to your Hive.
2013/4/8 rohithsharma
> Hi
> I am using Hive-0.9.0 + Hadoop-2.0.1 with 2