Re: Hive setup on Hadoop cluster

JOHN MILLER Fri, 20 May 2016 04:50:44 -0700

Greetings  Attached is the results of the select count(1) from table

The contents of the dataset(table) is 18 columns and 3340 rows


hive> select count(1) from commoncrawl18;
Query ID = jmill383_20160520074710_3b5ee662-2ead-4d89-9123-df9b2cf6e2d7
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
Starting Job = job_1463594979064_0005, Tracking URL =
http://starchild:8088/proxy/application_1463594979064_0005/
Kill Command = /opt/hadoop/bin/hadoop job  -kill job_1463594979064_0005
Hadoop job information for Stage-1: number of mappers: 0; number of
reducers: 0
2016-05-20 07:47:15,936 Stage-1 map = 0%,  reduce = 0%
Ended Job = job_1463594979064_0005 with errors
Error during job, obtaining debugging information...
Job Tracking URL:
http://starchild:8088/cluster/app/application_1463594979064_0005
FAILED: Execution Error, return code 2 from
org.apache.hadoop.hive.ql.exec.mr.MapRedTask
MapReduce Jobs Launched:
Stage-Stage-1:  HDFS Read: 0 HDFS Write: 0 FAIL
Total MapReduce CPU Time Spent: 0 msec
hive>


On Thu, May 19, 2016 at 8:56 AM, Mich Talebzadeh <mich.talebza...@gmail.com>
wrote:

> Hi John,
>
> stderr does not say much
>
> Exception in thread "main" java.lang.IncompatibleClassChangeError:
> Implementing class
>
> at java.lang.ClassLoader.defineClass1(Native Method)
>
> at java.lang.ClassLoader.defineClass(ClassLoader.java:800)
>
> at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
>
> at java.net.URLClassLoader.defineClass(URLClassLoader.java:449)
>
> at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
>
> at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
>
> at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
>
> at java.security.AccessController.doPrivileged(Native Method)
>
> at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
>
> at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
>
> at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
>
> at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
>
> at java.lang.Class.getDeclaredMethods0(Native Method)
>
> at java.lang.Class.privateGetDeclaredMethods(Class.java:2615)
>
> at java.lang.Class.getMethod0(Class.java:2856)
>
> at java.lang.Class.getMethod(Class.java:1668)
>
> at sun.launcher.LauncherHelper.getMainMethod(LauncherHelper.java:494)
>
> at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:486)
>
>
> However, sounds like you may have an  issue with yarn container memory.
>
>
> How big is the underlying table. Also can you just do a plain select
> count(1) from <table> itself (no distinct etc) and see it works?
>
>
> HTH
>
>
>
>
> Dr Mich Talebzadeh
>
>
>
> LinkedIn * 
> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*
>
>
>
> http://talebzadehmich.wordpress.com
>
>
>
> On 18 May 2016 at 19:46, JOHN MILLER <jmill...@gmail.com> wrote:
>
>> Mich
>>
>> Attaching hadoop logs
>>
>> John M
>>
>> On Wed, May 18, 2016 at 1:48 PM, Mich Talebzadeh <
>> mich.talebza...@gmail.com> wrote:
>>
>>> Hi John,
>>>
>>> can you please a new thread for your problem so we can deal with
>>> separately.
>>>
>>> thanks
>>>
>>> Dr Mich Talebzadeh
>>>
>>>
>>>
>>> LinkedIn * 
>>> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>>> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*
>>>
>>>
>>>
>>> http://talebzadehmich.wordpress.com
>>>
>>>
>>>
>>> On 18 May 2016 at 15:11, JOHN MILLER <jmill...@gmail.com> wrote:
>>>
>>>> Greetings Mitch
>>>>
>>>> I have an issue with running mapreduce in hive   I am getting a
>>>> FAILED: Execution Error, return code 2 from
>>>> org.apache.hadoop.hive.ql.exec.mr.MapRedTask
>>>>
>>>> error while attemtiing to execute SELECT DISTINCT(fieldname) FROM TABLE
>>>> x  or SELECT  COUNT(*)  FROM TABLE x;;  Trying to run cascading-hive gives
>>>> me the same problem as well
>>>>
>>>> Please advise if u have come across this type of problem or generated
>>>> some ideas as to resolve this problema
>>>>
>>>> On Wed, May 18, 2016 at 9:53 AM, Mich Talebzadeh <
>>>> mich.talebza...@gmail.com> wrote:
>>>>
>>>>> Hi Kuldeep,
>>>>>
>>>>> Have you installed hive on any of these nodes.
>>>>>
>>>>> Hive is basically an API. You will also need to install sqoop as well
>>>>> if you are going to import data from other RDBMss like Oracle, Sybase etc.
>>>>>
>>>>> Hive has a very small footprint so my suggestion is to install it on
>>>>> all your boxes and permission granted to Haddop user say hduser.
>>>>>
>>>>> Hive will require a metadata in  a database of your choice. default is
>>>>> derby which I don't use. try to use a reasonable database. ours is on
>>>>> Oracle
>>>>>
>>>>>  Now under directory $HIVE_HOME/conf/hive-site.xml you can set up info
>>>>> about Hadoop and your metastore etc. You also need to set up environment
>>>>> variables for both Hadoop and hive in your start up script like .profile
>>>>> .kshrc etc
>>>>>
>>>>> Have a look anyway.
>>>>>
>>>>> HTH
>>>>>
>>>>> Dr Mich Talebzadeh
>>>>>
>>>>>
>>>>>
>>>>> LinkedIn * 
>>>>> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>>>>> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*
>>>>>
>>>>>
>>>>>
>>>>> http://talebzadehmich.wordpress.com
>>>>>
>>>>>
>>>>>
>>>>> On 18 May 2016 at 13:49, Kuldeep Chitrakar <
>>>>> kuldeep.chitra...@synechron.com> wrote:
>>>>>
>>>>>> I have a very basic question regarding Hadoop & Hive setup.  I have 7
>>>>>> Machines say M1,M2,M3,M4,M5,M6,M7
>>>>>>
>>>>>>
>>>>>>
>>>>>> Hadoop Cluster Setup:
>>>>>>
>>>>>>
>>>>>>
>>>>>> Namenode: M1
>>>>>>
>>>>>> Seondary Namenode: M2
>>>>>>
>>>>>> Datanodes: M3,M4,M5
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> Now question is:
>>>>>>
>>>>>>
>>>>>>
>>>>>> Where do I need to install Hive.
>>>>>>
>>>>>> 1.       Should I install Hiverserver on M6
>>>>>>
>>>>>> a.       if yes does that machine needs core Hadoop JAR’s installed?
>>>>>>
>>>>>> b.      How this Hive server knows where Hadoop cluster is. What
>>>>>> configurations needs to be done?
>>>>>>
>>>>>> c.       How can we restrict this machine to be only hive server and
>>>>>> not datanode of Hadoop cluster?
>>>>>>
>>>>>>
>>>>>>
>>>>>> 2.       Where do we install Hive CLI
>>>>>>
>>>>>> a.       If I want to hive M7 as Hive CLI, then what needs to be
>>>>>> installed on this machine.
>>>>>>
>>>>>> b.      Any required configurations.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Thanks,
>>>>>>
>>>>>> Kuldeep
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>

Re: Hive setup on Hadoop cluster

Reply via email to