One error that stands out is NetUtil.py failing to connect to the Slider agent service. The OS Python version is 2.7.6, whose ssl module does not have the attribute `_create_unverified_context`, and this results in the following error when running LLAP for Hive2:
INFO 2019-01-17 09:04:59,889 NetUtil.py:66 - Failed to connect to https://shdp-ycn04:38097/ws/v1/slider/agents/ due to 'module' object has no attribute '_create_unverified_context'
INFO 2019-01-17 09:04:59,889 NetUtil.py:85 - Server at https://shdp-ycn04:38097/ws/v1/slider/agents/ is not reachable, sleeping for 10 seconds...

On Thu, Jan 17, 2019 at 1:05 PM Abhishek Gupta <abhila...@gmail.com> wrote:

> I have been trying to start Hive2 Interactive on HDP 2.6.5 that ships with
> Hive 2.1.0. I have followed the guidelines mentioned in the following
> articles about how to size and tune LLAP with Yarn
>
> https://community.hortonworks.com/articles/149486/llap-sizing-and-setup.html
>
> But LLAP daemon fails to start and Hive interactive gives up after
> configured retries.
> Also followed the following steps to troubleshoot and the error doesn't
> seem to be sizing related.
>
> https://community.hortonworks.com/articles/149899/investigating-when-llap-doesnt-start.html
>
> Following is my cluster setup in brief:
>
> hive.llap.daemon.yarn.container.mb= 10 GB
> llap_heap_size= 7 GB
> In-Memory Cache per Daemon= 2 GB
> Head room= 1 GB
> (hive.llap.daemon.yarn.container.mb should be = llap_heap_size + In-Memory
> Cache per Daemon + Headroom)
> Yarn container max= 14.6 GB, min 1 GB
> Number of executors per LLAP Daemon= 1
> Yarn pre-emption enabled
> Retries for checking LLAP status= 20
>
> Attached the startup script logs and Yarn container relevant logs.
> https://pastebin.com/raw/4ywJHp4M
> https://pastebin.com/raw/AiJkfwUR
>
> Following is a snapshot of Slider AM page
> Application: llap0
> Status all containers allocated
> Total number of containers 5
> Create time: 17 Jan 2019 05:40:15 GMT
> Running since: 17 Jan 2019 05:40:15 GMT
> Time last flexed: N/A
> Application storage path: hdfs://hsft/user/hive/.slider/cluster/llap0/database
>
> Application configuration path:
> hdfs://hsft/user/hive/.slider/cluster/llap0/snapshot
>
> Component Instances
> *Component* *Desired* *Actual* *Outstanding Requests* *Failed* *Failed to start* *Placement*
> LLAP 4 4 0 0 0
> slider-appmaster 1 1 0 0 0
>
> Application Container Diagnostics
> *Container ID* *Component* *State* *Exit Code* *Logs* *Diagnostics*
> container_e27_1547703090681_0001_01_000002 LLAP 3 -1000 Logs
> container_e27_1547703090681_0001_01_000003 LLAP 3 -1000 Logs
> container_e27_1547703090681_0001_01_000004 LLAP 3 -1000 Logs
> container_e27_1547703090681_0001_01_000005 LLAP 3 -1000 Logs
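
To confirm the ssl diagnosis on each node, something like the sketch below could be run with the same interpreter the agent uses (I am assuming that is the OS default `python`). As far as I know, `ssl._create_unverified_context` first appeared in Python 2.7.9, so a 2.7.6 interpreter would report it as missing. The helper names `has_unverified_context` and `describe_ssl_support` are my own and are not part of NetUtil.py or Slider.

```python
import ssl
import sys


def has_unverified_context(ssl_module=ssl):
    """Return True if the ssl module exposes _create_unverified_context."""
    return hasattr(ssl_module, "_create_unverified_context")


def describe_ssl_support(ssl_module=ssl, version_info=sys.version_info):
    """Build a one-line report for the given interpreter version and ssl module."""
    version = "%d.%d.%d" % (version_info[0], version_info[1], version_info[2])
    if has_unverified_context(ssl_module):
        state = "available"
    else:
        state = "missing"
    return "Python %s: ssl._create_unverified_context is %s" % (version, state)


if __name__ == "__main__":
    # Works on both Python 2.7 and Python 3.
    print(describe_ssl_support())
```

If the report says "missing" on the LLAP nodes, that would match the `'module' object has no attribute '_create_unverified_context'` message in the NetUtil.py log.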