I have been trying to start Hive2 Interactive on HDP 2.6.5, which ships with Hive 2.1.0. I followed the guidelines in the following articles on how to size and tune LLAP with YARN.
However, the LLAP daemon fails to start, and Hive Interactive gives up after the configured number of retries. I also followed the steps below to troubleshoot, and the error does not appear to be sizing related.
|Status|all containers allocated|
|Total number of containers||
|Create time:|17 Jan 2019 05:40:15 GMT|
|Running since:|17 Jan 2019 05:40:15 GMT|
|Time last flexed:|N/A|
|Application storage path:|hdfs://hsft/user/hive/.slider/cluster/llap0/database|
|Application configuration path:|hdfs://hsft/user/hive/.slider/cluster/llap0/snapshot|

|Component|Desired|Actual|Outstanding Requests|Failed|Failed to start|Placement|

|Container ID|Component|State|Exit Code|Logs|Diagnostics|
@Abhishek Gupta I'm seeing a permission denied error in the logs.
/hadoop/yarn/local/usercache/hive/appcache/application_1547703090681_0001/container_e27_1547703090681_0001_01_000004/app/install//bin/runLlapDaemon.sh: Permission denied
Yes, I did see this error when Ambari issues a `nohup nice` on this script. However, I am not sure why this is happening.
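One way to narrow this down is to check, as the `hive` user on the NodeManager host, whether the script actually carries execute bits and whether the OS lets that user execute it. The sketch below is only a diagnostic aid; the helper name `describe_exec_permission` is hypothetical, and the path should be replaced with the `runLlapDaemon.sh` path from your own container log. If the execute bits are set but the access check still fails, the cause might be something outside the file mode, such as a `noexec` mount on the YARN local directory. That is an assumption to verify, not something the logs above confirm.

```python
import os
import stat


def describe_exec_permission(path):
    """Summarize whether `path` exists, has execute bits, and is executable.

    Hypothetical helper for troubleshooting; not part of Slider or Ambari.
    """
    if not os.path.exists(path):
        return {"exists": False, "mode": None,
                "any_exec_bit": False, "executable": False}

    # Permission bits only (for example 0o755), without the file-type bits.
    mode = stat.S_IMODE(os.stat(path).st_mode)
    exec_bits = stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH
    return {
        "exists": True,
        "mode": mode,
        "any_exec_bit": bool(mode & exec_bits),
        # Asks the OS whether the *current* user may execute the file.
        "executable": os.access(path, os.X_OK),
    }


if __name__ == "__main__":
    import sys

    # Pass the script path as the first argument.
    if len(sys.argv) > 1:
        info = describe_exec_permission(sys.argv[1])
        print("exists={0} any_exec_bit={1} executable={2}".format(
            info["exists"], info["any_exec_bit"], info["executable"]))
        if info["mode"] is not None:
            print("mode={0:o}".format(info["mode"]))
```

Running it as the same user that launches the container matters, because `os.access` answers for the calling user only.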
Another striking error comes from NetUtil.py, which fails to connect to the Slider agent service. The OS Python is version 2.7.6, whose `ssl` module does not have the attribute `_create_unverified_context` (it was added in Python 2.7.9). This results in the following error in the slideragent log:
INFO 2019-01-17 09:04:59,889 NetUtil.py:66 - Failed to connect to https://shdp-ycn04:38097/ws/v1/slider/agents/ due to 'module' object has no attribute '_create_unverified_context'
INFO 2019-01-17 09:04:59,889 NetUtil.py:85 - Server at https://shdp-ycn04:38097/ws/v1/slider/agents/ is not reachable, sleeping for 10 seconds...
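To confirm that the interpreter is the cause, a quick check like the one below can be run with the same Python that the Slider agent uses on the node. It is a minimal sketch: the function name `unverified_context_supported` is hypothetical, and the script only reports whether the `ssl` module exposes `_create_unverified_context`; it does not contact the agent endpoint.

```python
import ssl
import sys


def unverified_context_supported(ssl_module=ssl):
    """Return True if the given ssl module has _create_unverified_context.

    The attribute is present in Python 2.7.9 and later; older 2.7.x
    releases such as 2.7.6 do not have it.
    """
    return hasattr(ssl_module, "_create_unverified_context")


if __name__ == "__main__":
    version = ".".join(str(part) for part in sys.version_info[:3])
    print("Python {0}: _create_unverified_context available = {1}".format(
        version, unverified_context_supported()))
```

If this reports that the attribute is unavailable, the `'module' object has no attribute '_create_unverified_context'` message above is consistent with the interpreter version rather than with a network problem.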