Support Questions
Find answers, ask questions, and share your expertise

Error when deploying services using Ambari cluster install wizard

Explorer

I am trying to setup a 2 node cluster with SPARK, HIVE using Ambari Cluster Install Wizard. I had passed first 8 steps and get stuck at "Install, Start and Test" step. Here is the error message from one of the node

I am using Ubuntu 16.04

stderr: 
<script id="metamorph-23258-start" type="text/x-placeholder"></script>Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py", line 35, in <module>
    BeforeAnyHook().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 375, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py", line 29, in hook
    setup_users()
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/shared_initialization.py", line 51, in setup_users
    fetch_nonlocal_groups = params.fetch_nonlocal_groups,
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 166, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/accounts.py", line 84, in action_create
    shell.checked_call(command, sudo=True)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 72, in inner
    result = function(command, **kwargs)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 102, in checked_call
    tries=tries, try_sleep=try_sleep, timeout_kill_strategy=timeout_kill_strategy)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 150, in _call_wrapper
    result = _call(command, **kwargs_copy)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 303, in _call
    raise ExecutionFailed(err_msg, code, out, err)
resource_management.core.exceptions.ExecutionFailed: Execution of 'usermod -G hadoop,user,spark,git,wheel -g hadoop spark' returned 6. usermod: user 'spark' does not exist in /etc/passwd
Error: Error: Unable to run the custom hook script ['/usr/bin/python', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py', 'ANY', '/var/lib/ambari-agent/data/command-399.json', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY', '/var/lib/ambari-agent/data/structured-out-399.json', 'INFO', '/var/lib/ambari-agent/tmp', 'PROTOCOL_TLSv1', '']Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-INSTALL/scripts/hook.py", line 37, in <module>
    BeforeInstallHook().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 382, in execute
    self.save_component_version_to_structured_out(self.command_name)
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 244, in save_component_version_to_structured_out
    stack_select_package_name = stack_select.get_package_name()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/stack_select.py", line 110, in get_package_name
    package = get_packages(PACKAGE_SCOPE_STACK_SELECT, service_name, component_name)
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/stack_select.py", line 224, in get_packages
    supported_packages = get_supported_packages()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/stack_select.py", line 148, in get_supported_packages
    raise Fail("Unable to query for supported packages using {0}".format(stack_selector_path))
resource_management.core.exceptions.Fail: Unable to query for supported packages using /usr/bin/hdp-select<script id="metamorph-23258-end" type="text/x-placeholder"></script>
 stdout:
<script id="metamorph-23260-start" type="text/x-placeholder"></script>2018-07-25 16:44:16,944 - Stack Feature Version Info: Cluster Stack=2.6, Command Stack=None, Command Version=None -> 2.6
2018-07-25 16:44:16,948 - Using hadoop conf dir: /usr/hdp/current/hadoop-client/conf
2018-07-25 16:44:16,948 - Group['livy'] {}
2018-07-25 16:44:16,949 - Group['spark'] {}
2018-07-25 16:44:16,955 - Group['hdfs'] {}
2018-07-25 16:44:16,955 - Group['hadoop'] {}
2018-07-25 16:44:16,956 - Group['users'] {}
2018-07-25 16:44:16,956 - User['hive'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': [u'hadoop'], 'uid': None}
2018-07-25 16:44:16,957 - Modifying user hive
2018-07-25 16:44:16,973 - User['livy'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': [u'hadoop'], 'uid': None}
2018-07-25 16:44:16,975 - Modifying user livy
2018-07-25 16:44:16,989 - User['zookeeper'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': [u'hadoop'], 'uid': None}
2018-07-25 16:44:16,991 - Modifying user zookeeper
2018-07-25 16:44:17,006 - User['spark'] {'gid': 'hadoop', 'fetch_nonlocal_groups': True, 'groups': [u'hadoop'], 'uid': None}
2018-07-25 16:44:17,008 - Modifying user spark
Error: Error: Unable to run the custom hook script ['/usr/bin/python', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py', 'ANY', '/var/lib/ambari-agent/data/command-399.json', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY', '/var/lib/ambari-agent/data/structured-out-399.json', 'INFO', '/var/lib/ambari-agent/tmp', 'PROTOCOL_TLSv1', '']
Command failed after 1 tries
<script id="metamorph-23260-end" type="text/x-placeholder"></script>

This is the summary configuration reported in Review step and screen output for "Deploy" step

Cluster Name : HW2N
Total Hosts : 2 (2 new)
Repositories:
    ubuntu16 (HDP-2.6):
    http://public-repo-1.hortonworks.com/HDP/ubuntu16/2.x/updates/2.6.5.0
    ubuntu16 (HDP-2.6-GPL):
    http://public-repo-1.hortonworks.com/HDP-GPL/ubuntu16/2.x/updates/2.6.5.0
    ubuntu16 (HDP-UTILS-1.1.0.22):
    http://public-repo-1.hortonworks.com/HDP-UTILS-1.1.0.22/repos/ubuntu16
Services:
    HDFS
        DataNode : 1 host
        NameNode : msl-dpe-perf77.msl.lab
        NFSGateway : 0 host
        SNameNode : msl-dpe-perf77.msl.lab
    YARN + MapReduce2
        App Timeline Server : msl-dpe-perf77.msl.lab
        NodeManager : 1 host
        ResourceManager : msl-dpe-perf77.msl.lab
    Tez
        Clients : 1 host
    Hive
        Metastore : msl-dpe-perf77.msl.lab
        HiveServer2 : msl-dpe-perf77.msl.lab
        WebHCat Server : msl-dpe-perf77.msl.lab
        Database : New MySQL Database
    HBase
        Master : msl-dpe-perf77.msl.lab
        RegionServer : 1 host
        Phoenix Query Server : 0 host
    Pig
        Clients : 1 host
    ZooKeeper
        Server : msl-dpe-perf77.msl.lab
    Ambari Metrics
        Metrics Collector : msl-dpe-perf77.msl.lab
        Grafana : msl-dpe-perf77.msl.lab
    SmartSense
        Activity Analyzer : msl-dpe-perf77.msl.lab
        Activity Explorer : msl-dpe-perf77.msl.lab
        HST Server : msl-dpe-perf77.msl.lab
    Spark
        Livy Server : 0 host
        History Server : msl-dpe-perf77.msl.lab
        Thrift Server : 0 host
    Spark2
        Livy for Spark2 Server : 0 host
        History Server : msl-dpe-perf77.msl.lab
        Thrift Server : 0 host
    Slider
        Clients : 1 host

84405-new-bitmap-image.png

1 ACCEPTED SOLUTION

Explorer

resource_management import error was caused by ambari wizard using /usr/lib/python2.6/site-packages. For Ubuntu 16, python 2.7 does not have this directory on path. It can be resolved by adding

PYTHONPATH=/usr/lib/python2.6/site-packages

View solution in original post

5 REPLIES 5

hi @Harry Li ,

Its looks like the installation is failing at step :

2018-07-2516:44:17,008-Modifying user spark

Can you investigate whether its any user creation related issue or due to some extraneous entries of users group from /etc/group

You can get the full error log from /var/lib/ambari-agent/data/error-399.json and /var/lib/ambari-agent/data/output-399.json

in the node : msl-dpe-perf74.msl.lab .

Explorer

Thanks Adi and Akhil

A closer look of the issue seems pointing to failed run of hook.py, here is the message

Error: Error: Unable to run the custom hook script ['/usr/bin/python', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py', 'ANY', '/var/lib/ambari-agent/data/command-399.json', '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY', '/var/lib/ambari-agent/data/structured-out-399.json', 'INFO', '/var/lib/ambari-agent/tmp', 'PROTOCOL_TLSv1', '']Traceback (most recent call last):

I tested this script manually and here is what I got

harry.li@msl-dpe-perf74:/usr/lib/python2.6/site-packages$  sudo /usr/bin/python '/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py'
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.0.6/hooks/before-ANY/scripts/hook.py", line 20, in <module>
    from resource_management import *
ImportError: No module named resource_management

I verified that my python 2.7.12 is installed correctly and resource_management directory has been installed correctly too. Is there a setting in Ambari to control python import path?

root@msl-dpe-perf74:/usr/lib/python2.6/site-packages# ls -l /usr/lib/ambari-agent/lib
total 20
drwxr-xr-x 3 root root 4096 Jul 25 17:54 ambari_commons
drwxr-xr-x 3 root root 4096 Jul 24 17:29 ambari_jinja2
drwxr-xr-x 2 root root 4096 Jul 24 17:29 ambari_simplejson
drwxr-xr-x 2 root root 4096 Jul 24 17:29 examples
drwxr-xr-x 4 root root 4096 Jul 24 17:29 resource_management
root@msl-dpe-perf74:/usr/lib/python2.6/site-packages# ls -l /usr/lib/ambari-agent/lib/resource_management/
total 16
drwxr-xr-x 5 root root 4096 Jul 24 17:29 core
-rwxrwxrwx 1 root root  887 Feb 23 11:10 __init__.py
-rw-r--r-- 1 root root 1049 Jul 24 17:29 __init__.pyc
drwxr-xr-x 6 root root 4096 Jul 24 17:29 libraries

Explorer

resource_management import error was caused by ambari wizard using /usr/lib/python2.6/site-packages. For Ubuntu 16, python 2.7 does not have this directory on path. It can be resolved by adding

PYTHONPATH=/usr/lib/python2.6/site-packages

great . please accept your answer as best answer and close this threaad .

; ;