Created 08-03-2017 08:08 AM
My HDF cluster is not starting after a server reboot. I was trying to mimic a recent data centre crash caused by a power failure, and to implement automatic restart of our dev cluster using Ambari blueprints. NIFI_CA fails with the error below, and NiFi does not start. I am not sure why there is a permission issue.
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/nifi_ca.py", line 114, in <module>
    CertificateAuthority().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 280, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/nifi_ca.py", line 92, in start
    Execute((run_ca_script, params.jdk64_home, ca_server_script, params.nifi_config_dir + '/nifi-certificate-authority.json', params.nifi_ca_log_file_stdout, params.nifi_ca_log_file_stderr, status_params.nifi_ca_pid_file), user=params.nifi_user)
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 155, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 273, in action_run
    tries=self.resource.tries, try_sleep=self.resource.try_sleep)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner
    result = function(command, **kwargs)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 92, in checked_call
    tries=tries, try_sleep=try_sleep)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper
    result = _call(command, **kwargs_copy)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 293, in _call
    raise ExecutionFailed(err_msg, code, out, err)
resource_management.core.exceptions.ExecutionFailed: Execution of '/var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/run_ca.sh /usr/java/jdk1.8.0_77 /var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/files/nifi-toolkit-1.1.0.2.1.4.0-5/bin/tls-toolkit.sh /usr/hdf/current/nifi/conf/nifi-certificate-authority.json /var/log/nifi/nifi-ca.stdout /var/log/nifi/nifi-ca.stderr /var/run/nifi/nifi-ca.pid' returned 126. -bash: /var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/run_ca.sh: Permission denied
Created 08-03-2017 08:15 AM
The error says:
-bash: /var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/run_ca.sh: Permission denied
Can you please check whether that file exists and what its permissions are, and share the output?
Are you starting ambari-agent as the "root" user or as a non-root user?
You might also want to clear the ambari-agent "cache" directory and then restart the agent so that it pulls fresh files from the Ambari server.
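If it helps, the existence and permission check asked for above can be sketched in Python. This is only a minimal illustration: the helper name describe_permissions is made up for this post, and it assumes a Python 3 interpreter on the host (stat.filemode is not available in the Python 2.6 shown in the traceback).

```python
import os
import stat


def describe_permissions(path):
    """Report whether path exists and which execute bits are set on it."""
    if not os.path.exists(path):
        return {"exists": False}
    st = os.stat(path)
    mode = st.st_mode
    return {
        "exists": True,
        "mode": stat.filemode(mode),  # same style as the first column of `ls -l`
        "uid": st.st_uid,
        "gid": st.st_gid,
        "owner_exec": bool(mode & stat.S_IXUSR),
        "group_exec": bool(mode & stat.S_IXGRP),
        "other_exec": bool(mode & stat.S_IXOTH),
    }


if __name__ == "__main__":
    # Path taken from the error message in this thread.
    script = ("/var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/"
              "package/scripts/run_ca.sh")
    print(describe_permissions(script))
```

Running it on the affected node prints a small dictionary that can be pasted into the thread in place of the ls output.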
Created 08-03-2017 08:39 AM
The file has the correct permissions (see below), and I am running ambari-agent as the "root" user.
-rwxr-xr-x 1 root root 2002 Aug 1 10:44 /var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/package/scripts/run_ca.sh
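Since the file mode itself looks fine, one more thing that might be worth ruling out is a parent directory that the NiFi service user cannot traverse; the traceback shows the script being executed with user=params.nifi_user rather than as root. The following is a rough sketch only, assuming Python 3 and run as root: the function name is hypothetical, and it deliberately looks only at the "other" execute (search) bit, ignoring group membership and ACLs.

```python
import os
import stat


def ancestors_without_other_search(path):
    """List ancestor directories of path lacking the 'other' execute (search) bit."""
    blocked = []
    current = os.path.dirname(os.path.abspath(path))
    while True:
        if not (os.stat(current).st_mode & stat.S_IXOTH):
            blocked.append(current)
        parent = os.path.dirname(current)
        if parent == current:  # reached the filesystem root
            break
        current = parent
    return blocked


if __name__ == "__main__":
    # Path taken from the error message in this thread.
    script = ("/var/lib/ambari-agent/cache/common-services/NIFI/1.0.0/"
              "package/scripts/run_ca.sh")
    if os.path.exists(script):
        print(ancestors_without_other_search(script))
```

An empty list means every directory on the way to the script is searchable by "other" users, so the cause would have to be something else.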
Created 08-03-2017 11:30 AM
@Jay SenSharma Restarting ambari-server did it for me. Thanks