Member since
03-14-2016
4721
Posts
1111
Kudos Received
874
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 2722 | 04-27-2020 03:48 AM | |
| 5283 | 04-26-2020 06:18 PM | |
| 4447 | 04-26-2020 06:05 PM | |
| 3570 | 04-13-2020 08:53 PM | |
| 5377 | 03-31-2020 02:10 AM |
07-23-2017
08:08 AM
@Joshva Peter
If this helped answering your query then it will be great if you mark this thread as "Answered" by clicking on the "Accepted" link, that way it will be helpful for HCC users to quickly find the answered threads.
... View more
07-21-2017
05:21 PM
1 Kudo
@Joshva Peter hdfs-audit and namenode log files are generated via log4j framework. Which is initialized at the time of component startup and creates the log file for writing or use the existing log file for writing in append mode. However if you delete these log files in the running mode of the components then the logging will not be reinitialized hence you will not see the new log file generated. This happens because the log4j appender will has a handle for the deleted file hence it will continue to write to it (which does not exist now because it is deleted), log4j will not detect that it has been deleted. In custom applications we can make it happen by reinitializing the logging configuration via code, But existing components like NameNode does not control logging initialization/resetConfiguration at the code level. org.apache.log4j.LogManager.resetConfiguration(); So you will need to restart the affected components (like NameNode in this case) to see the new files getting created.
... View more
07-21-2017
06:07 AM
@Avijeet Dash You can find the list of Supported HDF components here: https://docs.hortonworks.com/HDPDocuments/HDF3/HDF-3.0.0/bk_release-notes/content/ch_hdf_relnotes.html Apache Ambari 2.5.1
Apache Kafka 0.10.2.1
Apache NiFi 1.2.0
Apache Ranger 0.7.0
Apache Storm 1.1.0
Apache ZooKeeper 3.4.6
Apache MiNiFi Java Agent 0.2.0
Apache MiNiFi C++ Technical Preview
Hortonworks Schema Registry 0.3.0
Hortonworks Streaming Analytics Manager 0.5.0
. - With HDP you can get more components/service, Please see the component lists: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.1/bk_release-notes/content/comp_versions.html - Regarding the "+ symbol" can you lease share the screenshot where exactly do you see them? - If you see the [+] symbol in the services tab on the left panel of ambari UI then it indicates maintenance mode. To know more about maintenance mode please refer to: https://docs.hortonworks.com/HDPDocuments/Ambari-2.5.1.0/bk_ambari-operations/content/setting_maintenance_mode.html Setting Maintenance Mode enables you to suppress alerts and omit bulk operations for specific services, components, and hosts in an Ambari-managed cluster when you want to focus on performing hardware or software maintenance, changing configuration settings, troubleshooting, decommissioning, or removing cluster nodes. With Sandbox few services are by default set in "Maintenance Mode". .
... View more
07-21-2017
01:25 AM
1 Kudo
@nshelke There was an issue reported for ambari 2.4 (fixed in 2.4.0) but the stack looks similar. https://issues.apache.org/jira/browse/AMBARI-17991 Can you check if the workaround mentioned in the mentioned JIRA works for you, to edit the "/usr/lib/python2.6/site-packages/ambari_agent/security.py" file and increase the timeout to a larger value like 360 seconds. def create_connection(self):
if self.sock:
self.sock.close()
logger.info("SSL Connect being called.. connecting to the server")
sock = socket.create_connection((self.host, self.port), 360) . 360 or more to see of it is still getting times out? Is it happening with all the ambari agents? Have you tried running the Ambari Agent in Debug mode to extract more details? .
... View more
07-19-2017
01:06 PM
@Rajesh Reddy Are you sure that your Ambari Server FQDN is "ip-172-31-27-38.ap-south-1.compute.internal" Is this FQDN accessible from the Agent machines? (I means from the agent machines are you able to resolve this hostname ?) ping ip-172-31-27-38.ap-south-1.compute.internal - Are you able to access the port 8440 from agent machine? (this is to isolate any N/W or Firewall issue) # telnet ip-172-31-27-38.ap-south-1.compute.internal 8440 . - If above command does not connect to the ambari server on port 8440 then please check the IPTables on the ambari server if the firewall is disabled or not? # service iptables stop .
... View more
07-18-2017
01:20 PM
2 Kudos
@david garcia You are getting the error: Caused by: java.security.cert.CertificateException: No name matching master1.xxx.yyyy.net found Please check the Certificate CN (Common Name) that you used to create the certificate. It should be the same as hostname teing used in the URL while accessing the service component. If the CN in the certificate is not the same as the hostname of the component, you will get the error "java.security.cert.CertificateException: No name matching <host_name> found", where , "host_name" is the host name you used as part of the URL in your client application. (Here client is hive view) . Please not that, you cannot change the CN in an already created certificate. So you will need to create a new certificate with the correct hostname of the service.
... View more
07-17-2017
01:40 PM
@Hovo Khachikyan Looks like some N/W configuration issue at your end which might be blocking access to few specific sites Or the mirrorlist is not connecting from your side. I can access the same repo without any issue locally: Example: [root@standalone1 tmp]# wget "http://mirrorlist.centos.org/?release=6&arch=x86_64&repo=os"
--2017-07-17 13:34:17-- http://mirrorlist.centos.org/?release=6&arch=x86_64&repo=os
Resolving mirrorlist.centos.org (mirrorlist.centos.org)... 85.236.43.108, 212.69.166.138, 216.176.179.218, ...
Connecting to mirrorlist.centos.org (mirrorlist.centos.org)|85.236.43.108|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 493 [text/plain]
Saving to: ‘index.html?release=6&arch=x86_64&repo=os’
100%[==================================================================================================================================================================================================>] 493 --.-K/s in 0s
2017-07-17 13:34:17 (54.8 MB/s) - ‘index.html?release=6&arch=x86_64&repo=os’ saved [493/493]
.
.
[root@standalone1 tmp]# cat index.html\?release\=6\&arch\=x86_64\&repo\=os
http://mirror.oss.ou.edu/centos/6.9/os/x86_64/
http://centos.firehosted.com/6.9/os/x86_64/
http://mirrors.ocf.berkeley.edu/centos/6.9/os/x86_64/
http://mirror.ancl.hawaii.edu/linux/centos/6.9/os/x86_64/
http://lug.mtu.edu/centos/6.9/os/x86_64/
http://centos.host-engine.com/6.9/os/x86_64/
http://mirror.millry.co/CentOS/6.9/os/x86_64/
http://mirror.teklinks.com/centos/6.9/os/x86_64/
http://centos.mirrors.tds.net/pub/linux/centos/6.9/os/x86_64/
http://mirrors.cmich.edu/centos/6.9/os/x86_64/ .
... View more
07-17-2017
01:17 PM
@mravipati Please check the stack Trace of NullPointerException to verify if it is the following API that is causing the NullPointer? This you can find in ambari-server.log Caught a runtime exception executing a query java.lang.NullPointerException
at org.apache.ambari.server.controller.internal.AlertResourceProvider.toResource(AlertResourceProvider.java . If yes then please check if there is any inconsistency int he following two tables in your Ambari Database: >>> select * from alert_current where definition_id not in (select definition_id from alert_definition);
>>> select * from alert_history where alert_definition_id not in (select definition_id from alert_definition); . If above queries returns any result then please delete those records as following then restart ambari server. >>> delete from alert_history where alert_definition_id not in (select definition_id from alert_definition);
>>> delete from alert_current where definition_id not in (select definition_id from alert_definition);
. Also the following: >>> select * from alert_current where history_id not in (select alert_id from alert_history);
>>> delete from alert_current where history_id not in (select alert_id from alert_history); Then restart ambari-server and login to ambari UI then doublecheck the logs if NPE error is gone.
... View more
07-17-2017
12:00 PM
@Rohit Sharma If all those stale alerts are coming from any particular host then you will need to check the hostname which is causing the stale alert. # curl -i -H "X-Requested-By: ambari"-u admin:admin -X GET http://AMBARI_SERVER_HOST:8080/api/v1/hosts . Please list all the hosts using the above curl command (or by opening the mentioned URL in the browser, Where you have logged in to ambari UI) and then match the hosts list to findout if by mistake any unwanted host is being registered. Some times old host can have the ambari-agent.ini entry to point to ambari server. .
... View more
07-17-2017
10:19 AM
1 Kudo
@Rohit Sharma Did you perform a Disable and Enable operation on the stale alert definition from ambari UI. Please wait for 10-15 seconds between disabling and enabling the stale alert to see if it gets cleared. .
... View more