Member since
08-28-2015
194
Posts
45
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2212 | 07-05-2017 11:58 PM |
06-14-2017
02:30 PM
When install HDP at registration step, we always see thewarning on hugepage is enabled. here are the commands to disable hugepage on linux. I found them when I install CDH 5 on AWS. echo never >/sys/kernel/mm/transparent_hugepage/defrag echo never > /sys/kernel/mm/transparent_hugepage/enabled also, it is recommended to add to /etc/rc.d, so it is disabled after each reboot.
... View more
Labels:
- Labels:
-
Apache Ambari
06-14-2017
02:24 PM
In the registration hosts during install HDP cluster, there is warning about hugepage is enbaled on all hosts. However, only when I install CDH 5, I found these 2 commands provided to disable hugepage on linux echo never >/sys/kernel/mm/transparent_hugepage/defrag echo never > /sys/kernel/mm/transparent_hugepage/enabled they commended to add to /etc/rc.d/, so every time it is up after reboot. thanks, Robin
... View more
Labels:
- Labels:
-
Apache Ambari
06-14-2017
03:47 AM
Thank you Cdraper, Finally I use this combination, RHEL 7.3 (m4.large) , ambari 2.4.10 and HDP 2.5 install 7 nodes on AWS. the only issues I deal with is change the namenode java heap size to 4gb in my case. then everything worked. your info is still very helpful. I am going to setup kafka 3 nodes and mongodb from here. will keep you posted. thanks again. Robin
... View more
06-12-2017
02:21 PM
thank you for getting back to me. 1 iptables -L dont have any ouput 2. the firewall package is available but not installed in a AWS/EC2 instance. therefore, I did run systemctl stop firewalld and systemctl disable firewalld on REHL 7 and service iptables stop, chkconfig iptables off on REHL6. 3. the point here, if these firewall or iptables package is not installed, I may have to think they are never act, so if not installed, they are not there = disabled. let me know if you think differently. thanks,
... View more
06-12-2017
02:13 PM
Thank you Cdraper for getting back to me. 1. I dont have HA setup, but I think HDP requires to have secondary name node, I didnt do any configure yet, but snamenode is on the list to start 2. do yo think Hbase space issue could cause this problem. as Hbase stores all metadata for namenode and cluster, if Hbase has problem, then namenode cant start, so does snamenode then HDFS then entire cluster? 3. if I use m4.xlarge instance on AWS, ended up with warnings on hbase_master_heapsize, hbase_collector_heapszie etc, if you change to the recommended values, the new sets of parameters come up. one time I have changed 7 times and never get rid of that warning. if I ignore the warning, my namenode never, never started. 4. if I use m4.large, I dont get the these heapsize warnings, instead I get warning on packages not install, so I tried to manually install them on host, it just dont execute or yum dont have this package. I am kind of give up on HDP 2.4 install on RHEL 6, 7/AWS/EC2, so what do you suggest, shall I try ubuntu or Centos, which version? HDP stable version is 2.4.1.0 and ubuntu 12? After all these experiment, I just feel HDP with AWS is not compatible, I could be wrong. either AWS/EC2 root / size too samll, you can resize in instance creation process, but dont get the size you define. if instance RAM = 8gb or more, then HDP get lost on configure them, that is why all these heap_size size issues.... sorry long story, thank you for your help. Robin
... View more
06-11-2017
05:58 PM
Clink, I got the same error, and I changed my /etc/hosts file as: 52.60.239.41 ip-172-31-12-19.ca-central-1.compute.internal node1 also, I restart agent on each hosts, 4 of them, I still get the same error. please help. Robin
... View more
06-11-2017
05:45 PM
Clint, my hosts file look like this, and seemed it is not working. 52.60.239.41 ip-172-31-12-19.ca-central-1.compute.internal 52.60.157.176 ip-172-31-3-218.ca-central-1.compute.internal 35.182.18.189 ip-172-31-3-179.ca-central-1.compute.internal 35.182.25.52 ip-172-31-2-137.ca-central-1.compute.internal do I have to give a short name at end, where this shortname come from my EC2 instance name, right?
... View more
06-09-2017
02:18 AM
thank you so much. even I need more time to digest it, I can tell this is what I need to know. thank you for spending time to help. Robin
... View more
06-08-2017
05:00 AM
Please refer to my attached snapshot for df -h. there is nothing 800gb. do I need to mkfs /dev/xvdb and mkdir /mydir and mount /dev/xvdb to /mydir and may have to change fstab at end. any easy way to deal with ambari-server disk issues? Please help. thanks,
... View more
Labels:
- Labels:
-
Apache Ambari
06-08-2017
02:48 AM
I keep getting this error, no idea why? please help.
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache Hadoop
06-08-2017
01:59 AM
Failed to issue method call: Unit iptables.service not loaded. my question is in order to install ambari, I need to stop and disable iptables, and the package is not installed by default on AWS EC2 instance. I know I can do the yum install iptables-server -y, but I dont need it so I dont want to installit. So, Can I assume that no iptables package install = iptables disabled. then I dont need to do any thing with iptable, right?
... View more
Labels:
- Labels:
-
Apache Ambari
06-06-2017
05:39 AM
both of #openssl s_client -connect ip-172-31-13-143.us-west-2.compute.internal:8440 and 8441 have no output.
... View more
06-06-2017
05:36 AM
#hostname -f
ip-172-31-13-143.us-west-2.compute.internal #telnet ip-172-31-13-143.us-west-2.compute.internal8440
telnet: ip-172-31-13-143.us-west-2.compute.internal8440: Name or service not known
ip-172-31-13-143.us-west-2.compute.internal8440: Unknown host # telnet ip-172-31-13-143.us-west-2.compute.internal8441
telnet: ip-172-31-13-143.us-west-2.compute.internal8441: Name or service not known
... View more
06-06-2017
05:27 AM
Thank you so much Jay. this is the file I modified by replace localhost to $hostname -f on all nodes here. Also, I started agent on all nodes, no problem. so how do I know if agent can communicate with master?
... View more
06-06-2017
05:23 AM
sorry, I terminated all nodes, however, I know this alert info: connection failed to http://ip-172-31-15-246.us-west-2.compute.internal on every node. some ports just dont open and directories not exist. my passwdless setup correctly for sure. however, connection refused, it is IP problem? if need static IP, limitation is 5. I have more than 10 nodes. Can you help?
... View more
06-06-2017
05:10 AM
Since the connection refused error stopped me everytime, I decided to install ambari.repo and yum install ambari-agent on each node in cluster. so install cluster, 3 hosts failed at the first step--confirm hosts. only ambari-server host passed it. the error was, failed to reconnect to local server as above. so seemed ip on the node have changed. I may have to use the VCP to have static IP, right? if yes, the limit for VCP is 5, I have over 10 nodes here. Also, the host check has no warnings... Please help. thanks,
... View more
Labels:
- Labels:
-
Apache Ambari
06-05-2017
02:21 PM
I have installed HDP many times, always have an issue-- connection failed and connection refused when start ambari metrics. 1. during ambari install, I have ignored the ams_env warning msg. if I follow the recommended number, it will come up with newer number, I kept change for many time, one time I even changed 7 times, still not get it pass. so I ignored this warning, should I? 2. all ambari-metrics-collcter and ambari-metrics-monitor are installed, I can see from rpm -qa|grep ambari-collecter 3. finally, ambari-metrics just dont start, get all connection failed error. so seemed the configuration cause the connection failed, so any one have good combination to edit ams_env and ams_habase_env? thank you very much.
... View more
Labels:
- Labels:
-
Apache Ambari
06-05-2017
02:03 PM
oh, my ambari service is external mode, use http://ec2....amazon.com:8080, if this is so called external mode. after ambari-server reset, I got the same error. the repo is: http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.0.1/ambari.repo rpm -qa|grep ambari-metrics -- show all packages are installed. so the hbase configure info is the problem. it refuse the connection. I dont need to edit hbase configure with ambari 2.4 in the past at all. any changes on HDP? thanks, Robin
... View more
06-05-2017
02:02 PM
oh, my ambari service is external mode, use http://ec2....amazon.com:8080, if this is so called external mode. after ambari-server reset, I got the same error. the repo is: http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.0.1/ambari.repo rpm -qa|grep ambari-metrics -- show all packages are installed. so the hbase configure info is the problem. it refuse the connection. I dont need to edit hbase configure with ambari 2.4 in the past at all. any changes on HDP? thanks, Robin
... View more
06-05-2017
02:02 PM
oh, my ambari service is external mode, use http://ec2....amazon.com:8080, if this is so called external mode. after ambari-server reset, I got the same error. the repo is: http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.0.1/ambari.repo rpm -qa|grep ambari-metrics -- show all packages are installed. so the hbase configure info is the problem. it refuse the connection. I dont need to edit hbase configure with ambari 2.4 in the past at all. any changes on HDP? thanks, Robin
... View more
06-05-2017
02:02 PM
oh, my ambari service is external mode, use http://ec2....amazon.com:8080, if this is so called external mode. after ambari-server reset, I got the same error. the repo is: http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.0.1/ambari.repo rpm -qa|grep ambari-metrics -- show all packages are installed. so the hbase configure info is the problem. it refuse the connection. I dont need to edit hbase configure with ambari 2.4 in the past at all. any changes on HDP? thanks, Robin
... View more
06-05-2017
01:42 PM
on my ambari server and datanode, both has no ambari-metrics-collector found. [root@ip-172-31-1-92 etc]# ls -lart /etc/ambari-metrics-collector ls: cannot access /etc/ambari-metrics-collector: No such file or directory [root@ip-172-31-12-243 etc]# ls -lart /etc/ambari-metrics-collector
ls: cannot access /etc/ambari-metrics-collector: No such file or directory 1. when add volume 100gb or 800gb, I still have the same connection problem 2. this time I didnt add new volume to EBS, only 20gb for root /. 3. I reinstall more than 10 time, always the same issue here. so do you think my security settings is the problem, here is my security group info, all TCP, HTTP all ports, all UDP and all traffice with anywhere access. this is wide open security. Do you think the adding ambari id_rsa.pub to authorized_keys on all servers for passwdless is enough? From ambari server, I can passweless ssh to all other servers including itself, do you think I need to make passwdless between all nodes? thank you for your help.
... View more
06-05-2017
11:31 AM
605-hdp-error.txt on one of the datanode. no ambari-metrics-collector.log on ambari server. this is metrics log file. I only have 20g for root dir /, no EBS storage assigned here
... View more
06-05-2017
11:17 AM
you are right, Jay. the problem is port connection refused. however, my /tmp is 777, so what to do from here. [root@ip-172-31-12-243 /]# free -m total used free shared buffers cached
Mem: 16077 390 15687 3 81 74
-/+ buffers/cache: 234 15843
Swap: 0 0 0 [root@ip-172-31-12-243 /]# hostname -f
ip-172-31-12-243.ca-central-1.compute.internal telnet ip-172-31-1-92.ca-central-1.compute.internal 3000 Trying 52.60.67.105... telnet: connect to address 52.60.67.105: Connection refused [root@ip-172-31-12-243 /]# telnet ip-172-31-1-92.ca-central-1.compute.internal 16000 Trying 52.60.67.105...
telnet: connect to address 52.60.67.105: Connection refused thank you so much for helping here.
... View more
06-05-2017
04:15 AM
I only can upload one of my error file. but most of them are the same, connection failed...and seemed ambari-metrics never can be started. Only ZK is up. so please help. thank you very much.
... View more
06-05-2017
04:12 AM
I used the following to setup these recommended size I found in this community help ams-env :: collector_heapsize = 2048 ams-hbase-env :: hbase_master_heapsize = 512 ams-hbase-env :: hbase_master_xmn = 102 ams-hbase-env :: hbase_regionserver_heapsize = 4096 ams-hbase-env :: regionserver_xmn_size = 512 604-hdp-errors.png
... View more
06-05-2017
02:50 AM
Thank you so much, let me try it and get back to you in couple of hours.
... View more
06-05-2017
02:26 AM
Amabari installation failed, the log file -->Installing package ambari-metrics-collector ('/usr/bin/yum -d 0 -e 0 -y install ambari-metrics-collector') command failed on my ambari server, I dont have this package at all, anyhow, I copied HDP.repo, HDP-UTILS.repo to all data nodes then try to manually install ambari-metrics-collector, but I got following: /usr/bin/yum -d 0 -e 0 -y install ambari-metrics-collector
Error: Nothing to do /usr/bin/yum -d 0 -e 0 -y install ambari-metrics-collector
Error: Nothing to do these errors repeatedly stop my installation. and my repo was downloaded as: 1. I used this repo -->http://public-repo-1.hortonworks.com/ambari/centos6/2.x/updates/2.4.0.1/ambari.repo 2. I used AWS/EC2 m4.large with 20 gb for Root dir and 100 EBS(magnetic), I ignored the metrics_collector_heap_size warnings, because if I following the recommended size to modify them, it would come up with the second recommended size, I even go as far as 7 times edit these numbers, never done with it. 3. Any one had the same issue like mine? if the storage setting here matters? what is a good combination of these metrics_collector_heap_size, xmn_size and master_heap_size? 4. the type of instance are different, different RAM, these heap_size will depends, what to follow? isnt this HDP job? thank so much for you help.
... View more
Labels:
- Labels:
-
Apache Ambari
06-01-2017
04:14 AM
you are right, Since my setting all working before I never use VCP and just thought about this is may be a cause. I will work on it and get back to you. thanks again.
... View more
06-01-2017
03:44 AM
Thank you so much for this quick respond. why are IP and hostname changed? my network.hostname=localhost, no change on /etc/sysconfig/network file. did I missed anything? is this AWS/EC2 networking DNS problem? I already created a ticket to AWS support. do you think I can fix it on my side.
... View more