Member since
06-21-2016
40
Posts
1
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1162 | 05-11-2017 03:42 PM |
06-15-2022
06:26 AM
Operator error, I forgot the balancer was an hdfs command and not something in /usr/bin
... View more
06-14-2022
01:25 PM
We had an HDP 2.5.3.0 cluster that had an HDFS balancer CLI command that is not found with HDP 3.1.5.0. I am adding new nodes the the 3.1.5 cluster and it appears there is some HDFS balancing going on, albeit slowly. Did the CLI go away with 3.1.5.0 or am I missing something?
... View more
Labels:
- Labels:
-
HDFS
-
Hortonworks Data Platform (HDP)
03-24-2022
11:32 AM
I have a small cluster running HDP 3.1.5.6091 with Ambari 2.7.5.0 the I inherited
The Yarn Timeline service starts but gets a constant critical alert
ATSv2 HBase Application The HBase application reported a 'FAILED' state. Check took 2.261s
The hadoop-yarn-timelinereader.log shows
failed on connection exception: org.apache.hbase.thirdparty.io.netty.channel.AbstractChannel$AnnotatedConnectException: Connection refused:server.name:17020
I added that port (17020) to the firewall on the server in question, restarted the Timeline Service V2.0 Reader service but still getting the same error.
Any suggestions would be appreciated
... View more
Labels:
02-22-2021
07:53 AM
I am running HDP-2.5.3.0, ambari 2.4.2.0
I have a cluster node/host that died over the weekend and I am not able to resurrect it
Through ambari I am able to put it in maintenance mode.
I would like to delete this node from the cluster. It's role was datanode and yarn Nodemanager.
Went choosing the delete option for Host Actions I get a warning that the DataNode and Nodemanager should be decommissioned first to prevent loss but those options are not available via ambari.
Is there another option for decommissioning or is my only option to delete given the status of the node?
Thanks.
... View more
Labels:
09-12-2018
09:50 PM
Thanks, this did work for me! Is there a way to configure the hadoop cluster to use a specific installed version of python?
... View more
09-11-2018
09:50 PM
I have two versions on python installed (2.6 and 2.7) Spark jobs run thru shell in pyspark are picking up one version of Python (2.7). Jobs submitted to the cluster via yarn are picking up the 2.6 version of python. How can I get yarn jobs to point to the 2.7 version?
... View more
Labels:
- Labels:
-
Apache Spark
-
Apache YARN
06-12-2017
03:14 PM
Thanks, adding the jar to HIVE_AUX_JARS_PATH in hive-env.sh got SerDe working in zeppelin
... View more
05-30-2017
05:18 PM
Trying to get serde working with zeppelin and/or the spark shell. Running the following in either val links =
sqlContext.sql("SELECT * FROM test_links LIMIT 10")
links.show() Produces an error ERROR hive.log: error in initSerDe: java.lang.ClassNotFoundException Class org.apache.hive.hcatalog.data.JsonSerDe not found
java.lang.ClassNotFoundException: Class org.apache.hive.hcatalog.data.JsonSerDe not found Tried added this to /etc/spark/conf/hive-site.xml <property> <name>hive.aux.jars.path</name>
<value>file:///usr/hdp/2.5.3.0-37/hive2/lib/hive-hcatalog-core.jar</value> </property> But am still getting the error.
... View more
Labels:
- Labels:
-
Apache Spark
-
Apache Zeppelin
05-18-2017
02:11 PM
HDP 2.5.3 Ambari 2.4.2 18 data nodes 190TB HDFS disk usage is at about 92% ~15TB free with critical alarms or warnngs on most all the data nodes Percent DataNodes With Available Space is alarming as well Are the best practice recommendations for setting these thresholds, for managing the percent of HDFS disk usage? Are there concerns for running HDFS disk usage above a certain percentage?
... View more
Labels:
- Labels:
-
Apache Hadoop
05-11-2017
03:42 PM
I figured out what was wrong. There are 2 HDFS configuration groups on this cluster. One is set up for the datanodes. I just needed to add the new servers to that group
... View more