Member since: 06-21-2016
Posts: 40
Kudos Received: 1
Solutions: 1

My Accepted Solutions

Title | Views | Posted
---|---|---
 | 552 | 05-11-2017 03:42 PM
06-15-2022
06:26 AM
Operator error: I forgot the balancer is an hdfs subcommand, not a standalone binary in /usr/bin.
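For anyone who lands here, a minimal sketch of the invocation (the threshold value is illustrative, not from the original post):

```bash
# The balancer ships as a subcommand of the hdfs CLI rather than its own binary:
which hdfs                    # typically /usr/bin/hdfs or /usr/hdp/current/hadoop-client/bin/hdfs
hdfs balancer -threshold 10   # rebalance until DataNodes are within 10% of the cluster average
```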
06-14-2022
01:25 PM
We had an HDP 2.5.3.0 cluster with an HDFS balancer CLI command that I cannot find on HDP 3.1.5.0. I am adding new nodes to the 3.1.5 cluster and it appears there is some HDFS balancing going on, albeit slowly. Did the CLI go away with 3.1.5.0, or am I missing something?
Labels:
- HDFS
- Hortonworks Data Platform (HDP)
03-24-2022
11:32 AM
I have a small cluster running HDP 3.1.5.6091 with Ambari 2.7.5.0 that I inherited.
The YARN Timeline Service starts but gets a constant critical alert:
ATSv2 HBase Application The HBase application reported a 'FAILED' state. Check took 2.261s
The hadoop-yarn-timelinereader.log shows
failed on connection exception: org.apache.hbase.thirdparty.io.netty.channel.AbstractChannel$AnnotatedConnectException: Connection refused:server.name:17020
I added that port (17020) to the firewall on the server in question and restarted the Timeline Service V2.0 Reader, but I am still getting the same error.
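The firewall change was along these lines (firewalld syntax assumed; the port number comes from the error above):

```bash
# Open the ATSv2 embedded HBase port named in the connection-refused error,
# then confirm something is actually listening on it on that host:
firewall-cmd --permanent --add-port=17020/tcp
firewall-cmd --reload
ss -tlnp | grep 17020   # if nothing listens here, the HBase app itself is down, not the firewall
```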
Any suggestions would be appreciated
02-22-2021
07:53 AM
I am running HDP 2.5.3.0 with Ambari 2.4.2.0.
I have a cluster node/host that died over the weekend, and I am not able to resurrect it.
Through Ambari I am able to put it in maintenance mode.
I would like to delete this node from the cluster. Its role was DataNode and YARN NodeManager.
When choosing the Delete option under Host Actions, I get a warning that the DataNode and NodeManager should be decommissioned first to prevent data loss, but those options are not available via Ambari.
Is there another option for decommissioning or is my only option to delete given the status of the node?
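For reference, the manual decommission path looks roughly like this (a sketch only: the exclude-file locations are the usual HDP defaults, so confirm dfs.hosts.exclude and yarn.resourcemanager.nodes.exclude-path in your configs, and note that a host that is already dead may never report fully decommissioned):

```bash
# Add the dead host to the HDFS and YARN exclude files, then ask the masters to re-read them:
echo "deadnode.example.com" >> /etc/hadoop/conf/dfs.exclude   # hostname is illustrative
hdfs dfsadmin -refreshNodes    # run as the hdfs superuser, against the NameNode

echo "deadnode.example.com" >> /etc/hadoop/conf/yarn.exclude
yarn rmadmin -refreshNodes     # run against the ResourceManager
```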
Thanks.
12-20-2018
11:03 PM
Fixed the problem: one of the file systems was at 100%. I deleted some old user cache data and things are working again.
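A sketch of that cleanup, assuming the usercache path from my other threads; check which filesystem is actually full before deleting anything:

```bash
df -h                                        # find the filesystem at 100%
du -sh /var/hadoop/yarn/local/usercache/*    # see which users' caches hold the space
# Remove cache only for applications that are no longer running
# (<user> and <id> are placeholders):
rm -rf /var/hadoop/yarn/local/usercache/<user>/appcache/application_<id>
```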
12-20-2018
10:33 PM
I installed patches and bounced a node on my cluster, and now the DataNode won't start:

2018-12-20 14:26:07,962 ERROR datanode.DataNode (BPServiceActor.java:run(772)) - Initialization failed for Block pool <registering> (Datanode Uuid fa50d7aa-c305-47bd-9935-6d8f947e0d27) service to m02.pnl.gov/192.168.41.52:8020. Exiting.
java.io.IOException: All specified directories are failed to load.

RedHat 6.10, Ambari 2.5.0.3, Stack 2.7.3.2.6
Labels:
- Apache Hadoop
09-12-2018
09:50 PM
Thanks, this did work for me! Is there a way to configure the hadoop cluster to use a specific installed version of python?
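For later readers: the usual knobs here are the PYSPARK_PYTHON and PYSPARK_DRIVER_PYTHON environment variables. A sketch of that approach (paths assumed from my other threads, not confirmed as the exact fix):

```bash
# Pin the interpreter Spark uses on both the driver and the YARN executors.
# Add to /etc/spark/conf/spark-env.sh on every node:
export PYSPARK_PYTHON=/usr/local/bin/python2.7
export PYSPARK_DRIVER_PYTHON=/usr/local/bin/python2.7
```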
09-11-2018
09:50 PM
I have two versions of Python installed (2.6 and 2.7). Spark jobs run through the pyspark shell pick up one version of Python (2.7). Jobs submitted to the cluster via YARN pick up the 2.6 version. How can I get YARN jobs to point at the 2.7 version?
Tags:
- Hadoop Core
- python
04-23-2018
10:28 PM
I lost a drive on one of my data nodes that apparently stored some ambari-metrics collector data. Besides the Hadoop directory structure that was created when the DataNode was restarted, I see a /var/lib/ambari-metrics-collector directory that was created as well. I replaced the drive and everything is back up except the Metrics Collector. I am getting these errors:

22:22:12,598 WARN [main-SendThread(datanode03.foo:61181)] ClientCnxn:1146 - Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused

22:22:04,011 WARN [main] DefaultPhoenixDataSource:84 - Unable to connect to HBase store using Phoenix.
java.sql.SQLException: org.apache.hadoop.hbase.client.RetriesExhaustedException: Can't get the locations
Labels:
- Apache Ambari
03-22-2018
06:06 PM
Rebooted my cluster last week and I am seeing unusually high CPU loads on 3 datanodes. Nothing changed except the reboot. Their load averages are 120-150, while the others have a load of less than 5. Looking at top, it seems to be YARN java processes that are running. All nodes are running Java 1.7.0, HDP 2.5.3, and Ambari 2.4.2: 37 datanodes, all configured the same (Dell R515 servers, two 8-core AMD CPUs, RHEL 6 64-bit, 256GB memory, same hard drives). The 3 nodes in question differ only by CPU: they run the AMD Opteron 4386 versus the AMD Opteron 4284 in all the other datanodes. I have been running this cluster for over a year with no problems. Any ideas as to why this is happening? I have looked at the BIOS settings to see if some were configured differently, but they are all the same.
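In case it helps anyone triage the same symptom, a hedged sketch of pinning down which JVMs are burning the CPU (nothing here is specific to my setup):

```bash
# Identify the hottest java processes, then capture a thread dump from one of them:
top -b -n 1 | head -n 20                       # confirm the top consumers are YARN containers
ps -eo pid,etime,pcpu,args --sort=-pcpu | head -n 10
sudo -u yarn jstack <pid> > /tmp/container.jstack   # <pid> is a placeholder; requires a JDK on the node
```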
Tags:
- cpu
03-07-2018
03:05 PM
My cluster nodes have two versions of Python installed: 2.6.6 in /usr/bin and 2.7.12 in /usr/local/bin. I installed some Python modules for the 2.7.12 version associated with geolocation. When I run a job locally on one of the nodes it runs fine. When it is submitted through YARN I get the following: "ImportError: No module named ipaddress". ipaddress is one of the modules I installed. I suspect YARN is using the 2.6.6 version of Python. How can I determine if this is the case, and if it is, how can I point YARN at the Python in /usr/local/bin? Thanks
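A hedged way to answer the "how can I determine" part, assuming the jobs go through PySpark as mine do (the script path is illustrative):

```bash
cat > /tmp/pyver.py <<'EOF'
# Prints the Python version used by a YARN executor, not just the driver.
import sys
from pyspark import SparkContext

sc = SparkContext()
print(sc.parallelize([0]).map(lambda _: sys.version).collect())
sc.stop()
EOF
spark-submit --master yarn /tmp/pyver.py
```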
Labels:
- Apache YARN
03-01-2018
05:05 PM
NameNode Heap Usage (weekly) alert: I get these periodically. Critical is set to 60%, warning at 50%, and minimum heap is set to 1000M. The variance for this alert is 504MB, which is 66% of the 7560MB average (4536MB is the limit). Is this a concern? If so, what should I be looking at?
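One way to see the actual heap numbers behind the alert is the NameNode's JMX servlet (host name is illustrative; 50070 is the usual non-HA HTTP port on HDP 2.x):

```bash
# Dump current and max heap from the NameNode JVM:
curl -s 'http://namenode.example.com:50070/jmx?qry=java.lang:type=Memory' | python -m json.tool
```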
Labels:
- Apache Hadoop
01-26-2018
05:22 PM
Running Ambari 2.4.2 and HDP 2.5.3.0. It looks like my cluster is configured to use MySQL for the Hive metastore. The log file is being written to /var/run/mysqld; it is 239GB, growing, and close to filling up the /var file system. Can I delete or rotate out the log, and do I need to shut down Hive to do this? Is there a way to reduce the level of logging, or to rotate this log automatically?
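A sketch of an in-place rotation, assuming the growing file turns out to be one of MySQL's own logs (verify which log it is first; nothing here requires stopping Hive):

```bash
# Confirm which logging is enabled and where it writes:
mysql -e "SHOW VARIABLES LIKE '%log%';"
# Rotate the file out from under mysqld, then make it reopen its logs:
mv /var/run/mysqld/mysqld.log /var/run/mysqld/mysqld.log.1   # file name assumed
mysql -e "FLUSH LOGS;"
rm /var/run/mysqld/mysqld.log.1   # once you are sure nothing still needs it
```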
Labels:
- Apache Hive
01-03-2018
06:13 PM
Running a job in Hive writes cache data to /var/hadoop/yarn/local/usercache. This is causing the /var file system to fill up, resulting in NodeManager failures on some nodes and the job hanging. Is it possible to direct the usercache to a different location?
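For anyone searching later: the usercache lives under yarn.nodemanager.local-dirs, so pointing that property at a bigger filesystem moves it. A sketch (paths illustrative; change it through Ambari so it persists, then restart the NodeManagers):

```bash
# yarn-site.xml property controlling where NodeManagers write container data,
# including usercache (shown as a comment; edit via Ambari, not by hand):
#   yarn.nodemanager.local-dirs = /data1/hadoop/yarn/local,/data2/hadoop/yarn/local
# After restarting the NodeManagers, confirm the cache moved:
du -sh /data1/hadoop/yarn/local/usercache 2>/dev/null
```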
Tags:
- Hive
- nodemanager
- YARN

Labels:
- Apache Hive
- Apache YARN
12-21-2017
07:33 PM
Running this in the Ambari Hive 2.0 view. Trying to join two tables into a new table. The query seems to run, then stalls at 100% and returns with an error, but no specific error message is provided in the Hive view. Where can I find the error message, or at least the logs associated with the job ID the Hive view provides?
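Two places worth checking, as a hedged sketch (the log path is the usual HDP default; replace the application ID placeholder with the one the view reports):

```bash
# HiveServer2's own log often has the real failure:
tail -n 200 /var/log/hive/hiveserver2.log
# Aggregated YARN container logs for the job behind the query:
yarn logs -applicationId application_<cluster_ts>_<id>
```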
Labels:
- Apache Ambari
- Apache Hive
09-25-2017
10:05 PM
Adding new nodes to my cluster, I have one where I cannot get the ambari-agent to start. RHEL 6.9, HDP-2.6.0.3, Ambari 2.5.0.3, ambari-agent-2.5.0.3-7.x86_64. I get the following error in the ambari-agent log:

ERROR 2017-09-25 14:59:26,625 Controller.py:502 - Controller thread failed with exception:
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/ambari_agent/Controller.py", line 489, in run
    self.register = Register(self.config)
  File "/usr/lib/python2.6/site-packages/ambari_agent/Register.py", line 35, in __init__
    self.hardware = Hardware(self.config)
  File "/usr/lib/python2.6/site-packages/ambari_agent/Hardware.py", line 52, in __init__
    self.hardware.update(Facter(self.config).facterInfo())
  File "/usr/lib/python2.6/site-packages/ambari_agent/Facter.py", line 571, in facterInfo
    facterInfo = super(FacterLinux, self).facterInfo()
  File "/usr/lib/python2.6/site-packages/ambari_agent/Facter.py", line 248, in facterInfo
    'ipaddress': self.getIpAddress(),
  File "/usr/lib/python2.6/site-packages/ambari_agent/Facter.py", line 81, in getIpAddress
    return socket.gethostbyname(self.getFqdn().lower())
gaierror: [Errno -2] Name or service not known
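Since the traceback dies in socket.gethostbyname() on the agent's own FQDN, a quick sanity check of name resolution on the failing node:

```bash
hostname -f                      # the FQDN the agent will look up
getent hosts "$(hostname -f)"    # does it resolve via /etc/hosts or DNS?
# Same call the agent makes:
python -c 'import socket; print(socket.gethostbyname(socket.getfqdn().lower()))'
```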
Labels:
- Apache Ambari
09-18-2017
09:44 PM
A DataNode service on one of my cluster nodes is down because I lost a hard drive/file system. I have a drive on order, but it may be a couple of days until I have it in hand. What is the impact of continuing to run in this state?
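A hedged way to watch the actual exposure while the node is down (with the default replication factor of 3, blocks from the dead node should re-replicate on their own):

```bash
# Replication health: look for under-replicated, missing, or corrupt blocks
hdfs fsck / | grep -iE 'under-replicated|missing|corrupt'
hdfs dfsadmin -report | head -n 30   # dead node count and remaining capacity
```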
Tags:
- Hadoop Core
- HDFS

Labels:
- Apache Hadoop
07-31-2017
04:59 PM
Running HDP 2.5.3. The root file system filled up on the node that runs the Hive metastore and HiveServer2, and both processes stopped. I cleared up disk space but could not get either process to start.

HiveServer2 error: yarn.exceptions.ApplicationNotFoundException: Application with id "application_1496410945618_25770" doesn't exist in RM
Hive metastore error: transport exception, could not create ServerSocket

I suspect it was a job that didn't finish or die cleanly because of the disk filling up, but there were no jobs running in the YARN RM. I tried stopping all the Hive processes, restarting YARN, and then starting Hive; I still had the same problem. I resorted to stopping all the services on the node where the disk filled up and bouncing the node. This fixed the problem, as HiveServer2 and the Hive metastore now run. Any ideas for a better way to troubleshoot and resolve something like this?
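For the "could not create ServerSocket" part specifically, a hedged check that a stale process isn't still holding the metastore port (9083 is the default Hive metastore port):

```bash
netstat -tlnp | grep 9083   # anything listed here owns the port
# If a dead or hung metastore JVM still holds it, stop that PID
# before asking Ambari to start the metastore again.
```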
Labels:
- Apache Hive
- Apache YARN
- Cloudera Manager
06-12-2017
03:14 PM
Thanks, adding the jar to HIVE_AUX_JARS_PATH in hive-env.sh got the SerDe working in Zeppelin.
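For later readers, the change was along these lines (the jar path is the one from my question below; if you already set HIVE_AUX_JARS_PATH, append rather than overwrite):

```bash
# In hive-env.sh, point HIVE_AUX_JARS_PATH at the HCatalog core jar that provides the JsonSerDe:
export HIVE_AUX_JARS_PATH=/usr/hdp/2.5.3.0-37/hive2/lib/hive-hcatalog-core.jar
```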
05-30-2017
05:18 PM
Trying to get the SerDe working with Zeppelin and/or the spark shell. Running the following in either:

val links = sqlContext.sql("SELECT * FROM test_links LIMIT 10")
links.show()

produces an error:

ERROR hive.log: error in initSerDe: java.lang.ClassNotFoundException Class org.apache.hive.hcatalog.data.JsonSerDe not found
java.lang.ClassNotFoundException: Class org.apache.hive.hcatalog.data.JsonSerDe not found

I tried adding this to /etc/spark/conf/hive-site.xml:

<property>
  <name>hive.aux.jars.path</name>
  <value>file:///usr/hdp/2.5.3.0-37/hive2/lib/hive-hcatalog-core.jar</value>
</property>

but am still getting the error.
Labels:
- Apache Spark
- Apache Zeppelin
05-19-2017
07:28 PM
I can run the CLI, getting some read-only database errors. I get the scala> prompt and can issue commands (import sys.process._ and "ls -al".!) which work. Trying these same things from the Zeppelin notebook causes the job to hang (constantly in a running 0% state).
05-18-2017
06:55 PM
Running spark or pyspark from the Zeppelin notebook hangs. Looking in the zeppelin-interpreter-spark log, I am seeing the following:

ERROR [2017-05-18 11:51:39,130] ({BoneCP-pool-watch-thread} PoolWatchThread.java[fillConnections]:118) - Error in trying to obtain a connection. Retrying in 7000ms
java.sql.SQLException: A read-only user or a user in a read-only database is not permitted to disable read-only mode on a connection.
at org.apache.derby.impl.jdbc.SQLExceptionFactory40.getSQLException(Unknown Source)
at org.apache.derby.impl.jdbc.Util.generateCsSQLException(Unknown Source)
at org.apache.derby.impl.jdbc.TransactionResourceImpl.wrapInSQLException(Unknown Source)
at org.apache.derby.impl.jdbc.TransactionResourceImpl.handleException(Unknown Source)
at org.apache.derby.impl.jdbc.EmbedConnection.handleException(Unknown Source)
at org.apache.derby.impl.jdbc.EmbedConnection.setReadOnly(Unknown Source)
at com.jolbox.bonecp.ConnectionHandle.setReadOnly(ConnectionHandle.java:1324)
at com.jolbox.bonecp.ConnectionHandle.<init>(ConnectionHandle.java:262)
at com.jolbox.bonecp.PoolWatchThread.fillConnections(PoolWatchThread.java:115)
at com.jolbox.bonecp.PoolWatchThread.run(PoolWatchThread.java:82)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.sql.SQLException: A read-only user or a user in a read-only database is not permitted to disable read-only mode on a connection.
at org.apache.derby.impl.jdbc.SQLExceptionFactory.getSQLException(Unknown Source)
at org.apache.derby.impl.jdbc.SQLExceptionFactory40.wrapArgsForTransportAcrossDRDA(Unknown Source)

I have Oozie installed and have verified the ownership of the oozie-db directory (oozie:hadoop).
Labels:
- Apache Spark
- Apache Zeppelin
05-18-2017
02:11 PM
HDP 2.5.3, Ambari 2.4.2, 18 data nodes, 190TB. HDFS disk usage is at about 92% (~15TB free), with critical alarms or warnings on almost all the data nodes. "Percent DataNodes With Available Space" is alarming as well. Are there best-practice recommendations for setting these thresholds and for managing the percentage of HDFS disk usage? Are there concerns with running HDFS disk usage above a certain percentage?
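A quick snapshot of how evenly the usage is spread, which is what the "Percent DataNodes With Available Space" alert is really about:

```bash
# Per-DataNode utilization from the NameNode's point of view:
hdfs dfsadmin -report | grep -E '^Name:|DFS Used%'
```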
Labels:
- Apache Hadoop
05-11-2017
03:42 PM
I figured out what was wrong. There are two HDFS configuration groups on this cluster, and one is set up for the datanodes. I just needed to add the new servers to that group.
05-11-2017
03:17 PM
HDP 2.5.3. I have a cluster that has 34 datanodes, each with (11) 1.2TB disks for HDFS. I added three new nodes, but these only have (9) 1.2TB disks for HDFS. The new datanodes have been added, but it seems that not all the file systems are seen by HDFS. When I look at one of the older datanodes, in hdfs-site.xml all the file systems (disk1 through disk11) are listed under dfs.datanode.data.dir. On the new nodes, only disk1 through disk6 are listed, even though disk7, disk8, and disk9 are configured and mounted as file systems. Question: how do I get these nodes to recognize the other disks? Can I edit hdfs-site.xml and add them to the list? If so, what are the steps? I don't seem to be able to do this through Ambari.
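For context (the accepted answer above explains why hand-edits don't stick): dfs.datanode.data.dir is just a comma-separated list, managed per Ambari configuration group. A hedged illustration of what the new nodes' group would need:

```bash
# hdfs-site.xml property; the new nodes' configuration group needs all nine
# mounts listed, comma-separated (paths illustrative):
#   dfs.datanode.data.dir = /disk1/hadoop/hdfs/data,...,/disk9/hadoop/hdfs/data
# Quick check on a new node of which mounts exist for the DataNode to use:
ls -d /disk*/hadoop/hdfs/data
```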
Tags:
- Hadoop Core
- HDFS

Labels:
- Apache Hadoop
05-09-2017
05:38 PM
HDP 2.5.3, Spark 1.6.x.2.5, Zeppelin Notebook 0.6.0.2.5. Just installed Zeppelin, trying to run some of the examples. On the Zeppelin notebook server I have Python 2.7 installed (/usr/bin/python). On the other nodes in the cluster I have Python 2.6 (/usr/bin/python) and 2.7 (/usr/local/bin/python) installed. I am running the "Hello World" example in Zeppelin. Most steps run, but some fail with this error:

Exception: Python in worker has different version 2.6 than that in driver 2.7, PySpark cannot run with different minor versions

In the stack trace I can see the node that I assume is the worker. It is not a Spark client, but I created /etc/spark/conf/spark-env.sh with the following:

export PYSPARK_PYTHON=/usr/local/bin/python
export PYSPARK_DRIVER_PYTHON=python

pointing to Python 2.7, the same version that is running on the Zeppelin notebook server. I am still getting the error. I am leery about removing Python 2.6 on the other nodes, but am not sure how to get around this.
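One more knob worth knowing about, flagged as an assumption rather than a confirmed fix: Zeppelin's Spark interpreter has its own zeppelin.pyspark.python setting for the driver side, while executors still read PYSPARK_PYTHON from spark-env.sh on each node:

```bash
# spark-env.sh on every cluster node (executor side); path illustrative:
export PYSPARK_PYTHON=/usr/local/bin/python
# Zeppelin side (Interpreter settings, not a shell variable): set
# zeppelin.pyspark.python to the matching 2.7 interpreter so the driver
# and the workers agree on the minor version.
```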
Labels:
- Apache Spark
- Apache Zeppelin
05-05-2017
07:19 PM
43 nodes in total. I do not have the metrics_whitelist file in /etc/ambari-metrics-collector/conf.
05-05-2017
07:18 PM
Running embedded mode. Added the ports, restarted iptables and ambari-metrics. Still getting spinners for graphs on the Ambari Metrics page and on individual host pages; everything else seems to be OK. I haven't tried blowing away the /var/lib/ambari-metrics directory. No errors in any of the logs.
05-05-2017
02:40 PM
I think my problem is with a firewall. If I stop the firewall on the Ambari server and the ambari-metrics-collector server, things seem OK. Besides 6188, are there any other ports that AMC uses?
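Based only on what appears in the logs on this page (not an exhaustive port list): 6188 is the collector port from this thread, and 61181 shows up as an embedded ZooKeeper port in a ClientCnxn error in another of my threads. On RHEL 6, opening them looks roughly like:

```bash
# Illustrative iptables rules (RHEL 6 syntax) for the AMS ports seen on this page:
iptables -I INPUT -p tcp --dport 6188 -j ACCEPT    # Metrics Collector (timeline) port
iptables -I INPUT -p tcp --dport 61181 -j ACCEPT   # embedded ZooKeeper port from the ClientCnxn error
service iptables save
```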
05-04-2017
06:36 PM
HDP 2.5.3, 37 datanodes. I had 34 data nodes that are identical with regard to CPU, memory, and storage. I recently added 3 data nodes with the same CPU and memory but less storage (9TB as opposed to 11TB on the older nodes). I ran hdfs balancer, which chugged for a while and moved data to the 3 new nodes. The problem is that 3 file systems on each of the 3 new nodes do not seem to be getting data. I have run the balancer from the CLI and the output states the cluster is balanced. The older nodes (with more storage) are at about 50% disk utilization; the 3 new nodes are at about 30% disk utilization.
Labels:
- Apache Hadoop