Member since: 08-08-2017
Posts: 1652
Kudos Received: 30
Solutions: 11
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2003 | 06-15-2020 05:23 AM |
| | 16496 | 01-30-2020 08:04 PM |
| | 2155 | 07-07-2019 09:06 PM |
| | 8369 | 01-27-2018 10:17 PM |
| | 4744 | 12-31-2017 10:12 PM |
09-30-2018
06:54 AM
Hi Sandeep, can you also advise on my thread - https://community.hortonworks.com/questions/222627/how-re-balance-the-partitions-to-available-brokers.html
09-13-2018
09:49 AM
Hi @Michael Bronson, Ambari will give you recommendations on the value to set, but you can set any value that fits your cluster and business logic. Once you save and restart the YARN clients, the changes will be synced to your yarn-site.xml. If you save the configuration after clicking "Proceed Anyway", Ambari will save 100. Please see if this helps you.
09-13-2018
12:02 PM
1 Kudo
@Michael Bronson I will look to create an article about configuring vcores for CPU scheduling when I get time, and I will mention this part there.
09-07-2018
12:49 PM
@Michael Bronson In YARN master mode, executors run inside YARN containers. Spark launches an ApplicationMaster that is responsible for negotiating the containers with YARN. That said, only nodes running a NodeManager are eligible to run executors. First question: the executor logs you are looking for are part of the YARN application logs for the container running on that specific node (yarn logs -applicationId <appId>). Second question: the executor logs a warning when a heartbeat fails to reach the driver because of a network problem or timeout, so this should appear in the executor log, which is part of the application logs. HTH *** If you found this answer addressed your question, please take a moment to log in and click the "accept" link on the answer.
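The log retrieval described above can be sketched as follows. The application id below is a hypothetical placeholder; on a real cluster you would find yours with `yarn application -list`:

```shell
# Hypothetical application id - substitute the one from `yarn application -list`
APP_ID="application_1536200000000_0001"

# Fetch the aggregated YARN logs, which include each executor container's
# stdout/stderr, and save them for inspection
yarn logs -applicationId "$APP_ID" > /tmp/"$APP_ID".log

# Look for heartbeat-related messages from the executors
grep -i "heartbeat" /tmp/"$APP_ID".log
```

This only works once log aggregation has collected the container logs; while a container is still running its logs live locally on the NodeManager.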
09-06-2018
01:09 PM
@Michael Bronson By default, Spark2 uses log level WARN. Set it to INFO to get more context on what is going on in the driver and executors. Moreover, while a container is still running, its log is available locally on the NodeManager. The easiest way is to go to the Spark UI (YARN ApplicationMaster UI) -> click on the Executors tab -> there you should see the stderr and stdout links for the driver and each executor. Regarding the heartbeat WARN, we would need to check what the driver is doing at that point. I think you have already asked another question with more details on the driver and executor.
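One way to switch the level is via log4j.properties in the Spark conf directory. This is a sketch under the assumption of an HDP-style Spark2 install where the conf directory is /etc/spark2/conf; adjust the path for your environment:

```shell
# Assumed conf path for an HDP Spark2 install - adjust for your cluster
cd /etc/spark2/conf

# Create log4j.properties from the shipped template if it does not exist yet
cp -n log4j.properties.template log4j.properties

# Raise the root logger from WARN to INFO so driver/executor logs show more context
sed -i 's/^log4j.rootCategory=WARN/log4j.rootCategory=INFO/' log4j.properties
```

After changing the file, restart the Spark service (or resubmit the application) so the new level takes effect.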
09-12-2018
04:16 PM
So in case we check the GC logs with http://gceasy.io/ and see that the driver isn't doing full garbage collection, what are the next steps we need to take?
09-06-2018
10:01 AM
We need debug logging for the Spark Thrift Server. We have an issue where heartbeats from a datanode machine are not reaching the driver, so that is why we need debug mode enabled for the Spark Thrift Server.
10-25-2018
01:29 PM
@Michael Bronson
The warning message means that the executor is unable to send its heartbeat to the driver (possibly a network issue). This is just a warning, but each failure increments the heartbeat-failure count, and when the maximum number of failures is reached the executor fails and exits with an error. There are two configurations we can tune to avoid this issue:
- spark.executor.heartbeat.maxFailures (default: 60): the number of times an executor will try to send heartbeats to the driver before it gives up and exits (with exit code 56).
- spark.executor.heartbeatInterval (default: 10s): the interval between each executor's heartbeats to the driver. Heartbeats let the driver know that the executor is still alive and update it with metrics for in-progress tasks. spark.executor.heartbeatInterval should be significantly less than spark.network.timeout.
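A minimal sketch of passing these settings at submit time. The jar name and the concrete values here are placeholder assumptions, not recommendations; pick values that suit your network:

```shell
# Heartbeat interval raised from the 10s default; kept well below the
# network timeout, as advised above
spark-submit \
  --master yarn \
  --conf spark.executor.heartbeatInterval=20s \
  --conf spark.network.timeout=300s \
  your-app.jar   # hypothetical application jar
```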
09-05-2018
08:06 AM
@Jay, please let me know if I understand it correctly. Let's say that one of the replicas of spark2-hdp-yarn-archive.tar.gz is corrupted, and I run this CLI: su - hdfs -c "hdfs fsck /hdp/apps/2.6.4.0-91/spark2/spark2-hdp-yarn-archive.tar.gz" - does it actually mean that fsck will replace the bad replica with a good one and the status will finally be HEALTHY?
09-06-2018
07:28 AM
You can tail the NameNode and DataNode logs, and also redirect the output to a temporary log file during the restart: # tail -f <namenode log> > /tmp/namenode-`hostname`.log # tail -f <datanode log> > /tmp/datanode-`hostname`.log