Member since: 09-28-2017
Posts: 88
Kudos Received: 3
Solutions: 0
02-20-2019
12:15 PM
I want to disable the second attempt.
02-20-2019
12:11 PM
The server node has 32 GB of RAM and only accepts spark-submit jobs (it does not act as a client or worker). Each worker node is one of two server types, 16 cores with 64 GB or 48 cores with 196 GB, and the worker nodes have only Metrics Monitor and NodeManager installed. All configuration is at the defaults.

When running a large job I don't mind the minute of holdup, but a batch of short jobs should finish in under 1 minute. For example, 500 jobs that each take 30 seconds on one core should be done in under 1 minute when there is enough CPU and RAM to allocate. I think the problem is the delay before the job actually starts: running top in a shell on the worker, I can see my process start 30-60 seconds after the submit is received. During that time I see some Java tasks, mainly related to creating the container.
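The expectation in this post can be checked with a rough back-of-the-envelope model. The sketch below is only an illustration under stated assumptions: jobs run in waves of one job per core, every job pays the full container startup delay, and the cluster size (11 workers of the 48-core type) is hypothetical, since the post does not say how many workers there are.

```python
import math


def estimate_batch_seconds(num_jobs, job_seconds, total_cores, startup_seconds):
    """Rough wall-clock estimate for a batch of single-core jobs.

    Simplifying assumptions (not from the original post): jobs run in
    waves of `total_cores` at a time, and every job pays the full
    container startup delay before its own work begins.
    """
    waves = math.ceil(num_jobs / total_cores)
    return waves * (startup_seconds + job_seconds)


# Hypothetical cluster size: 11 workers of the 48-core type.
total_cores = 11 * 48

# 500 jobs of 30 seconds each, with no delay and with the reported 30-60 s delay.
for startup in (0, 30, 60):
    total = estimate_batch_seconds(500, 30, total_cores, startup)
    print(f"startup delay {startup:>2}s -> batch takes about {total}s")
```

Under these assumptions the batch fits in a single wave, so it takes about 30 seconds with no startup delay and 60 to 90 seconds with the observed 30-60 second delay. In this model the delay alone is enough to push the batch past the 1-minute target, no matter how many cores are free.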
02-20-2019
12:03 PM
When a submitted job fails, it is run again as another attempt. How can I disable the second run? Which configuration parameter must I set?
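One setting that is commonly used for this is the Spark property `spark.yarn.maxAppAttempts`, which limits how many times YARN will attempt the application. It should not be larger than the cluster-wide `yarn.resourcemanager.am.max-attempts`. The sketch below only assembles a `spark-submit` command line with that property set to 1. The helper name, the application file `my_job.py`, and the choice of cluster deploy mode are assumptions for illustration, not details from the post.

```python
import shlex


def spark_submit_command(app_path, max_app_attempts=1, extra_conf=None):
    """Build a spark-submit argument list that caps YARN application attempts.

    `spark_submit_command` is a hypothetical helper; the only Spark
    setting it relies on is the property `spark.yarn.maxAppAttempts`.
    """
    conf = {"spark.yarn.maxAppAttempts": str(max_app_attempts)}
    conf.update(extra_conf or {})

    # Cluster deploy mode is an assumption made for this example.
    cmd = ["spark-submit", "--master", "yarn", "--deploy-mode", "cluster"]
    for key, value in sorted(conf.items()):
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(app_path)
    return cmd


# "my_job.py" is a placeholder application file.
cmd = spark_submit_command("my_job.py")
print(shlex.join(cmd))
```

The same property could instead go into `spark-defaults.conf` so that it applies to every submission.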
Labels: Apache Spark, Apache YARN
02-20-2019
11:15 AM
OK, I just stopped the 3 Hive services. It still bothers me a bit that there are red dots in the UI :). By the way, why is there a dependency on Hive?
02-20-2019
11:02 AM
Thanks for the quick response. Can I delete all of Hive (Hive Metastore, HiveServer2, Hive Client)?

The metrics are useful: I can see some bottlenecks and improve performance. I am still struggling to improve the time it takes to allocate the containers for each job (they are the same every time, and it takes 30-60 seconds per job to allocate them).

By the way, why is Hive a requirement? At the install stage it was selected because I needed Spark and YARN.
02-20-2019
10:37 AM
I have a large cluster installed. Currently it is used only for Spark on YARN jobs. Which services on the master can I remove or uninstall to free up some RAM?

The one server that accepts Spark jobs currently has the following services:
- Timeline Service V1.5 / YARN Master
- History Server / MapReduce2 Master
- Hive Metastore / Hive Master
- HiveServer2 / Hive Master
- Metrics Collector / Ambari Metrics Master
- Grafana / Ambari Metrics Master
- NameNode / HDFS Master
- ResourceManager / YARN Master
- SNameNode / HDFS Master
- Spark2 History Server / Spark2 Master
- Timeline Service V2.0 Reader / YARN Master
- YARN Registry DNS / YARN Master
- ZooKeeper Server / ZooKeeper Master
- DataNode / HDFS Slave
- Metrics Monitor / Ambari Metrics Slave
- HDFS Client / HDFS Client
- Hive Client / Hive Client
- MapReduce2 Client / MapReduce2 Client
- Spark2 Client / Spark2 Client
- Tez Client / Tez Client
- YARN Client / YARN Client
- ZooKeeper Client / ZooKeeper Client

The other nodes (multiple) each have:
- Metrics Monitor / Ambari Metrics Slave
- NodeManager / YARN Slave
02-20-2019
10:28 AM
I found out that the container is reused only while the job is still active, and in my case it is reused between 3 and 10 times. Can I force it to stay alive until its resources (CPU/RAM) are needed by other jobs with different CPU/RAM requirements?
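As far as I understand, a YARN container belongs to a single application, so it can only be kept and reused by jobs that run inside the same long-lived Spark application. Within such an application, Spark's dynamic allocation properties control how long an idle executor (and its container) is kept. The sketch below just renders those properties in `spark-defaults.conf` style. The helper names and the values (4 executors, 600 seconds) are hypothetical, and whether this fits the workload described here is an assumption.

```python
def keep_alive_conf(min_executors, idle_timeout_seconds):
    """Spark properties that keep idle executors (YARN containers) around.

    The property names are standard Spark settings; the values passed in
    are illustrative and not taken from the original post.
    """
    return {
        "spark.dynamicAllocation.enabled": "true",
        # Dynamic allocation on YARN normally relies on the external shuffle service.
        "spark.shuffle.service.enabled": "true",
        # Never shrink below this many executors, even when idle.
        "spark.dynamicAllocation.minExecutors": str(min_executors),
        # Release executors above the minimum only after this much idle time.
        "spark.dynamicAllocation.executorIdleTimeout": f"{idle_timeout_seconds}s",
    }


def to_spark_defaults(conf):
    """Render a property dict as spark-defaults.conf lines (key, space, value)."""
    return "\n".join(f"{key} {value}" for key, value in sorted(conf.items()))


print(to_spark_defaults(keep_alive_conf(min_executors=4, idle_timeout_seconds=600)))
```

With settings like these, executors above the minimum are released only after the idle timeout, which returns their CPU and RAM to YARN for other applications.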
02-18-2019
11:27 AM
works thanks 🙂
02-18-2019
09:30 AM
How can I disable HDD monitoring? All I want is CPU, RAM, and network monitoring.
02-17-2019
12:51 PM
When I run short jobs, the container takes more time to load than the actual job. For example, it sometimes takes over 60 seconds to start my process, because a new container is created for each core in the machine. Is it possible to configure the NodeManager not to kill the container, and to reuse it when the same CPU/RAM is requested?
Labels: Apache Spark, Apache YARN