Member since
03-18-2016
12
Posts
2
Kudos Received
0
Solutions
04-26-2016
02:04 PM
Predrag, thanks! Your sample can explain the spark screenshot I shared below; however, it cannot explain the the hive job below: the last 2 lines, 3 containers and 4 containers allocated same amount of mem.
... View more
04-26-2016
02:06 AM
It is common case, look at first line here, 5 containers take 59GB.
... View more
04-26-2016
12:58 AM
Please see my post below, sorry, posted it as answer.
... View more
04-26-2016
12:58 AM
Ben, "Thank you for your reply. "So if you request 5GB you get 8GB in your case." ---- this is the most confusing part, I do not think it is true. Please refer to the screenshot I uploaded. My cluster yarn.scheduler.minimum-allocation-mb = 6.5GB, look at the last 2 lines: 4 containers have 26624 GB and 3 containers also have 26624GB meaning each container is 6.5G and 8.5G respectively, not necessarily 6.5GB, 13GB... Thought?
... View more
04-25-2016
07:46 PM
1 Kudo
I am tuning our Yarn cluster which hosts tez, mapreduce and Spark-on-Yarn. I observed during busy hour, all Yarn cluster memory is used up while lots of cores are free--- which leads me to believe that we should decrease minimum container size. Meanwhile, lots of applications specify their memory, -xms=4GB etc; considering container overhead memory, would not it requires container size bigger than 4G?
If I set yarn.scheduler.minimum-allocation-mb = 4GB, Spark executor size is 4GB as well, would it actually assign 2 containers/8GB to the Spark executor or yarn is smart enough to allocate around 5GB? Even in HWX tutorial, I think something not right: In https://community.hortonworks.com/content/kbentry/14309/demystify-tez-tuning-step-by-step.html hive.tez.container.size is multiple of yarn.scheduler.minimum-allocation-mb, why so? if yarn.scheduler.maximum-allocation-mb = 24GB, yarn.scheduler.minimum-allocation-mb = 4GB, hive.tez.container.size=5B, would not Yarn smart enough to assign 5GB to a container to satisfy tez needs?
... View more
Labels:
- Labels:
-
Apache Spark
-
Apache YARN
04-25-2016
07:19 PM
Ancil, I have question regarding: hive.tez.container.size is multiple of yarn.scheduler.minimum-allocation-mb, why so? if yarn.scheduler.maximum-allocation-mb = 24GB, yarn.scheduler.minimum-allocation-mb = 4GB, hive.tez.container.size=5B, would not Yarn smart enough to assign 5GB to a container to satisfy tez needs? Thanks, Richard
... View more
03-24-2016
02:31 PM
We are able to restart Yarn App Timeline server. Further investigation indicates it leaks memory:
yarn 32397 8.4 43.6 8432880 6680964 ? Sl Mar23 101:19 /usr/jdk64/jdk1.8.0_60/bin/java -Dproc_timelineserver -Xmx1024m -Dhdp.version=2.3.4.0-3485 Configured with 1GB max but now using 6GB, thought?
... View more
03-21-2016
05:31 PM
1 Kudo
No error/exception found in log, only a few lines of warning: 2016-03-21 06:49:49,236 INFO timeline.LogInfo (LogInfo.java:parseForStore(76)) - Parsed 0 entities from hdfs://hostname/ats/active/application_1458130432014_0671/appattempt_1458130432014_0671_000001/summarylog-appattempt_1458130432014_0671_000001 in 11 msec 2016-03-21 06:49:49,246 WARN timeline.EntityGroupFSTimelineStore (LogInfo.java:doParse(164)) - Error putting entity: dag_1458130432014_0695_1 (TEZ_DAG_ID): 6 2016-03-21 06:49:49,246 WARN timeline.EntityGroupFSTimelineStore (LogInfo.java:doParse(164)) - Error putting entity: dag_1458130432014_0695_1 (TEZ_DAG_ID): 6 2016-03-21 06:49:49,246 WARN timeline.EntityGroupFSTimelineStore (LogInfo.java:doParse(164)) - Error putting entity: dag_1458130432014_0694_1 (TEZ_DAG_ID): 6 2016-03-21 06:49:49,246 WARN timeline.EntityGroupFSTimelineStore (LogInfo.java:doParse(164)) - Error putting entity: dag_1458130432014_0694_1 (TEZ_DAG_ID): 6 2016-03-21 06:49:49,247 INFO timeline.LogInfo (LogInfo.java:parseForStore(76)) - Parsed 6 entities from hdfs://hostname/ats/active/application_1458130432014_0695/appattempt_1458130432014_0695_000001/summarylog-appattempt_1458130432014_0695_000001 in 19 msec 2016-03-21 06:49:49,247 INFO timeline.LogInfo (LogInfo.java:parseForStore(76)) - Parsed 6 entities from hdfs://hostname/ats/active/application_1458130432014_0694/appattempt_1458130432014_0694_000001/summarylog-appattempt_1458130432014_0694_000001 in 19 msec It is HDP 2.3.4.
... View more
Labels:
- Labels:
-
Apache YARN
03-18-2016
05:31 PM
It is Ambari 2.2.0.0, HDP-2.3.4.0-3485
... View more
03-18-2016
04:15 PM
Installing Ranger via Ambari on AWS, got this: 2016-03-18 16:10:44,048 [JISQL] /usr/jdk64/jdk1.8.0_60/bin/java -cp /usr/share/java/mysql-connector-java.jar:/usr/hdp/current/ranger-admin/jisql/lib/* org.apache.util.sql.Jisql -driver mysqlconj -cstring jdbc:mysql://mysqldb/ranger -u 'user' -p '********' -noheader -trim -c \; -input /usr/hdp/current/ranger-admin/db/mysql/patches/007-updateBlankPolicyName.sql Error executing: CREATE FUNCTION `getTempPolicyCount`(assetId bigint, resId bigint) RETURNS int(11) BEGIN DECLARE tempPolicyCount int default 1; DECLARE dbResourceId bigint; DECLARE exitLoop int DEFAULT FALSE; DECLARE policyList CURSOR FOR SELECT id from x_resource where asset_id = assetId; DECLARE CONTINUE HANDLER FOR NOT FOUND SET exitLoop = true; OPEN policyList; readPolicy : LOOP FETCH policyList into dbResourceId; IF exitLoop THEN set tempPolicyCount = tempPolicyCount + 1; LEAVE readPolicy; END IF; IF (resId = dbResourceId) THEN LEAVE readPolicy; END IF; set tempPolicyCount = tempPolicyCount + 1; END LOOP; CLOSE policyList; RETURN tempPolicyCount; END
java.sql.SQLException: This function has none of DETERMINISTIC, NO SQL, or READS SQL DATA in its declaration and binary logging is enabled (you *might* want to use the less safe log_bin_trust_function_creators variable)
SQLException : SQL state: HY000 java.sql.SQLException: This function has none of DETERMINISTIC, NO SQL, or READS SQL DATA in its declaration and binary logging is enabled (you *might* want to use the less safe log_bin_trust_function_creators variable) ErrorCode: 1418
2016-03-18 16:10:44,438 [E] 007-updateBlankPolicyName.sql import failed!
... View more
Labels:
- Labels:
-
Apache Ranger