<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: hive testbench shuffle error when running tpch setup in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/hive-testbench-shuffle-error-when-running-tpch-setup/m-p/301565#M220748</link>
    <description>&lt;P&gt;./tpch-setup.sh 5 /hive-data-dir-benchmark&lt;BR /&gt;TPC-H text data generation complete.&lt;BR /&gt;Loading text data into external tables.&lt;BR /&gt;WARN: The method class org.apache.commons.logging.impl.SLF4JLogFactory#release() was invoked.&lt;BR /&gt;WARN: Please see &lt;A href="http://www.slf4j.org/codes.html#release" target="_blank"&gt;http://www.slf4j.org/codes.html#release&lt;/A&gt; for an explanation.&lt;BR /&gt;Optimizing table part (1/8).&lt;/P&gt;&lt;P&gt;^CCommand failed, try 'export DEBUG_SCRIPT=ON' and re-running&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench : export DEBUG_SCRIPT=ON&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench : ./tpch-setup.sh 5 /hive-data-dir-benchmark&lt;BR /&gt;+ '[' X5 = X ']'&lt;BR /&gt;+ '[' X/hive-data-dir-benchmark = X ']'&lt;BR /&gt;+ '[' 5 -eq 1 ']'&lt;BR /&gt;+ hdfs dfs -mkdir -p /hive-data-dir-benchmark&lt;BR /&gt;+ hdfs dfs -ls /hive-data-dir-benchmark/5/lineitem&lt;BR /&gt;+ '[' 0 -ne 0 ']'&lt;BR /&gt;+ hdfs dfs -ls /hive-data-dir-benchmark/5/lineitem&lt;BR /&gt;+ '[' 0 -ne 0 ']'&lt;BR /&gt;+ echo 'TPC-H text data generation complete.'&lt;BR /&gt;TPC-H text data generation complete.&lt;BR /&gt;+ echo 'Loading text data into external tables.'&lt;BR /&gt;Loading text data into external tables.&lt;BR /&gt;+ runcommand 'hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/alltables.sql -d DB=tpch_text_5 -d LOCATION=/hive-data-dir-benchmark/5'&lt;BR /&gt;+ '[' XON '!=' X ']'&lt;BR /&gt;+ hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/alltables.sql -d DB=tpch_text_5 -d LOCATION=/hive-data-dir-benchmark/5&lt;/P&gt;&lt;P&gt;Logging initialized using configuration in jar:file:/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/jars/hive-common-1.1.0-cdh5.15.2.jar!/hive-log4j.properties&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 2.198 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.013 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.578 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 1.183 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.814 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.494 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.504 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.493 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.506 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.495 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.502 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.494 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.503 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.496 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.505 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.495 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.503 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.495 seconds&lt;BR /&gt;WARN: The method class org.apache.commons.logging.impl.SLF4JLogFactory#release() was invoked.&lt;BR /&gt;WARN: Please see &lt;A href="http://www.slf4j.org/codes.html#release" target="_blank"&gt;http://www.slf4j.org/codes.html#release&lt;/A&gt; for an explanation.&lt;BR /&gt;+ i=1&lt;BR /&gt;+ total=8&lt;BR /&gt;+ test 5 -le 1000&lt;BR /&gt;+ SCHEMA_TYPE=flat&lt;BR /&gt;+ DATABASE=tpch_flat_orc_5&lt;BR /&gt;+ MAX_REDUCERS=2600&lt;BR /&gt;++ test 5 -gt 2600&lt;BR /&gt;++ echo 5&lt;BR /&gt;+ REDUCERS=5&lt;BR /&gt;+ for t in '${TABLES}'&lt;BR /&gt;+ echo 'Optimizing table part (1/8).'&lt;BR /&gt;Optimizing table part (1/8).&lt;BR /&gt;+ COMMAND='hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc'&lt;BR /&gt;+ runcommand 'hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc'&lt;BR /&gt;+ '[' XON '!=' X ']'&lt;BR /&gt;+ hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc&lt;/P&gt;&lt;P&gt;Logging initialized using configuration in jar:file:/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/jars/hive-common-1.1.0-cdh5.15.2.jar!/hive-log4j.properties&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 2.152 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.017 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.051 seconds&lt;BR /&gt;Query ID = c095784_20200816094848_3f33b234-3f7f-4d1b-b862-2624c0bb43cd&lt;BR /&gt;Total jobs = 1&lt;BR /&gt;Launching Job 1 out of 1&lt;BR /&gt;Number of reduce tasks not specified. Estimated from input data size: 1&lt;BR /&gt;In order to change the average load for a reducer (in bytes):&lt;BR /&gt;set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to limit the maximum number of reducers:&lt;BR /&gt;set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to set a constant number of reducers:&lt;BR /&gt;set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;^C+ '[' 130 -ne 0 ']'&lt;BR /&gt;+ echo 'Command failed, try '\''export DEBUG_SCRIPT=ON'\'' and re-running'&lt;BR /&gt;Command failed, try 'export DEBUG_SCRIPT=ON' and re-running&lt;BR /&gt;+ exit 1&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench :&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench : klist&lt;BR /&gt;Ticket cache: FILE:/tmp/krb5cc_895784&lt;BR /&gt;Default principal: neha@EXELONDS.COM&lt;/P&gt;&lt;P&gt;Valid starting Expires Service principal&lt;BR /&gt;08/16/20 08:41:04 08/16/20 18:41:04 krbtgt/EXELONDS.COM@EXELONDS.COM&lt;BR /&gt;08/16/20 08:41:04 08/16/20 18:41:04 BDAL1CCC1N06$@EXELONDS.COM&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench : export DEBUG_SCRIPT=ON&lt;BR /&gt;sandbox.cluster.com /home/c095784/benchmarking/hive-testbench : ./tpch-setup.sh 5 /hive-data-dir-benchmark&lt;BR /&gt;+ '[' X5 = X ']'&lt;BR /&gt;+ '[' X/hive-data-dir-benchmark = X ']'&lt;BR /&gt;+ '[' 5 -eq 1 ']'&lt;BR /&gt;+ hdfs dfs -mkdir -p /hive-data-dir-benchmark&lt;BR /&gt;+ hdfs dfs -ls /hive-data-dir-benchmark/5/lineitem&lt;BR /&gt;+ '[' 0 -ne 0 ']'&lt;BR /&gt;+ hdfs dfs -ls /hive-data-dir-benchmark/5/lineitem&lt;BR /&gt;+ '[' 0 -ne 0 ']'&lt;BR /&gt;+ echo 'TPC-H text data generation complete.'&lt;BR /&gt;TPC-H text data generation complete.&lt;BR /&gt;+ echo 'Loading text data into external tables.'&lt;BR /&gt;Loading text data into external tables.&lt;BR /&gt;+ runcommand 'hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/alltables.sql -d DB=tpch_text_5 -d LOCATION=/hive-data-dir-benchmark/5'&lt;BR /&gt;+ '[' XON '!=' X ']'&lt;BR /&gt;+ hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/alltables.sql -d DB=tpch_text_5 -d LOCATION=/hive-data-dir-benchmark/5&lt;/P&gt;&lt;P&gt;Logging initialized using configuration in jar:file:/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/jars/hive-common-1.1.0-cdh5.15.2.jar!/hive-log4j.properties&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 2.225 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.018 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.802 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.991 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.506 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.494 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.504 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.493 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.505 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.495 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.502 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.496 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.503 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.495 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.502 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.496 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.503 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.497 seconds&lt;BR /&gt;WARN: The method class org.apache.commons.logging.impl.SLF4JLogFactory#release() was invoked.&lt;BR /&gt;WARN: Please see &lt;A href="http://www.slf4j.org/codes.html#release" target="_blank"&gt;http://www.slf4j.org/codes.html#release&lt;/A&gt; for an explanation.&lt;BR /&gt;+ i=1&lt;BR /&gt;+ total=8&lt;BR /&gt;+ test 5 -le 1000&lt;BR /&gt;+ SCHEMA_TYPE=flat&lt;BR /&gt;+ DATABASE=tpch_flat_orc_5&lt;BR /&gt;+ MAX_REDUCERS=2600&lt;BR /&gt;++ test 5 -gt 2600&lt;BR /&gt;++ echo 5&lt;BR /&gt;+ REDUCERS=5&lt;BR /&gt;+ for t in '${TABLES}'&lt;BR /&gt;+ echo 'Optimizing table part (1/8).'&lt;BR /&gt;Optimizing table part (1/8).&lt;BR /&gt;+ COMMAND='hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc'&lt;BR /&gt;+ runcommand 'hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc'&lt;BR /&gt;+ '[' XON '!=' X ']'&lt;BR /&gt;+ hive -i settings/load-flat.sql -f ddl-tpch/bin_flat/part.sql -d DB=tpch_flat_orc_5 -d SOURCE=tpch_text_5 -d BUCKETS=13 -d SCALE=5 -d REDUCERS=5 -d FILE=orc&lt;/P&gt;&lt;P&gt;Logging initialized using configuration in jar:file:/opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/jars/hive-common-1.1.0-cdh5.15.2.jar!/hive-log4j.properties&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 2.126 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.015 seconds&lt;BR /&gt;OK&lt;BR /&gt;Time taken: 0.049 seconds&lt;BR /&gt;Query ID = c095784_20200816115353_616b8c96-f2da-4ea7-94a3-2d1501c02691&lt;BR /&gt;Total jobs = 1&lt;BR /&gt;Launching Job 1 out of 1&lt;BR /&gt;Number of reduce tasks not specified. Estimated from input data size: 1&lt;BR /&gt;In order to change the average load for a reducer (in bytes):&lt;BR /&gt;set hive.exec.reducers.bytes.per.reducer=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to limit the maximum number of reducers:&lt;BR /&gt;set hive.exec.reducers.max=&amp;lt;number&amp;gt;&lt;BR /&gt;In order to set a constant number of reducers:&lt;BR /&gt;set mapreduce.job.reduces=&amp;lt;number&amp;gt;&lt;BR /&gt;Starting Job = job_1597596829164_0002, Tracking URL = &lt;A href="http://sandbox.cluster.com:8088/proxy/application_1597596829164_0002/" target="_blank"&gt;http://sandbox.cluster.com:8088/proxy/application_1597596829164_0002/&lt;/A&gt;&lt;BR /&gt;Kill Command = /opt/cloudera/parcels/CDH-5.15.2-1.cdh5.15.2.p0.3/lib/hadoop/bin/hadoop job -kill job_1597596829164_0002&lt;BR /&gt;Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 1&lt;BR /&gt;2020-08-16 11:54:44,433 Stage-1 map = 0%, reduce = 0%&lt;BR /&gt;2020-08-16 11:54:54,917 Stage-1 map = 100%, reduce = 0%, Cumulative CPU 3.9 sec&lt;BR /&gt;2020-08-16 11:55:13,757 Stage-1 map = 0%, reduce = 0%&lt;BR /&gt;2020-08-16 11:55:18,933 Stage-1 map = 100%, reduce = 0%, Cumulative CPU 3.41 sec&lt;BR /&gt;2020-08-16 11:55:19,971 Stage-1 map = 100%, reduce = 100%, Cumulative CPU 3.41 sec&lt;BR /&gt;MapReduce Total cumulative CPU time: 3 seconds 410 msec&lt;BR /&gt;Ended Job = job_1597596829164_0002 with errors&lt;BR /&gt;Error during job, obtaining debugging information...&lt;BR /&gt;Examining task ID: task_1597596829164_0002_m_000000 (and more) from job job_1597596829164_0002&lt;/P&gt;&lt;P&gt;Task with the most failures(4):&lt;BR /&gt;-----&lt;BR /&gt;Task ID:&lt;BR /&gt;task_1597596829164_0002_r_000000&lt;/P&gt;&lt;P&gt;URL:&lt;BR /&gt;&lt;A href="http://0.0.0.0:8088/taskdetails.jsp?jobid=job_1597596829164_0002&amp;amp;tipid=task_1597596829164_0002_r_000000" target="_blank"&gt;http://0.0.0.0:8088/taskdetails.jsp?jobid=job_1597596829164_0002&amp;amp;tipid=task_1597596829164_0002_r_000000&lt;/A&gt;&lt;BR /&gt;-----&lt;BR /&gt;Diagnostic Messages for this Task:&lt;BR /&gt;Error: org.apache.hadoop.mapreduce.task.reduce.Shuffle$ShuffleError: error in shuffle in fetcher#4&lt;BR /&gt;at org.apache.hadoop.mapreduce.task.reduce.Shuffle.run(Shuffle.java:134)&lt;BR /&gt;at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:376)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)&lt;BR /&gt;at java.security.AccessController.doPrivileged(Native Method)&lt;BR /&gt;at javax.security.auth.Subject.doAs(Subject.java:422)&lt;BR /&gt;at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1924)&lt;BR /&gt;at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)&lt;BR /&gt;Caused by: java.io.IOException: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.&lt;BR /&gt;at org.apache.hadoop.mapreduce.task.reduce.ShuffleSchedulerImpl.checkReducerHealth(ShuffleSchedulerImpl.java:392)&lt;BR /&gt;at org.apache.hadoop.mapreduce.task.reduce.ShuffleSchedulerImpl.copyFailed(ShuffleSchedulerImpl.java:307)&lt;BR /&gt;at org.apache.hadoop.mapreduce.task.reduce.Fetcher.copyFromHost(Fetcher.java:366)&lt;BR /&gt;at org.apache.hadoop.mapreduce.task.reduce.Fetcher.run(Fetcher.java:198)&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask&lt;BR /&gt;MapReduce Jobs Launched:&lt;BR /&gt;Stage-Stage-1: Map: 1 Reduce: 1 Cumulative CPU: 3.41 sec HDFS Read: 5155 HDFS Write: 0 FAIL&lt;BR /&gt;Total MapReduce CPU Time Spent: 3 seconds 410 msec&lt;BR /&gt;WARN: The method class org.apache.commons.logging.impl.SLF4JLogFactory#release() was invoked.&lt;BR /&gt;WARN: Please see &lt;A href="http://www.slf4j.org/codes.html#release" target="_blank"&gt;http://www.slf4j.org/codes.html#release&lt;/A&gt; for an explanation.&lt;BR /&gt;+ '[' 2 -ne 0 ']'&lt;BR /&gt;+ echo 'Command failed, try '\''export DEBUG_SCRIPT=ON'\'' and re-running'&lt;BR /&gt;Command failed, try 'export DEBUG_SCRIPT=ON' and re-running&lt;BR /&gt;+ exit 1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sun, 16 Aug 2020 19:15:46 GMT</pubDate>
    <dc:creator>nehanirmal</dc:creator>
    <dc:date>2020-08-16T19:15:46Z</dc:date>
  </channel>
</rss>

