Member since
07-20-2014
39
Posts
4
Kudos Received
2
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
2650 | 08-10-2015 05:07 PM | |
3640 | 02-25-2015 03:10 AM |
10-18-2017
06:52 AM
Hi there, Thank you for following up. We have identified the cause and resolved it. Our DNS was setup incorrectly. This issue can be closed. Thank you, Vishal
... View more
08-10-2015
05:07 PM
Hi,
As described in the sort based shuffle design doc (https://issues.apache.org/jira/secure/attachment/12655884/Sort-basedshuffledesign.pdf), each map task should generate 1 shuffle data file 1 index file.
Regarding your second question, the property to specify the buffer for shuffle data is "spark.shuffle.memoryFraction". This is discussed in more detail in the following Cloudera blog:
http://blog.cloudera.com/blog/2015/03/how-to-tune-your-apache-spark-jobs-part-2/
Regards,
Bjorn
... View more
02-25-2015
03:10 AM
Hi, The stack trace reported here is identical to MAPREDUCE-5799. Its a classpath issue that can be resolved by adding the following property to your client configurations: <property> <name>yarn.app.mapreduce.am.env</name> <value>LD_LIBRARY_PATH=/opt/cloudera/parcels/CDH/lib/hadoop/lib/native</value> </property>
... View more