Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

YARN Shuffle failure

avatar
New Member

Hi all,

I would to run a benchmark of YARN service using the following command:

yarn jar /usr/hdp/current/hadoop-mapreduce-client/hadoop-mapreduce-examples.jar terasort -Dmapred.reduce.tasks=10 -Dmapreduce.jobtracker.maxtasks.perjob=-1 -Dmapreduce.job.reduce.slowstart.completedmaps=1 /tmp/HA_ACCEPTANCE/teraInput1G /tmp/HA_ACCEPTANCE/teraOutput1G

But i'm having this error:

Error: org.apache.hadoop.mapreduce.task.reduce.Shuffle$ShuffleError: error in shuffle in fetcher#4  at org.apache.hadoop.mapreduce.task.reduce.Shuffle.run(Shuffle.java:134)  at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:376)  at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:170)  at java.security.AccessController.doPrivileged(Native Method)  at javax.security.auth.Subject.doAs(Subject.java:422)  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1869)  at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:164)Caused by: java.io.IOException: Exceeded MAX_FAILED_UNIQUE_FETCHES; bailing-out.  at org.apache.hadoop.mapreduce.task.reduce.ShuffleSchedulerImpl.checkReducerHealth(ShuffleSchedulerImpl.java:366)  at org.apache.hadoop.mapreduce.task.reduce.ShuffleSchedulerImpl.copyFailed(ShuffleSchedulerImpl.java:288)  at org.apache.hadoop.mapreduce.task.reduce.Fetcher.copyFromHost(Fetcher.java:354)  at org.apache.hadoop.mapreduce.task.reduce.Fetcher.run(Fetcher.java:193)

Anyone has any idea how to fix it?

Thank you!

1 ACCEPTED SOLUTION

avatar
New Member

Hello,

I solved the issue. Now all is working fine.

The problem was an error in the sssd.conf file:

I added :

- use_fully_qualified_names = false

Andd i deleted:

- override_space: ^

- default_domain_suffix: DOMAIN

View solution in original post

1 REPLY 1

avatar
New Member

Hello,

I solved the issue. Now all is working fine.

The problem was an error in the sssd.conf file:

I added :

- use_fully_qualified_names = false

Andd i deleted:

- override_space: ^

- default_domain_suffix: DOMAIN