04-17-2018 08:27 AM
I use the same command and have no issues.
According to logs:
Caused by: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for output/attempt_1523546159827_0013_r_000000_0/map_0.out
So I would guess that your CSV is too big, and when the reducer tries to load it there is not enough space in the local dirs of the YARN NodeManager.
Can you try setting more reducers by using:
or more (based on your partitions and the CSV size). You can also set more mappers, but based on the log it is the reducer that is running out of space.
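If it helps to pick a starting value, here is a rough sketch of how one might estimate a reducer count from the CSV size and the free space in a NodeManager local dir. It is only a heuristic: it assumes the shuffle data is spread evenly across reducers and that each reducer needs about `safety_factor` times its share as local scratch space. The names `free_bytes` and `suggest_reducers` and the factor of 2 are my own illustrative choices, not anything from Hadoop.

```python
import math
import shutil


def free_bytes(path):
    """Return the free space, in bytes, on the filesystem holding `path`."""
    return shutil.disk_usage(path).free


def suggest_reducers(input_bytes, free_bytes_per_node, safety_factor=2.0):
    """Estimate a reducer count so that one reducer's share of the input,
    multiplied by `safety_factor`, fits in the free local space of a node.

    Assumption: data is distributed evenly across reducers. Skewed keys
    would need a larger count than this returns.
    """
    if free_bytes_per_node <= 0:
        raise ValueError("no free local space available")
    needed = input_bytes * safety_factor
    return max(1, math.ceil(needed / free_bytes_per_node))


# Example call (the path is hypothetical; use one of the directories listed
# in yarn.nodemanager.local-dirs on your nodes):
#   suggest_reducers(csv_size_in_bytes, free_bytes("/hadoop/yarn/local"))
```

The result would then be used as the value for `mapreduce.job.reduces`. Treat it as a lower bound to experiment from rather than an exact answer.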