Reply
Highlighted
New Contributor
Posts: 3
Registered: ‎04-21-2016

Remove dupes

Hi,

 

I have billion rows of hive table in parquet - contains id column which has dupes - used distinct but when map reduce runs, after some time job fails with return code of 2 - Any suggestions - thanks

Posts: 177
Topics: 8
Kudos: 28
Solutions: 19
Registered: ‎07-16-2015

Re: Remove dupes

Read the log files of the map/reduce job.

They will tell you what is wrong.

 

 

Announcements