Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Mape Reduce Simulation

Mape Reduce Simulation

New Contributor

Hi All,

I am new to Hadoop and started to learning. I want a sample program which need to run a task (Map) more than 5 minutes on data node. because I need to findout long running tasks, so that I can take a look on that particular node.

what will happen if I kill that task process? will it run on another node or whole job will fail?

1 REPLY 1
Highlighted

Re: Mape Reduce Simulation

You could run Teragen/Sort for this.

Here's a script on my gist.github.com page that can be run against an HDP cluster for this.

You can control the size, mappers and reducers from the commandline, even experiment with block sizes.

Don't have an account?
Coming from Hortonworks? Activate your account here