How to find the running time for a program in MapReduce
Labels: Apache Hadoop
Created ‎10-01-2018 10:49 AM
I have executed a program in Hadoop MapReduce and I want to study its performance with different input sizes. How can I find the running time taken for each input?
The results are in the attached file, mapreduce.txt.
Thank you.
Created ‎10-01-2018 01:21 PM
You can use the mapred command-line tool to get the statistics for each job. The command below prints all the stats of a job, including its counters:
mapred job -status {job_id}
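To get the CPU time used by the job, filter that output:
mapred job -status {job_id} | grep "CPU time"
This matches the "CPU time spent (ms)" counter, which is the total CPU time consumed across all tasks of the job, not its elapsed wall-clock time. For elapsed time, compare the job's start and finish times as recorded in the job history.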
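To compare runs across input sizes, it can help to pull out just the number from that counter line. A minimal sketch, assuming the status output prints the counter as `CPU time spent (ms)=<number>` (check this against your Hadoop version); the helper name `extract_cpu_ms` is made up for this example:

```shell
# extract_cpu_ms (hypothetical helper): reads `mapred job -status` output
# on stdin and prints the value of the "CPU time spent (ms)" counter.
# Assumes the counter line ends with "=<digits>".
extract_cpu_ms() {
  awk -F= '/CPU time spent \(ms\)/ {
    gsub(/[^0-9]/, "", $NF)   # keep only the digits of the last field
    print $NF
    exit                      # stop after the first matching line
  }'
}
```

It would be used as `mapred job -status {job_id} | extract_cpu_ms`, once per job, so the values can be collected next to the corresponding input sizes.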
If you only have the YARN application ID, replace the 'application_' prefix with 'job_' to get the MapReduce job ID (application_xxx becomes job_xxx).
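The prefix swap can be scripted when there are several jobs to look up. A small sketch; the function name `app_to_job_id` and the ID in the usage line are illustrative only, not taken from the attached output:

```shell
# app_to_job_id (hypothetical helper): converts a YARN application ID
# into the matching MapReduce job ID by swapping the leading prefix.
app_to_job_id() {
  printf '%s\n' "$1" | sed 's/^application_/job_/'
}
```

For example, `mapred job -status "$(app_to_job_id application_1538000000000_0042)"` would query the job that belongs to that (made-up) application ID.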
If you wrote the MapReduce application yourself, you can also add custom counters to report extra information. See the example at
https://acadgild.com/blog/counters-in-mapreduce
Please "Accept" the answer if this helps.
Created ‎10-05-2018 02:30 PM
Did this work for you? Please take a moment to log in and "Accept" the answer if it helped. This will be really useful for other community users 🙂
