Created on 07-09-2016 01:30 AM - edited 08-17-2019 11:27 AM
Short Description:
Teragen and Terasort Performance testing on AWS
Article
This article should be used with extreme care. Do not use it as a benchmark. I ran this test simply as a quick 1 terabyte teragen run on AWS, to see what kind of performance I could get from MapReduce on AWS with VERY LITTLE configuration tweaking or tuning.
On my GitHub page here you will find the following:
teragen script
hadoop, yarn, mapred, and capacity scheduler configurations used during testing
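For readers who want a feel for what a 1 terabyte teragen run looks like before opening the script, here is a minimal sketch of how the command line can be put together. This is not the script from my GitHub page. TeraGen takes a row count rather than a byte count, and each row is 100 bytes, so 1 TB (taken here as 10^12 bytes) works out to 10 billion rows. The jar name, the output directory, and the map count of 100 are placeholders I picked for illustration; adjust them for your own cluster.

```python
import shlex

ROW_BYTES = 100  # TeraGen writes fixed-size 100-byte rows


def teragen_rows(total_bytes: int) -> int:
    """Convert a target data size in bytes to the row count TeraGen expects."""
    return total_bytes // ROW_BYTES


def teragen_command(total_bytes, output_dir, examples_jar, num_maps=None):
    """Build the argument list for a TeraGen run.

    examples_jar and output_dir are supplied by the caller; the values used
    below are hypothetical placeholders, not paths from the original test.
    """
    cmd = ["hadoop", "jar", examples_jar, "teragen"]
    if num_maps is not None:
        # Generic -D options go right after the program name.
        cmd.append(f"-Dmapreduce.job.maps={num_maps}")
    cmd += [str(teragen_rows(total_bytes)), output_dir]
    return cmd


if __name__ == "__main__":
    one_tb = 10**12  # assumption: decimal terabyte
    cmd = teragen_command(
        one_tb,
        "/tmp/teragen-1tb",               # hypothetical HDFS output directory
        "hadoop-mapreduce-examples.jar",  # hypothetical jar location
        num_maps=100,                     # hypothetical map count
    )
    print(shlex.join(cmd))
```

The sketch only prints the command so it can be reviewed first. To actually launch the job, pass the list to subprocess.run on a node where the hadoop client is installed.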