Dr. Elephant

Dr. Elephant is a performance monitoring and tuning tool for Hadoop and Spark. It automatically gathers a job's metrics, analyzes them, and presents the results in a form that is easy to consume. Its goal is to improve developer productivity and increase cluster efficiency by making jobs easier to tune. It analyzes Hadoop and Spark jobs using a set of pluggable, configurable, rule-based heuristics that show how a job performed, then uses the results to suggest how to tune the job so that it runs more efficiently. It also computes a number of metrics for each job, which provide valuable information about the job's performance on the cluster.

Build Steps


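# Start from the home directory and bring the system packages up to date.
cd ~
sudo yum update -y
sudo yum upgrade -y
# Register the NodeSource repository for Node.js 8.x, then install the build dependencies.
curl --silent --location https://rpm.nodesource.com/setup_8.x | sudo bash -
sudo yum install -y nodejs wget unzip git zip gcc-c++ make java-1.8.0-openjdk-devel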

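# Download the Dr. Elephant 2.1.7 source and the Typesafe Activator 1.3.12 build tool.
wget https://github.com/linkedin/dr-elephant/archive/v2.1.7.tar.gz
wget https://downloads.typesafe.com/typesafe-activator/1.3.12/typesafe-activator-1.3.12.zip

# Unpack both archives into the home directory.
tar -xvzf v2.1.7.tar.gz
unzip typesafe-activator-1.3.12.zip

# Put the Activator launcher on the PATH.
export ACTIVATOR_HOME="$HOME/activator-dist-1.3.12"
export PATH="$ACTIVATOR_HOME/bin:$PATH"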

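# Disable npm's TLS certificate check for both root and the current user.
# This weakens security; skip it if npm can already reach the registry over HTTPS.
sudo npm config set strict-ssl false
npm config set strict-ssl false
sudo npm install ember-tooltips
sudo npm install -g bower

# Run the remaining steps as a non-root user.
cd dr-elephant-2.1.7/
# Fetch the web UI dependencies, then build from the project root.
cd web && bower install && cd .. && ./compile.sh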
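
Because the build depends on several tools installed in the earlier steps, it can help to confirm they are all on the PATH before running compile.sh. The following is a minimal sketch, not part of Dr. Elephant itself: the function names missing_tools and check_build_prereqs are hypothetical, and the tool list is an assumption derived from the commands used above.

```shell
# Hypothetical helper: print the name of every given tool that is not on the PATH.
missing_tools() {
  for _tool in "$@"; do
    command -v "$_tool" >/dev/null 2>&1 || printf '%s\n' "$_tool"
  done
}

# Assumed list of tools the build steps rely on; adjust it to your environment.
REQUIRED_TOOLS="node npm git zip unzip java activator bower"

# Returns 0 when every tool in REQUIRED_TOOLS is found, 1 otherwise.
check_build_prereqs() {
  # REQUIRED_TOOLS is left unquoted on purpose so it splits into separate names.
  _missing="$(missing_tools $REQUIRED_TOOLS)"
  if [ -n "$_missing" ]; then
    # The unquoted echo joins the newline-separated names onto one line.
    printf 'Missing build tools: %s\n' "$(echo $_missing)" >&2
    return 1
  fi
  echo "All build prerequisites found."
}
```

One way to use it is to gate the build on the check, for example: check_build_prereqs && ./compile.sh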