Community Articles

Find and share helpful community-sourced technical articles.
Announcements
Celebrating as our community reaches 100,000 members! Thank you!
Labels (1)
avatar
Cloudera Employee

Default location for spark event log history is hdfs:///spark-history (Spark) and hdfs:///spark2-history/ (spark2).

This will help to debug spark-history page load issue or if you have huge count of event log files you can archive it by creating the new active location.

Following are the steps to change this default location.

1. Create new directory on hdfs for e.g

$hdfs dfs -mkdir /spark2-history_new 
$hdfs dfs -chown spark:hadoop /spark2-history_new

2. Login to Amabri==>Spark==>config.

3. Update following parameters with new path "hdfs:///spark2-history_new/ " as follow.

83457-screen-shot-2018-07-26-at-12613-pm.png

4. Save the configuration.

5. Restart Spark service to enable new changes.

6. Run spark job, your new event log file will get save in new location. Same you can view using the spark history UI.

4,312 Views
0 Kudos