Created 11-19-2015 03:20 AM
Are there any concerns with turning on HDFS snapshots on the Apps/Hive/Warehouse directory? I want to make sure the directory is recoverable if someone were to accidentally rmr it (or if anything else happened).
https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html
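For reference, turning snapshots on for a directory might look roughly like the sketch below. The `hdfs dfsadmin -allowSnapshot` and `hdfs dfs -createSnapshot` commands come from the HDFS snapshots guide linked above; the `run` and `snapshot_warehouse` helpers, the path `/apps/hive/warehouse`, and the snapshot name are assumptions for illustration only.

```shell
# Print each command; execute it only when DRY_RUN=0 (default is print-only).
run() {
  printf '+ %s\n' "$*"
  if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

# snapshot_warehouse DIR NAME  (hypothetical helper)
# $1 = directory to protect, $2 = snapshot name
snapshot_warehouse() {
  # One-time step, needs HDFS superuser: mark the directory snapshottable.
  run hdfs dfsadmin -allowSnapshot "$1" &&
  # Take a named, read-only snapshot of the current state.
  run hdfs dfs -createSnapshot "$1" "$2" &&
  # Snapshots are exposed under the hidden .snapshot directory.
  run hdfs dfs -ls "$1/.snapshot"
}
```

Calling `snapshot_warehouse /apps/hive/warehouse s1` only prints the commands; set `DRY_RUN=0` to actually run them on a cluster.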
Created 11-19-2015 11:11 AM
A couple of points:
1) Please see this link; I found it really useful on the topic of snapshots.
2) Hive exports
Personally, I wouldn't enable snapshots on the warehouse location and would stick with Hive exports instead, because of performance concerns.
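As a rough illustration of the Hive export route, the sketch below wraps Hive's `EXPORT TABLE ... TO '...'` statement in a small shell helper. The `run` and `export_table` helpers, the table name `sales`, and the target path are hypothetical; it also assumes the `hive` CLI is on the path.

```shell
# Print each command; execute it only when DRY_RUN=0 (default is print-only).
# Note: the printed form loses the shell quoting around the query.
run() {
  printf '+ %s\n' "$*"
  if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

# export_table TABLE TARGET_DIR  (hypothetical helper)
# Writes the table's data and metadata to TARGET_DIR on HDFS.
export_table() {
  run hive -e "EXPORT TABLE $1 TO '$2'"
}
```

For example, `export_table sales /tmp/exports/sales` prints the statement that would be run. An export written this way can later be loaded back with Hive's `IMPORT` statement.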
Created 11-19-2015 02:43 PM
@Neeraj Sabharwal and @Andrew Watson, if you are using HDFS snapshots, it is important to back up the Hive metastore database as well. An HDFS snapshot alone won't be enough without the table and partition information if a user accidentally executes DROP TABLE or DROP DATABASE.
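A metastore backup could be sketched as below. This assumes the metastore lives in MySQL; the database name `hive`, the user `hive`, and the output path are guesses to adjust for your setup, and `run` and `backup_metastore` are hypothetical helpers. Other metastore databases (PostgreSQL, Oracle) would need their own dump tool.

```shell
# Print each command; execute it only when DRY_RUN=0 (default is print-only).
run() {
  printf '+ %s\n' "$*"
  if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

# backup_metastore DB USER OUTFILE  (hypothetical helper)
# Dumps the metastore schema and data into a single SQL file.
backup_metastore() {
  # --single-transaction gives a consistent dump for InnoDB tables;
  # -p with no value makes mysqldump prompt for the password.
  run mysqldump --single-transaction --result-file="$3" -u "$2" -p "$1"
}
```

`backup_metastore hive hive /tmp/hive_metastore.sql` prints the dump command. Taking the dump at about the same time as the HDFS snapshot keeps the two roughly in step.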
Created 11-20-2015 10:03 PM
A mistyped hadoop fs -rmr -skipTrash can have catastrophic consequences, which snapshots can protect against. What are the performance concerns?
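To make the recovery case concrete: after an accidental delete, the data can be copied back out of the read-only `.snapshot` path. The sketch below follows the `hdfs dfs -cp -ptopax` example from the HDFS snapshots guide. The `run` and `restore_from_snapshot` helpers, the snapshot name `s1`, and the table directory `sales` are hypothetical, and it assumes the target path no longer exists.

```shell
# Print each command; execute it only when DRY_RUN=0 (default is print-only).
run() {
  printf '+ %s\n' "$*"
  if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

# restore_from_snapshot DIR SNAPSHOT RELPATH  (hypothetical helper)
# Copies RELPATH from the named snapshot of DIR back to its original place.
restore_from_snapshot() {
  # -ptopax preserves timestamps, ownership, permissions, ACLs and XAttrs.
  run hdfs dfs -cp -ptopax "$1/.snapshot/$2/$3" "$1/$3"
}
```

`restore_from_snapshot /apps/hive/warehouse s1 sales` prints the copy command for a deleted `sales` table directory.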