Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant.

Snapshotting Apps/Hive/Warehouse

Are there any concerns with turning on HDFS snapshots on the Apps/Hive/Warehouse directory? I want to make sure the directory is recoverable if someone accidentally runs rmr on it (or if anything else happens).

https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html
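
For readers who have not used the feature, a minimal sketch of enabling snapshots on the directory in question might look like the following. The path /apps/hive/warehouse, the snapshot name, and the DRY_RUN switch are assumptions for illustration only; the hdfs dfsadmin -allowSnapshot and hdfs dfs -createSnapshot commands are the ones described in the linked HDFS snapshots documentation.

```shell
#!/bin/sh
# Sketch only: WAREHOUSE_DIR, DRY_RUN and the snapshot name are assumptions.
set -eu

WAREHOUSE_DIR="${WAREHOUSE_DIR:-/apps/hive/warehouse}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the commands instead of running them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then echo "$*"; else "$@"; fi
}

# Mark the directory snapshottable (requires HDFS superuser), then snapshot it.
enable_and_snapshot() {
  run hdfs dfsadmin -allowSnapshot "$WAREHOUSE_DIR"
  run hdfs dfs -createSnapshot "$WAREHOUSE_DIR" "$1"
}

enable_and_snapshot "daily-$(date +%Y%m%d)"
```

Run it with DRY_RUN=0 to execute the commands for real. According to the linked documentation, a snapshottable directory that still has snapshots can be neither deleted nor renamed until those snapshots are removed.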

1 ACCEPTED SOLUTION

Master Mentor

@Andrew Watson

A couple of points:

1) Please see this link; I found it really useful on snapshots.

2) Hive exports

Personally, I wouldn't enable snapshots on the warehouse location and would stick with Hive exports instead, because of performance concerns.
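
As a rough illustration of the Hive export route, the sketch below wraps Hive's EXPORT TABLE statement in a small script. The table name orders, the target path /backups/hive/orders, and the DRY_RUN switch are hypothetical and not taken from this thread.

```shell
#!/bin/sh
# Sketch only: the table name, export path and DRY_RUN switch are assumptions.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = print the command instead of running it

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then echo "$*"; else "$@"; fi
}

# Export one table (data plus metadata) to an HDFS directory.
# $1 = table name, $2 = HDFS target directory
export_table() {
  run hive -e "EXPORT TABLE $1 TO '$2'"
}

export_table orders /backups/hive/orders
```

An exported table can later be brought back with the matching IMPORT TABLE ... FROM statement. Unlike a snapshot, an export is a full copy of the data, so it costs additional HDFS space and time for each run.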

3 REPLIES

@Neeraj Sabharwal and @Andrew Watson if you are using HDFS snapshots, it is important to back up the Hive metastore database as well. HDFS snapshots alone won't be enough: without the table and partition information, the data files cannot be mapped back to tables if a user accidentally executes DROP TABLE or DROP DATABASE.
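
A metastore backup could be sketched as below, assuming the metastore is stored in MySQL. The database name hive, the backup directory, the DRY_RUN switch, and credentials supplied through a MySQL option file are all assumptions; other metastore databases need their own dump tools.

```shell
#!/bin/sh
# Sketch only: assumes a MySQL-backed metastore; names and paths are assumptions.
set -eu

METASTORE_DB="${METASTORE_DB:-hive}"
BACKUP_DIR="${BACKUP_DIR:-/var/backups/hive-metastore}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the command instead of running it

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then echo "$*"; else "$@"; fi
}

# Dump the metastore schema and rows into a dated SQL file.
# $1 = label for the dump file, e.g. a date stamp
backup_metastore() {
  run mysqldump --single-transaction \
    --result-file="$BACKUP_DIR/$METASTORE_DB-$1.sql" "$METASTORE_DB"
}

backup_metastore "$(date +%Y%m%d)"
```

Taking the dump at roughly the same time as the HDFS snapshot keeps the table definitions and the snapshotted files consistent with each other.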

Rising Star

A mistyped hadoop fs -rmr -skipTrash can have catastrophic consequences, and snapshots protect against exactly that. What are the performance concerns?
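
To make the recovery scenario concrete, here is a sketch of restoring a deleted subdirectory from an existing snapshot. The warehouse path, the snapshot name, the relative path sales.db/orders, and the DRY_RUN switch are hypothetical; the .snapshot path component and the hdfs dfs -cp -ptopax form follow the HDFS snapshots documentation linked in the question.

```shell
#!/bin/sh
# Sketch only: paths, snapshot name and DRY_RUN switch are assumptions.
set -eu

WAREHOUSE_DIR="${WAREHOUSE_DIR:-/apps/hive/warehouse}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the commands instead of running them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then echo "$*"; else "$@"; fi
}

# List the snapshots available under the warehouse directory.
list_snapshots() {
  run hdfs dfs -ls "$WAREHOUSE_DIR/.snapshot"
}

# Copy a path out of a snapshot back to its original location,
# preserving timestamps, ownership, permissions, ACLs and XAttrs.
# $1 = snapshot name, $2 = path relative to the warehouse directory
restore_from_snapshot() {
  run hdfs dfs -cp -ptopax "$WAREHOUSE_DIR/.snapshot/$1/$2" "$WAREHOUSE_DIR/$2"
}

list_snapshots
restore_from_snapshot daily-20160101 sales.db/orders
```

The copy assumes the parent directory of the restored path still exists. Only data present when the snapshot was taken can be recovered this way.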