I want to take a backup of my hive database (metadata + data)
Please guide me in how to do it.
Is it possible to do it using distcp?
Also, is it possible to take the backup on a local system?
1. The Hive Metastore
- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.
- Metastore contains metadata, table partition info, DDL, info about tables
2. The data
- This is in HDFS
- You can use DistCp to copy the data to another cluster
View solution in original post
In addition to @Binu Mathew If you don't want to take separate backups, and there are limited table to take, even you can use Hive Import/Export option.