Created on 05-04-2017 11:54 AM - edited 09-16-2022 04:33 AM
I want to take a backup of my hive database (metadata + data)
Please guide me in how to do it.
Is it possible to do it using distcp?
Also, is it possible to take the backup on a local system?
Created 05-04-2017 12:38 PM
Backup:
1. The Hive Metastore
- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.
- Metastore contains metadata, table partition info, DDL, info about tables
2. The data
- This is in HDFS
- You can use DistCp to copy the data to another cluster
Created 05-04-2017 12:38 PM
Backup:
1. The Hive Metastore
- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.
- Metastore contains metadata, table partition info, DDL, info about tables
2. The data
- This is in HDFS
- You can use DistCp to copy the data to another cluster
Created 05-26-2017 02:07 PM
In addition to @Binu Mathew If you don't want to take separate backups, and there are limited table to take, even you can use Hive Import/Export option.
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport