- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Hive databse backup
- Labels:
-
Apache Hive
Created on ‎05-04-2017 11:54 AM - edited ‎09-16-2022 04:33 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I want to take a backup of my hive database (metadata + data)
Please guide me in how to do it.
Is it possible to do it using distcp?
Also, is it possible to take the backup on a local system?
Created ‎05-04-2017 12:38 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Backup:
1. The Hive Metastore
- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.
- Metastore contains metadata, table partition info, DDL, info about tables
2. The data
- This is in HDFS
- You can use DistCp to copy the data to another cluster
Created ‎05-04-2017 12:38 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Backup:
1. The Hive Metastore
- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.
- Metastore contains metadata, table partition info, DDL, info about tables
2. The data
- This is in HDFS
- You can use DistCp to copy the data to another cluster
Created ‎05-26-2017 02:07 PM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
In addition to @Binu Mathew If you don't want to take separate backups, and there are limited table to take, even you can use Hive Import/Export option.
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport
