Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant

Hive database backup


I want to take a backup of my Hive database (metadata + data).

Please guide me in how to do it.

Is it possible to do it using distcp?

Also, is it possible to take the backup on a local system?

1 ACCEPTED SOLUTION


Backup:

1. The Hive Metastore

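- This lives in an RDBMS, commonly MySQL (note that plain Apache Hive defaults to an embedded Derby database). The RDBMS has its own backup tools, such as mysqldump for MySQL.

- The metastore holds the metadata: table definitions (DDL), partition information, and other details about the tables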

2. The data

- This is in HDFS

- You can use DistCp to copy the data to another cluster
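
To make the two steps above concrete, here is a minimal sketch that only builds the command lines and prints them; it does not run anything. It assumes a MySQL metastore, and the database name `hive`, the user `hive`, the NameNode addresses, the warehouse path `/user/hive/warehouse` and the backup paths are all placeholders to replace with your own values. The last helper shows one possible way to pull a copy onto a local disk with `hdfs dfs -get`, which is only practical when the data fits there.

```python
import shlex


def metastore_dump_command(db_name="hive", user="hive", out_file="hive_metastore.sql"):
    """Build a mysqldump command for the metastore database.

    db_name, user and out_file are assumed placeholder values.
    """
    return [
        "mysqldump",
        "--single-transaction",         # consistent snapshot for InnoDB tables
        "-u", user,
        "-p",                           # prompt for the password interactively
        f"--result-file={out_file}",
        db_name,
    ]


def distcp_command(source_nn, target_nn,
                   warehouse_dir="/user/hive/warehouse",
                   backup_dir="/backup/hive/warehouse"):
    """Build a DistCp command that copies the warehouse to another cluster.

    Both paths are assumptions; check hive.metastore.warehouse.dir on your cluster.
    """
    return [
        "hadoop", "distcp",
        f"hdfs://{source_nn}{warehouse_dir}",
        f"hdfs://{target_nn}{backup_dir}",
    ]


def local_copy_command(warehouse_dir="/user/hive/warehouse", local_dir="/tmp/hive_backup"):
    """Build a command that copies the warehouse from HDFS to the local file system."""
    return ["hdfs", "dfs", "-get", warehouse_dir, local_dir]


if __name__ == "__main__":
    # Print the commands so they can be reviewed before being run by hand.
    for cmd in (
        metastore_dump_command(),
        distcp_command("nn1:8020", "nn2:8020"),   # hypothetical NameNode addresses
        local_copy_command(),
    ):
        print(shlex.join(cmd))
```

Take the metastore dump and the data copy as close together in time as you can, so that the table definitions and the files they point to stay in sync.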


2 REPLIES

@Saransh Sharma

In addition to @Binu Mathew's answer: if you don't want to take separate backups and there are only a few tables to back up, you can also use Hive's Import/Export option.

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport
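
As a rough illustration of the Import/Export route, the sketch below generates the HiveQL statements for a handful of tables. The database name, table names and export directory are made-up examples, and the statements use only the basic `EXPORT TABLE ... TO` and `IMPORT TABLE ... FROM` forms described on the page linked above.

```python
def export_statements(database, tables, export_root):
    """Return HiveQL statements that export each table (metadata + data) to HDFS."""
    root = export_root.rstrip("/")
    stmts = [f"USE {database};"]
    for table in tables:
        # Each table gets its own directory under the export root.
        stmts.append(f"EXPORT TABLE {table} TO '{root}/{table}';")
    return stmts


def import_statements(database, tables, export_root):
    """Return HiveQL statements that re-create each table from its export directory."""
    root = export_root.rstrip("/")
    stmts = [f"USE {database};"]
    for table in tables:
        stmts.append(f"IMPORT TABLE {table} FROM '{root}/{table}';")
    return stmts


if __name__ == "__main__":
    # Hypothetical database, tables and HDFS path, for illustration only.
    tables = ["orders", "customers"]
    print("\n".join(export_statements("sales", tables, "/tmp/hive_export")))
    print("\n".join(import_statements("sales", tables, "/tmp/hive_export")))
```

The printed statements can be saved to a file and run through the Hive CLI or Beeline. The export directories can then be copied to another cluster with DistCp before running the import there.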