Support Questions

Find answers, ask questions, and share your expertise

Hive databse backup

avatar

I want to take a backup of my hive database (metadata + data)

Please guide me in how to do it.

Is it possible to do it using distcp?

Also, is it possible to take the backup on a local system?

1 ACCEPTED SOLUTION

avatar

Backup:

1. The Hive Metastore

- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.

- Metastore contains metadata, table partition info, DDL, info about tables

2. The data

- This is in HDFS

- You can use DistCp to copy the data to another cluster

View solution in original post

2 REPLIES 2

avatar

Backup:

1. The Hive Metastore

- This is in an RDBMS. MySQL by default. The RDBMS will have tools for backup.

- Metastore contains metadata, table partition info, DDL, info about tables

2. The data

- This is in HDFS

- You can use DistCp to copy the data to another cluster

avatar
@Saransh Sharma

In addition to @Binu Mathew If you don't want to take separate backups, and there are limited table to take, even you can use Hive Import/Export option.

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport