Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Which is best method for taking backup of hbase data?

avatar

Hi,

Can anyone suggest me which is best method for taking backup of hbase data among distcp, copyTable, export/import, cluster replication?

1 ACCEPTED SOLUTION

avatar
Master Guru

Hi @Rushikesh Deshmukh, for a list of backup options check this. CopyTable is a nice option, using multiple mappers, you can copy individual tables to the same or another cluster. You can miss a few edits but you will end up with a useful copy.

View solution in original post

12 REPLIES 12

avatar

@Predrag Minovic, thanks for sharing this useful information.

avatar
Expert Contributor

Hi @Rushikesh Deshmukh

The following table provides an overview for quickly comparing these approaches, which I’ll describe in detail below.

http://blog.cloudera.com/blog/2013/11/approaches-to-backup-and-disaster-recovery-in-hbase/

i used distcp as well but that did not work for me , in the sense data was copied but while running hbck i had issue

if you want to create backup on same cluster then copytable and sanpshot are very easy

for inter cluster snapshot works good

let me know if you need more details


hbase-data-backup.png

avatar
Expert Contributor

Hi @Rushikesh Deshmukh

The following table provides an overview for quickly comparing these approaches, which I’ll describe in detail below.

http://blog.cloudera.com/blog/2013/11/approaches-to-backup-and-disaster-recovery-in-hbase/

i used distcp as well but that did not work for me , in the sense data was copied but while running hbck i had issue

if you want to create backup on same cluster then copytable and sanpshot are very easy

for inter cluster snapshot works good

let me know if you need more details

Also this below link is really very useful and clear

http://hbase.apache.org/0.94/book/ops.backup.html