Support Questions
Find answers, ask questions, and share your expertise

Data Migration between Hbase across major versions (1.x to 2.x) across data centers

Highlighted

Data Migration between Hbase across major versions (1.x to 2.x) across data centers

New Contributor

Hi,


We are using a new Datacenter and have to move the data from old cluster to new cluster.

Cluster
Old
New
HDFS
2.7.1.2.3
3.1.1.3.1
Hbase
1.1.2.2.3.2.0
2.0.0.3.1
Zookeeper
3.4.6.2.3
3.4.9.3.1
OS
Debian 7 (Wheezy)
Debian 9 (Stretch)


Since the major versions are different, I only know of one option for data migration: CopyTable. (+ SyncTable)

Are there any challenges to keep note of, since we have different OS.


Is there a way to online replicate the data?

We have around 40 tables and around 1TB of data for each. How many copy table operations can be run parallel?


Thank you

2 REPLIES 2
Highlighted

Re: Data Migration between Hbase across major versions (1.x to 2.x) across data centers

New Contributor

Wonderful question, too bad noone answered this. We are handling something similar (upgrading to HDP3.1.5 from HDP2.6.5 in diffrent cluster - HBase 1.1.2 to Hbase 2.0.2). We asked this through Cloudera support and offered us same solution  - copyTable / syncTable but this can't be run from destination cluster (since the source cluster is in current production) so we're looking at snapshots solution, but we still need to identify if is feasible and the challenges implied...

Any inputs from your experience? did you managed to do this migration/upgrade?

Thank You!

Re: Data Migration between Hbase across major versions (1.x to 2.x) across data centers

Contributor

@ValiD_M Did you find any solution for this?