Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

Joining tables in hbase

Hi guys,

I have table employer like below in hbase

1 column=personal_data:age, timestamp=1472494356226, value=28

1 column=personal_data:name, timestamp=1472494329458, value=Kishore

2 column=personal_data:age, timestamp=1472494969330, value=35

2 column=personal_data:name, timestamp=1472494969330, value=Gopi


I have another table Department like below in hbase


1 column=personal_data:age, timestamp=1472494356333, value=28

1 column=personal_data:department, timestamp=1472494329433, value=Admin

2 column=personal_data:age, timestamp=1472494969345, value=35

2 column=personal_data:deparment, timestamp=1472494962340, value=IT


Can we join these two tables in hbase without using spark-scala and without loading this in to hive (since we can join these loading in to hive)

I want to join these two tables in hbase only. Can we achieve this ??????

Any suggestions would be appreciated.



1 ACCEPTED SOLUTION

Super Collaborator

A join itself is not implemented by HBase:

see here: http://hbase.apache.org/0.94/book/joins.html or https://community.hortonworks.com/questions/29295/hbase-for-joins.html

You'll have to consider alternatives, spark-scala is one of it. Normally you don't store tables in any way normalized in Hbase.

View solution in original post

1 REPLY 1

Super Collaborator

A join itself is not implemented by HBase:

see here: http://hbase.apache.org/0.94/book/joins.html or https://community.hortonworks.com/questions/29295/hbase-for-joins.html

You'll have to consider alternatives, spark-scala is one of it. Normally you don't store tables in any way normalized in Hbase.

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.