Welcome to the Cloudera Community

Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Who agreed with this topic

What is the best way to use Sqoop to import tables into Hive when these have no primary key?

avatar
New Contributor

Hi! I'm new to Hadoop, I've just started to learn about its ecosystem and all the tools it has.

 

Currently, I'm writing a batch script to migrate a source database into Hive. I want it to copy as much data as possible, and that includes tables that lack a primary key (like n-to-n relations). I don't mind if I have to create a new table with its own primary key in the process.

 

What would be the best procedure to do so? In case Sqoop and Hive are not the best tools for such a job, should I consider something else? I'll be grateful of any advice I can get.

Who agreed with this topic