Welcome to the Cloudera Community

Bozz · ‎10-16-2013

Hi! I'm new to Hadoop, I've just started to learn about its ecosystem and all the tools it has.

Currently, I'm writing a batch script to migrate a source database into Hive. I want it to copy as much data as possible, and that includes tables that lack a primary key (like n-to-n relations). I don't mind if I have to create a new table with its own primary key in the process.

What would be the best procedure to do so? In case Sqoop and Hive are not the best tools for such a job, should I consider something else? I'll be grateful of any advice I can get.

Cloudera Community

Welcome to the Cloudera Community

Who agreed with this topic

What is the best way to use Sqoop to import tables into Hive when these have no primary key?