Created 10-16-2013 08:37 AM
Hi! I'm new to Hadoop and have just started learning about its ecosystem and the tools it offers.
Currently, I'm writing a batch script to migrate a source database into Hive. I want it to copy as much data as possible, and that includes tables that lack a primary key (like n-to-n relations). I don't mind if I have to create a new table with its own primary key in the process.
What would be the best way to do this? If Sqoop and Hive are not the right tools for the job, should I consider something else? I'd be grateful for any advice.