Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Is there a proper way to keep data in Hive up to date?

Is there a proper way to keep data in Hive up to date?

I am sqooping tables to Hive to prepare data for analysis with HDP tools, but I am unsure what the best method of keeping the data up to date is.

  • I saw sqoop had incremental imports, but that didn't seem like an automated solution.
  • Considered using "Change Data Capture" with a Nifi flow and streaming the records to Hive.

What other options are there? Am I missing anything major here? Thanks in advance.

Don't have an account?
Coming from Hortonworks? Activate your account here