Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

detec/remove duplicates by spark

Highlighted

detec/remove duplicates by spark

I am newbie to this field and here are several things i am interested in:

  • I want to know how can i detect duplicates when hive table is updated by another process?
  • after removing flowfile can i automcaticaly insert data into views through start job?
  • Is there any way i can start spark job through nifi, i am useing nifi 1.3.9?
Don't have an account?
Coming from Hortonworks? Activate your account here