Created on 04-02-2019 06:20 AM - edited 09-16-2022 07:17 AM
Hi all.
We need to generate unique IDs in our Hadoop cluster during data ingestion.
We have parallel processes ingesting data from different sources into Hive tables, and we'd like a unique ID for each data row inserted.
I understand ZooKeeper offers unique ID generation for distributed scenarios.
Please help with how to do this; I can't find a sample or any documentation.
Also, please let me know if there is a better distributed unique ID generator in the Cloudera environment.
Thanks
Created 05-08-2019 07:33 PM
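One approach that avoids a round trip to a coordinator for every row is a Snowflake-style generator: each ingest process builds 64-bit IDs from a timestamp, its own worker ID, and a per-millisecond counter. The following is only a minimal sketch under assumptions, not a Cloudera or ZooKeeper API: the class name `SnowflakeLikeIdGenerator`, the bit layout, and the custom epoch are all hypothetical choices, and it assumes every parallel process is given a distinct `worker_id` (which could, for example, be handed out once at startup through ZooKeeper sequential znodes or through static configuration).

```python
import threading
import time


class SnowflakeLikeIdGenerator:
    """Hypothetical sketch: 41 bits of milliseconds | 10 bits worker ID | 12 bits sequence."""

    EPOCH_MS = 1546300800000  # 2019-01-01T00:00:00Z, an arbitrary custom epoch (assumption)
    WORKER_BITS = 10
    SEQUENCE_BITS = 12
    MAX_WORKER_ID = (1 << WORKER_BITS) - 1
    MAX_SEQUENCE = (1 << SEQUENCE_BITS) - 1

    def __init__(self, worker_id):
        # Uniqueness across processes relies on worker_id being distinct per process.
        if not 0 <= worker_id <= self.MAX_WORKER_ID:
            raise ValueError("worker_id must be in [0, %d]" % self.MAX_WORKER_ID)
        self.worker_id = worker_id
        self._lock = threading.Lock()
        self._last_ms = -1
        self._sequence = 0

    @staticmethod
    def _now_ms():
        return int(time.time() * 1000)

    def next_id(self):
        with self._lock:
            now = self._now_ms()
            if now < self._last_ms:
                # Clock moved backwards: keep using the last timestamp we issued.
                now = self._last_ms
            if now == self._last_ms:
                self._sequence = (self._sequence + 1) & self.MAX_SEQUENCE
                if self._sequence == 0:
                    # Counter exhausted for this millisecond: wait for the next one.
                    while now <= self._last_ms:
                        now = self._now_ms()
            else:
                self._sequence = 0
            self._last_ms = now
            return (
                ((now - self.EPOCH_MS) << (self.WORKER_BITS + self.SEQUENCE_BITS))
                | (self.worker_id << self.SEQUENCE_BITS)
                | self._sequence
            )


if __name__ == "__main__":
    gen = SnowflakeLikeIdGenerator(worker_id=7)
    print(gen.next_id())
```

With this layout the IDs use at most 63 bits, so they fit in a signed 64-bit column such as a Hive BIGINT, and IDs from a single generator are increasing. The trade-off is that correctness depends on never running two processes with the same worker ID at the same time.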