04-02-2019 06:20 AM
Need to generate unique id's in our hadoop cluster during data ingestion.
We have parallel processes ingesting data from different sources into hive tables, we'd like a unique ID for each data row inserted.
I understand zookeper offers Unique ID generation for distributed scenarios.
Please help with how do we do this, can't find sample of documentation.
Also please let me know If there is a better distributed unique id generator in the cloudera environment
05-08-2019 07:33 PM