Hi all.
Need to generate unique id's in our hadoop cluster during data ingestion.
We have parallel processes ingesting data from different sources into hive tables, we'd like a unique ID for each data row inserted.
I understand zookeper offers Unique ID generation for distributed scenarios.
Please help with how do we do this, can't find sample of documentation.
Also please let me know If there is a better distributed unique id generator in the cloudera environment
Thanks