splitting a field during load

I have a char field containing comma-separated elements. It can hold up to 40 elements, e.g. "1,4,5...40".

I want to load them into HBase, with each value stored as its own column in a column family.

What options do I have?


Re: splitting a field during load

Hi Sami, have you come up with anything? Once a table has been defined, I don't see a way to add a column qualifier without adding a value at the same time.

Is it acceptable to create a table with 40 column qualifiers and fill each one with the corresponding value from the string? If a row has fewer than 40 tokens, populate the remaining cells with a constant, like -1. I've been able to get this working.

So given the sequence "1,2,3,4", the row ends up looking like this:

haganbrian column=f1:c1, timestamp=1528213851119, value=\x00\x00\x00\x01
haganbrian column=f1:c2, timestamp=1528213851119, value=\x00\x00\x00\x02
haganbrian column=f1:c3, timestamp=1528213851119, value=\x00\x00\x00\x03
haganbrian column=f1:c4, timestamp=1528213851119, value=\x00\x00\x00\x04 
haganbrian column=f1:c5, timestamp=1528213851119, value=\xFF\xFF\xFF\xFF
haganbrian column=f1:c6, timestamp=1528213851119, value=\xFF\xFF\xFF\xFF
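
For what it's worth, here is a minimal Python sketch of the split-and-pad step described above. It only builds the qualifier-to-value mapping and does not talk to HBase. The function name `split_to_cells`, the `c1`..`c40` qualifier naming, and the choice of 4-byte big-endian integers are assumptions based on the output shown above, not a definitive implementation.

```python
import struct

MAX_COLUMNS = 40  # upper bound on tokens, taken from the question
FILLER = -1       # constant written to cells that have no token


def split_to_cells(field, family="f1", max_columns=MAX_COLUMNS, filler=FILLER):
    """Map a comma-separated field to {"family:cN": 4-byte big-endian int}.

    Hypothetical helper: the qualifier naming and the integer encoding are
    assumptions chosen to match the sample row above.
    """
    # Split on commas and ignore empty tokens (e.g. a trailing comma).
    tokens = [t.strip() for t in field.split(",") if t.strip()]
    if len(tokens) > max_columns:
        raise ValueError(
            f"expected at most {max_columns} tokens, got {len(tokens)}"
        )

    # Convert to integers, then pad the tail with the filler constant.
    values = [int(t) for t in tokens]
    values += [filler] * (max_columns - len(values))

    # ">i" packs a signed 32-bit integer in big-endian byte order.
    return {
        f"{family}:c{i}": struct.pack(">i", v)
        for i, v in enumerate(values, start=1)
    }


cells = split_to_cells("1,2,3,4")
print(cells["f1:c1"])  # b'\x00\x00\x00\x01'
print(cells["f1:c5"])  # b'\xff\xff\xff\xff'
```

The resulting mapping can then be handed to whichever HBase client you load with, one put per row. If padding every row to 40 cells is not required, the same helper could simply skip the filler step and write only the qualifiers that have a token.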