Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Need to ingest a large XML object and convert to Json for processing

Highlighted

Need to ingest a large XML object and convert to Json for processing

New Contributor

We have a need to ingest a large XML object and store (for legal and regulatory reasons), we will then convert it Josn and store that (for analytics and downstream processing)

We have a difference of opinion on how to store this,

One group think both objects should be stored in the same column family (to reduce the number of column families and Hfiles)


The other group thinks the XML and the Json should be stored in different column familes (as the XML is seldom if ever read and the Json is read daily).


Any advice or best practice documents someone can point us to in order to resolve this?


Thanks