We need to ingest a large XML object and store it (for legal and regulatory reasons). We will then convert it to JSON and store that as well (for analytics and downstream processing).
We disagree on how to store the two objects:
One group thinks both objects should be stored in the same column family (to reduce the number of column families and HFiles).
The other group thinks the XML and the JSON should be stored in different column families (as the XML is seldom, if ever, read and the JSON is read daily).
Can anyone offer advice, or point us to best-practice documentation, that would help us resolve this?