Support Questions
Find answers, ask questions, and share your expertise
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

ORC format and index for payload field


ORC format and index for payload field

New Contributor

Is there a way how I can disable creation of indices in ORC files for certain fields ? For example I don't want to create an index for payload type of fields which contains just base64 encoded content. I am having issues with OOM when working with ORC and unstructured data saved as base64 encoded content in HIVE. Such data varies in size dramatically from few kB to tens of MB and I think that unnecessary indices in ORC files may be the cause of such problem.

Feel free to suggest any ORC file format configuration for payload data, basically BLOB's.