Use gzip input codec on files without .gz extension for hive

We have data that is gzip-compressed but has no file extension. Currently Hive just outputs garbage because it assumes the files are raw text. Is there a table or Hive property we can use to flag this data as gzip-compressed?

This is an interesting solution, but I'm hoping there may be an easier way by now: http://daynebatten.com/2015/11/override-hadoop-compression-codec-file-extension/

HDP: 2.6.4

Hive: 1.2.1000
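
Since the codec is being picked from the file name, one quick sanity check is to confirm from the file contents that the data really is gzip. Below is a minimal Python sketch of that idea, not a Hive or Hadoop setting: it looks at the two-byte gzip magic number (0x1f 0x8b) and decompresses when it is present, ignoring the extension. The helper names `is_gzip` and `read_text` and the sample file name `part-00000` are made up for illustration, and it assumes a local copy of one of the files.

```python
import gzip
import os
import tempfile

# Every gzip stream starts with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def is_gzip(path):
    """Return True if the file starts with the gzip magic number."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def read_text(path, encoding="utf-8"):
    """Read a file as text, decompressing first if the content is gzip.

    The decision is based on the file content, not on its name.
    """
    opener = gzip.open if is_gzip(path) else open
    with opener(path, "rt", encoding=encoding) as f:
        return f.read()


# Demo: write gzip data to a file with no extension (hypothetical name),
# then read it back without relying on the file name.
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, "part-00000")
with gzip.open(sample_path, "wt", encoding="utf-8") as f:
    f.write("1,alice\n2,bob\n")

demo_rows = read_text(sample_path).splitlines()
```

This only shows that content-based detection is straightforward outside of Hive; it does not change how Hive itself chooses a codec for these files.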