Hi
I am using getmerge to combine multiple files like this:
hdfs dfs -getmerge /user/maria_dev/Folder3/* /Folder3/output1.csv
How can I exclude the header of each file? When I upload into hive table, it repeats each header row.
Alternatively, is there a query in Hive to exclude the actual header names? If I join 2 files and upload this into Hive, I have 2 lines of headers, and so on.
When I created my table, I included the following:
TBLPROPERTIES ("skip.header.line.count"="1");
However, this only skips the first line. How can I exclude the rest of the headers?
Thanks