If I want to export data with Sqoop from HDFS to an external destination (Teradata, for example), is there a recommended format for the input files?
AFAIK, the supported formats are:
- Delimited text files
- Sequence files
- ORC files
Are there any performance differences between these input formats?
Thanks