In the case I want to export data, using Sqoop, from HDFS to an external destination (Teradata for example), is there a recommendation regarding the format of the input files?
AFAIK, supported formats are :
Do we observe performance differences between input formats?
Sqoop internally using yarn jobs for extracting data from HDFS. ORC is regarding as better performance for read even with Hive: You can refer to below link for details:
Hope this helps.
Thanks and Regards,
View solution in original post