Support Questions

Find answers, ask questions, and share your expertise

Sqoop : Teradata to HDFS using PARQUET file format not working

avatar
Expert Contributor

I am trying to execute below command in sqoop

sqoop import --connection-manager org.apache.sqoop.teradata.TeradataConnManager --connect jdbc:teradata://***.***.***.***/DATABASE=***** --username ***** --password **** --table mytable --target-dir /user/aps/test2 --as-parquetfile -m 1

Output :

-rw-r--r-- 3 ****** hdfs 0 2016-09-21 12:25 /user/aps/test2/_SUCCESS

-rw-r--r-- 3 ****** hdfs 18 2016-09-21 12:25 /user/aps/test2/part-m-00000

Above output is not in parquet format. If I use com.teradata.jdbc.TeraDriver , it is working. But I have to use org.apache.sqoop.teradata.TeradataConnManager for connection. Please help.

1 ACCEPTED SOLUTION

avatar
Super Guru
@Arkaprova Saha

The reason I think you are not able to import data in Parquet format is because Parquet is not supported by this driver. Please see the following link for supported formats which include Avro and RCfile beside other formats but does not include Parquet (check under section 1.2.1).

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.6/bk_HortonworksConnectorForTeradata/content/...

View solution in original post

3 REPLIES 3

avatar
Expert Contributor

Is there any update. Please help.

avatar
Super Guru
@Arkaprova Saha

The reason I think you are not able to import data in Parquet format is because Parquet is not supported by this driver. Please see the following link for supported formats which include Avro and RCfile beside other formats but does not include Parquet (check under section 1.2.1).

https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.6/bk_HortonworksConnectorForTeradata/content/...

avatar
Expert Contributor

@mqureshi Thanks for your quick response.