Support Questions

Find answers, ask questions, and share your expertise
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

Performance of a Sqoop import

New Contributor

What decides the speed of Sqoop operation? Sqoop runs on the Edge node, isn' it? Then whether the number of data nodes in a cluster has any impact on the performance of a Sqoop import?



@joe sek

There are a couple of factors, it has best been summarized in this HCC document SQOOP Performance tuning

Hope that helps


There are some factors which are affecting performance of sqoop import.

1. Network Bandwidth between source server (Oracle, MySQL, PostgreSQL) and destination server (Datanodes).

2. Source server's connection policy for clients.

3. CPU, RAM, DISK Performance listing of servers.

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.