I have a requirement to SQOOP out almost 1TB of data out from Cluster to SQL server. I am finding it time consuming to SQOOP out the same using sqlJDBC4.jar driver. IT tool me something like 12hrs for 137GB of data. I am looking for some bulk insert options. Would appreciate if any one can provide pointers to the same.
The Sqoop1 CLI supports the "--batch" option with translates into JDBC addBatch(). The implementation of addBatch (if and/or how) is completely up to the RDBMS vendor, Sqoop1 really does not have any control over it other than the ability to turn it on/off with --batch.
Table 29. Export control arguments:
--batch Use batch mode for underlying statement execution.
Sqoop1 does support MS SQL Server direct options however there does not seem to be anything specific to bulk load.
25.3. Microsoft SQL Connector
Is there a specific option that you are looking for that you could provide a MS SQL Server documentation reference to that we could review further?