We'd like to test Kudu and need to import data. Sqoop seems like the right tool. I have found references saying you can import into Kudu, but no specifics. Is there any way to import into Kudu using Sqoop?
I have seen people create Kudu connectors for Sqoop, but these are not supported by Cloudera. The main reason is likely that Sqoop is fundamentally designed for writing to HDFS.
If you are limited to CDH, you could first use Sqoop to get the data into your cluster, and then use an extra step to load it into Kudu.
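As a rough illustration of that two-step route, here is a minimal Python sketch that only builds the commands and does not run them. It assumes Sqoop lands the table in HDFS as Parquet, that Impala is available to copy the staged rows into Kudu, and that the Kudu table already exists. The host names, paths, table names, and column types are all hypothetical placeholders, and the helper functions are invented for this example.

```python
import shlex


def sqoop_import_cmd(jdbc_url, username, password_file, table, target_dir, mappers=4):
    """Build a `sqoop import` command that lands one table in HDFS as Parquet."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,
        "--target-dir", target_dir,
        "--as-parquetfile",
        "--num-mappers", str(mappers),
    ]


def impala_load_sql(staging_table, kudu_table, staging_dir, columns):
    """Return Impala statements that expose the staged files and copy them into Kudu.

    `columns` maps column names to Impala types; the Kudu table is assumed to exist.
    """
    col_defs = ", ".join(f"{name} {typ}" for name, typ in columns.items())
    col_list = ", ".join(columns)
    return [
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {staging_table} ({col_defs}) "
        f"STORED AS PARQUET LOCATION '{staging_dir}'",
        f"INSERT INTO {kudu_table} ({col_list}) SELECT {col_list} FROM {staging_table}",
    ]


def impala_shell_cmd(impalad, statement):
    """Wrap one SQL statement in an `impala-shell` invocation."""
    return ["impala-shell", "-i", impalad, "-q", statement]


# Hypothetical values, for illustration only.
steps = [
    sqoop_import_cmd(
        "jdbc:mysql://db.example.com/sales", "etl", "/user/etl/.db-password",
        "orders", "/staging/orders",
    )
]
for stmt in impala_load_sql(
    "orders_staging", "orders_kudu", "/staging/orders",
    {"id": "BIGINT", "amount": "DOUBLE"},
):
    steps.append(impala_shell_cmd("impalad.example.com:21000", stmt))

# Print the commands so they can be reviewed before anyone runs them.
for step in steps:
    print(shlex.join(step))
```

Printing the commands instead of executing them keeps the sketch safe to try anywhere. In a real job you would pass each list to `subprocess.run(..., check=True)` so that a failed Sqoop import stops the Kudu load.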
However, since CDF was released, there are now standard NiFi processors for reading from a database and writing to Kudu, so this would be my recommended solution.
This is an old post, but my problem with NiFi is fault tolerance. NiFi reads data from the source, writes it to Avro, and loads it into Kudu. If a node fails, Sqoop will retry and restart the job on another node. If a NiFi node goes down, you have to bring it back online to get your data back, or offload it manually.