My team and me, tried to send data, from the Dataware/SQLServer to the Datalake/HDP 2.6.5 via the Apache Hive endpoint (through Apache Knox) but the performances are very bad/slow (more than 8 minutes, to push 100 lines in Hive). We tried also to send the same data via the WebHDFS endpoint (through Apache Knox). But in this second scenario, we were unable to make it work.
The simple workflow is following: With this "easy" configuration information (HTTPS and the WebHDFS host = knox.xxxxx.domain.com/gateway/default/webhdfs/v1)
Do you have any idea why this is not working? We have already contacted Microsoft about this, for weeks, without any result. They ask us to contact you (Cloudera support) Microsoft case (can't connect to HDP HDFS thru Hadoop connector - TrackingID#2202230030001219)