
Sqoop with HiveServer2


New Contributor

My understanding is that Sqoop uses HiveServer (the Hive CLI), which does not support authorization. Is there any way to configure Sqoop to use HiveServer2, so it can use Hive authorization?

Any help would be much appreciated!

4 REPLIES

Re: Sqoop with HiveServer2

@Raffi Abberbock

Hive authentication using Kerberos or simple auth is supported in Sqoop 2 starting with version 1.99.4:

https://cwiki.apache.org/confluence/display/SQOOP/Security+Guide+On+Sqoop+2

https://issues.apache.org/jira/browse/SQOOP-1525

NiFi can import into Hive with authentication using the PutHiveQL or PutHiveStreaming processors:

PutHiveQL takes the Kerberos principal and keytab from the HiveConnectionPool.

PutHiveStreaming takes its security options from the hive-site.xml config file.

Documentation for the NiFi processors is at:

https://nifi.apache.org/docs.html

Re: Sqoop with HiveServer2

New Contributor

@cduby Thanks! Would I be able to use Ranger for authorization?

Re: Sqoop with HiveServer2

@Raffi Abberbock Yes. NiFi goes through the Hive server, so it uses whatever setup is configured there. Sqoop 2 is not in the HDP distribution yet.

Re: Sqoop with HiveServer2

Expert Contributor

This is a great question. As @cduby pointed out, it's not available until Sqoop 2. But if you need authorization when loading data into tables, you could load the data directly into HDFS and then use Beeline to move it into the tables, which goes through HiveServer2. You can then apply Ranger policies per user, along with other features. It's a bit of a long way around, but it will get you closer to your audit and authorization requirements.
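For what it's worth, here is a rough sketch of that two-step workaround in Python. It only builds the two command lines (a plain `sqoop import` to an HDFS directory with no `--hive-import`, then a `beeline` call that runs `LOAD DATA INPATH` through HiveServer2) and prints them; it does not run anything. The host names, paths, user, and table name are all made-up placeholders, and it assumes the target Hive table already exists with a matching comma delimiter. On a Kerberized cluster the HiveServer2 URL would also need a `;principal=...` suffix, which is left out here.

```python
import shlex


def sqoop_import_cmd(jdbc_url, username, password_file, table, target_dir):
    """Build a `sqoop import` command that writes delimited files to HDFS only.

    No --hive-import flag is used, so Sqoop never talks to Hive itself.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,
        "--target-dir", target_dir,
        "--fields-terminated-by", ",",
        "--num-mappers", "1",
    ]


def beeline_load_cmd(hs2_url, hive_user, staging_dir, hive_table):
    """Build a `beeline` command that loads the staged files via HiveServer2."""
    hql = f"LOAD DATA INPATH '{staging_dir}' INTO TABLE {hive_table}"
    return ["beeline", "-u", hs2_url, "-n", hive_user, "-e", hql]


if __name__ == "__main__":
    # All values below are hypothetical placeholders.
    steps = [
        sqoop_import_cmd(
            "jdbc:mysql://db.example.com/sales",
            "etl",
            "/user/etl/.db_password",
            "orders",
            "/staging/orders",
        ),
        beeline_load_cmd(
            "jdbc:hive2://hs2.example.com:10000/default",
            "etl",
            "/staging/orders",
            "orders",
        ),
    ]
    for step in steps:
        # shlex.join quotes each argument so the line can be pasted into a shell.
        print(shlex.join(step))
```

Because the second step runs as the connecting user through HiveServer2, whatever authorization is configured there (Ranger policies, for example) would apply to the load rather than to Sqoop.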
