Created on 10-10-2019 03:06 AM - last edited on 10-10-2019 08:47 PM by ask_bill_brooks
I need clarification about Metastore URI configuration of Hive data server. The problem is that in the official cloudera documentation you can see that it is a requirement to block external applications accessing Hive Metastore in order to secure cluster with Sentry:
Block the external applications from accessing the Hive metastore:
ODI is also an external application so its direct access requirement to Metastore contradicts with this requirement.
Why are we setting Metastore URI in Hive data server? What is it used for? If it is a requirement, how can we explain this contradiction?
Another problem is that we are configuring Metastore HA by default in our BDA. However how can we configure ODI Metadata Uri to support metadata HA configuration? Is there any way to write a Uri supporting connecting more than one Metastore? If not, how can we explain this situation?
Created 10-16-2019 08:18 AM
Hello,
Oracle Data Integrator connects to Hive by using JDBC and uses Hive and the Hive Query Language (HiveQL), a SQL-like language for implementing MapReduce jobs. Source - HERE
The points mentioned by you from the documentation is for the purpose of Blocking the external applications and non service users from accessing the Hive metastore.
Since, ODI connects to Hive using JDBC, it should connect to HiveServer2 as described in this documentation. Once connected, the query executed from ODI will connect with HiveServer2. Then, HiveServer2 will connect with HiveMetastore for getting the metadata details of the table against which you are querying and proceed with the execution. It is not necessary for ODI to connect to Hive MetaStore directly.
For details about Hive Metastore HA, please read HERE
Created 10-16-2019 08:18 AM
Hello,
Oracle Data Integrator connects to Hive by using JDBC and uses Hive and the Hive Query Language (HiveQL), a SQL-like language for implementing MapReduce jobs. Source - HERE
The points mentioned by you from the documentation is for the purpose of Blocking the external applications and non service users from accessing the Hive metastore.
Since, ODI connects to Hive using JDBC, it should connect to HiveServer2 as described in this documentation. Once connected, the query executed from ODI will connect with HiveServer2. Then, HiveServer2 will connect with HiveMetastore for getting the metadata details of the table against which you are querying and proceed with the execution. It is not necessary for ODI to connect to Hive MetaStore directly.
For details about Hive Metastore HA, please read HERE