Created 03-10-2017 08:24 PM
Hello,
I have hive client installed on my Linux machine, If I issue a query , would output data of the query flow directly from data nodes to my machine or would it go to hive server ( it's installed on a node other than the data nodes ) and then arrive at my machine?
Thanks in advance.
Created 03-11-2017 01:01 AM
This might be a good read:
Created 03-11-2017 01:01 AM
This might be a good read:
Created 03-14-2017 10:35 PM
Depending on the client the data flow from the query works differently. In case of using Hive CLI, you access the HDFS directly to get the data, and Hive Metastore for table metadata. In case of HiveServer2 client (ex Beeline CLI) all data transmission happens through HiveServer2. HiveServer2 is recommended from security and scalability perspective.