Member since
10-01-2016
10-20-2017
09:20 AM
Hi Sindhu, Thank you very much for taking notice of my question! MapReduce runs fine; the problem is that it generates as many Mappers as there are input files, which adds a big overhead to the processing time. For 50 files it generates 50 Mappers, while Tez generates around 15 (in accordance with the tez.grouping.* parameters that I have defined).
10-19-2017
10:21 PM
One quick question... I've been doing some tests with both Hive on Tez and Hive on MapReduce, reading compressed zip input files from an external table (I am using HDP 2.5.3). With Tez as the execution engine, and by tuning the tez.grouping.* parameters, Hive can merge the compressed input files. With MR as the execution engine, however, I cannot merge the input files using CombineHiveInputFormat together with the mapreduce.input.fileinputformat.split.* parameters. Long story short: Tez can merge zip files, whereas MR cannot? Is there any format that will allow MR to merge the compressed input files? Much appreciated!
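A minimal sketch of the two configurations being compared, for anyone following along. The property names are the standard Hive, Tez, and MapReduce ones; the byte values are illustrative assumptions only, not the values used in the tests described above.

```sql
-- Hive on Tez: input splits are grouped according to tez.grouping.*
SET hive.execution.engine=tez;
-- 256 MB lower bound per grouped split (illustrative value)
SET tez.grouping.min-size=268435456;
-- 1 GB upper bound per grouped split (illustrative value)
SET tez.grouping.max-size=1073741824;

-- Hive on MapReduce: attempt to combine input files into fewer splits
SET hive.execution.engine=mr;
SET hive.input.format=org.apache.hadoop.hive.ql.io.CombineHiveInputFormat;
-- Same illustrative sizes as above
SET mapreduce.input.fileinputformat.split.maxsize=1073741824;
SET mapreduce.input.fileinputformat.split.minsize=268435456;
SET mapreduce.input.fileinputformat.split.minsize.per.node=268435456;
SET mapreduce.input.fileinputformat.split.minsize.per.rack=268435456;
```

The comments sit on their own lines on purpose, since trailing comments after a SET statement are not handled consistently across Hive clients.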
10-08-2016
09:31 PM
Just to give some closure and possibly help someone who may have the same problem... I was able to start HiveServer2 Interactive after removing the service and adding it again, but I still always got the error regarding the LLAP Application. That error went away, though, when I also installed the NodeManager role on the same node where all the Hive services are located. Whether it is related or not, I do not know...
10-03-2016
04:33 PM
First of all, again my sincere thanks for your time looking into this. My small project (it's for my master's thesis) is about comparing a transformation process in an RDBMS against the same process in a Hadoop cluster. I decided to go for HDP 2.5 since I'm completely new to Hortonworks, whereas I have already used Cloudera. If I still have time, I'll use Cloudera again to compare Hive against Impala on my specific problem. I also want to add more DataNodes and see how performance improves. I didn't go for the sandbox because installing something you don't know is always a learning challenge. Of course I'm aware of my limitations, and that's why, after many attempts, I decided to come here for help. For my project the Interactive Hive Server is not very important, since I'm not looking for fast replies to ad hoc selects, even though I'm curious about the results. For now I've disabled the Interactive Server and everything is running well. When you suggest that I reinstall LLAP, is it as simple as activating the Interactive Server again, or do I need to go and manually install the component? Regarding the Cloud option, maybe once I have the whole project implemented and need to run the scalability tests. Again, thank you very much for your time.
10-02-2016
02:50 PM
Hi Artem, Thank you very much for your attention. I've attached the output shown in the Ambari front end. Which other files would be useful?
hs2.txt
10-01-2016
11:19 PM
Hi, I've been trying to set up a small cluster using HDP 2.5 on OL 7.2 with the purpose of running an academic project in Hive, and after some days of clearing errors and alerts I still have one that I can't get rid of. HiveServer2 Interactive reports the following alert:
Service - Hive
Alert Definition Name: LLAP Application
Status: Crit
Response: The application reported a 'NOT RUNNING' state.
Can someone please point me in the direction I should be looking? Thank you very much!