- Subscribe to RSS Feed
- Mark Question as New
- Mark Question as Read
- Float this Question for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
relationship between Hive query and missing blocks on cluster
- Labels:
-
Apache Hive
-
HDFS
Created on ‎05-04-2018 10:37 AM - edited ‎09-16-2022 06:10 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I decommissioned and deleted 2 out of 3 my HDFS data nodes. Although I expected blocks to have been replicated, it had not.
I started getting under replication error on my cluster. I have started HDFS balancer now but hive queries are terribly slow.
Is there some relation between two? Is it because it has to write to three nodes when files are underreplicated?
Created ‎05-07-2018 05:00 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I believe I am already using Beeline. Yes, I tried switching back to MapReduce execution engine but still get the same error. @Geoffrey Shelton Okot I do have hive server 2 up and running on the cluster. Also, I don't find it ideal having to switch to Hive on Spark because of this unidentified issue. Do you mind pointing out what could be the other reasons for interruption on Map Reduce job or if it is possible to escalate it?
Created ‎05-07-2018 07:37 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Can you share the latest version of these 2 files /var/log/hive/*.err and /var/log/hive/*.log

- « Previous
- Next »