Reply
Expert Contributor
Posts: 109
Registered: ‎05-19-2016

relationship between Hive query and missing blocks on cluster

[ Edited ]

I decommissioned and deleted 2 out of 3 my HDFS data nodes. Although I expected blocks to have been replicated, it had not. I started getting under replication error on my cluster. I have started HDFS balancer now but hive queries are terribly slow. Is there some relation between two? Is it because it has to write to three nodes when files are underreplicated?

Announcements