Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

relationship between Hive query and missing blocks on cluster

relationship between Hive query and missing blocks on cluster

Expert Contributor

I decommissioned and deleted 2 out of 3 my HDFS data nodes. Although I expected blocks to have been replicated, it had not. I started getting under replication error on my cluster. I have started HDFS balancer now but hive queries are terribly slow. Is there some relation between two? Is it because it has to write to three nodes when files are underreplicated?

Don't have an account?
Coming from Hortonworks? Activate your account here