Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

HDFS Heterogeneous Storage

HDFS Heterogeneous Storage

Rising Star

I have some queries about HDFS Heterogeneous Storage on CDH 5.8 (http://www.cloudera.com/documentation/enterprise/5-8-x/topics/admin_heterogeneous_storage_oview.html)

 

  1. What hardware is typically used for archival storage? Do NAS / iSCSI make sense?
  2. If so, the [ARCHIVE] storage is a single point of failure. If it is really down, for Warm data (replicas in [ARCHIVE] and [DISK] tiers), will HDFS create blocks in [DISK] to maintain the replication factor, or leave it under-replicated?
  3. I see some references suggest using dedicate node with limiting compute power (e.g., https://hadoop.apache.org/docs/r2.7.2/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html) to manage all [ARCHIVE] data. What's the advantage ?