New Contributor
Posts: 2
Registered: ‎02-04-2016

Node Labels

Node labels were released in Apache Hadoop 2.6. Is this feature enabled in CDH? Where can I find documentation on configuring it?





Cloudera Employee
Posts: 260
Registered: ‎01-16-2014

Re: Node Labels

Node labels are not considered production-ready by Cloudera or even by the upstream community. The basis for node labels was added in Hadoop 2.6 with a large number of limitations. The only scheduler that currently implements node label support is the CapacityScheduler; none of the other schedulers support it yet. Cloudera recommends, for a number of reasons, that you use the FairScheduler in your cluster.


Setting up node labels is partially supported through the command line interface, but it still requires manual steps and configuration. Support for labels is also limited to one (1) label per YARN application. Using labels requires you to add them on the command line when an application is submitted. In the current release, MapReduce does not yet implement any node label support (MAPREDUCE-6304).
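
To give a rough idea of the manual configuration involved, here is a minimal sketch in Python that renders the two yarn-site.xml properties the upstream Apache Hadoop documentation lists for turning node labels on. This is not Cloudera guidance. The property names are taken from the upstream docs and should be checked against your Hadoop version; the HDFS store path and the helper name render_properties are hypothetical placeholders.

```python
import xml.etree.ElementTree as ET

# Property names as listed in the upstream Apache Hadoop node label docs.
# The store path is a hypothetical placeholder; substitute your own.
NODE_LABEL_PROPS = {
    "yarn.node-labels.enabled": "true",
    "yarn.node-labels.fs-store.root-dir": "hdfs://namenode:8020/yarn/node-labels",
}


def render_properties(props):
    """Render a dict of settings as Hadoop-style <configuration> XML."""
    root = ET.Element("configuration")
    for name, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Print the fragment so it can be reviewed before being merged by hand.
    print(render_properties(NODE_LABEL_PROPS))
```

Even with these properties in place, the labels themselves still have to be registered by hand (upstream provides yarn rmadmin -addToClusterNodeLabels for this) and then mapped to nodes and to CapacityScheduler queues. The exact syntax for those steps differs between Hadoop versions, so check the documentation for your release.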


Because of their limited implementation, node labels can also cause a large increase in scheduling delays, which makes using them counterproductive. We are working with the community to make node labels ready for production, but they are not there yet.



New Contributor
Posts: 2
Registered: ‎08-31-2017

Re: Node Labels



Is there good documentation on how to configure or set up node labels for Cloudera clusters?