I manage a multi-tenant Hadoop cluster used by various teams. Capacity scheduler is used for Yarn Scheduling. We now want to create a charging model so that we can charge our clients for using the cluster. What are the best practices for creating such a model and what parameters shall be considered to charge our clients for using the cluster services?
While there are chargeback features planned for Ambari, I don't believe there is one out there yet. If you haven't please take a look at the following link. If you are open to looking outside of open source environment, then pepperdata provides chargeback capabilities for Hadoop clusters.