02-26-2018 05:08 AM
I am in the process of setting up a CDH based cluster (Installation Path B - Parcels/Packages) including Impala.
I would like to configure Impala Statestore and Catalog Service appropriately (maybe even on a dedicated host), however I cannot really find any documentation or best practices regarding the resource needs of these services.
For example I do not know how much memory or disk space should I reserve for these services: Based on my understanding they should be of relatively small footprint compared to other big data components, but I am not sure I would be able make any estimation on my own.
Could someone please point me into the right direction?