Is there a common practice for benchmarking in Hadoop?

I would like to performance-tune the data structures and processes in my Hadoop environment. There are many disparate blogs and videos online about benchmarking, but there does not seem to be a consolidated list of best practices. Could you share your step-by-step benchmarking processes and the context behind your approach (for example, performance tuning of Hive queries and tables, and of Spark)?