Is there a common practice for benchmarking in Hadoop?

I would like to performance-tune the data structures and processes in my Hadoop environment. There are many disparate blogs and videos online about benchmarking, but there does not seem to be a consolidated list of best practices. Could you share your step-by-step benchmarking processes and the context behind your approach (for example, performance tuning of Hive queries, Hive tables, or Spark jobs)?