Reply
Highlighted
Explorer
Posts: 14
Registered: ‎08-07-2017

Impala query skew issue

Hello CM community,

 

I'm working on exploring impala query's real behavior on large clusters,I'm trying to reproduce a skew scenario to better understanding the Impala behavior.

 

I found the following article about one of the common skew causes:

 

https://www.cloudera.com/documentation/enterprise/5-2-x/topics/impala_perf_skew.html

 

However, I have some difficulties in reproducing it in the above way. I wonder if the CM community could help me on that. If you could point me to a skew dataset, that'll be really helpful. Or is there other ways of creating the Impala skew problem?

 

Thanks,

Announcements