Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Round Robin Queuing

avatar
Master Guru

I read in this tutorial http://hortonworks.com/hadoop-tutorial/supercharging-interactive-queries-hive-tez/ about

Multi-tenancy with HiveServer2

There could be contentions when multiple users run large queries simultaneously. Processing queries with many containers could lead to lower latency. For this, 3 controls could be put in place:

  • Container re-use timeout
  • Tez split wave tuning
  • Round Robin Queuing setup

What is Round Robin Queuing Setup within Hive/Tez? How would one go about the setup? No details were provided.

1 ACCEPTED SOLUTION

avatar
Master Mentor
@Sunile Manjee

This is related to connection to HS2 in round robin when we have multiple HS2 running and we don't want to hit the same HS2 instance. Link

The whole concept is derived from basic Round Robin algorithm

This leverages the Zookeeper

View solution in original post

3 REPLIES 3

avatar
Master Mentor

@Sunile Manjeethis is an educated guess but we're talking about YARN queues with RoundRobin schedule implementation. So instead of capacity scheduler we would implement another scheduler algorithm. I don't think it's viable unless cluster only dedicated to hive it would be cool to try having two root queues supporting multiple schedulers. http://m.linuxjournal.com/content/how-yarn-changed-hadoop-job-scheduling

So with that idea in mind you world assign an available queue in round robin fashion to each user.

avatar
Master Mentor
@Sunile Manjee

This is related to connection to HS2 in round robin when we have multiple HS2 running and we don't want to hit the same HS2 instance. Link

The whole concept is derived from basic Round Robin algorithm

This leverages the Zookeeper

avatar
Master Mentor