Support Questions

bbukacek · ‎11-23-2015

When using the importtsv tool is there a way to specify a YARN queue? Despite attempting to set a YARN queue it always goes to the 'default' queue.

jstraub · ‎11-23-2015

@Brad Bukacek Jr You could assign a default query to the user that is executing the importtsv statement.

What Scheduler are you using? Capacity or Fair Scheduler?

Here is the configuration for the capacity scheduler:

yarn.scheduler.capacity.queue-mappings

This configuration specifies the mapping of user or group to aspecific queue. You can map a single user or a list of users to queues. Syntax: [u or g]:[name]:[queue_name][,next_mapping]*. Here, u or g indicates whether the mapping is for a user or group. The value is u for user and g for group. name indicates the user name or group name. To specify the user who has submitted the application, %user can be used. queue_name indicates the queue name for which the application has to be mapped. To specify queue name same as user name, %user can be used. To specify queue name same as the name of the primary group for which the user belongs to, %primary_group can be used.

Example:

<property>
   <name>yarn.scheduler.capacity.queue-mappings</name>
   <value>u:user1:queue1,g:group1:queue2,u:%user:%user,u:user2:%primary_group</value>
   <description>
     Here, <user1> is mapped to <queue1>, <group1> is mapped to <queue2>, 
     maps users to queues with the same name as user, <user2> is mapped 
     to queue name same as <primary group> respectively. The mappings will be 
     evaluated from left to right, and the first valid mapping will be used.
   </description>
 </property>

View solution in original post

aervits · ‎11-23-2015

You need to use -Dmapreduce.job.queuename=anotherqueue

just confirmed the following works

hbase org.apache.hadoop.hbase.mapreduce.RowCounter -Dmapreduce.job.queuename=hive1 TestTable

jstraub · ‎11-23-2015

@Brad Bukacek Jr You could assign a default query to the user that is executing the importtsv statement.

What Scheduler are you using? Capacity or Fair Scheduler?

Here is the configuration for the capacity scheduler:

yarn.scheduler.capacity.queue-mappings

This configuration specifies the mapping of user or group to aspecific queue. You can map a single user or a list of users to queues. Syntax: [u or g]:[name]:[queue_name][,next_mapping]*. Here, u or g indicates whether the mapping is for a user or group. The value is u for user and g for group. name indicates the user name or group name. To specify the user who has submitted the application, %user can be used. queue_name indicates the queue name for which the application has to be mapped. To specify queue name same as user name, %user can be used. To specify queue name same as the name of the primary group for which the user belongs to, %primary_group can be used.

Example:

<property>
   <name>yarn.scheduler.capacity.queue-mappings</name>
   <value>u:user1:queue1,g:group1:queue2,u:%user:%user,u:user2:%primary_group</value>
   <description>
     Here, <user1> is mapped to <queue1>, <group1> is mapped to <queue2>, 
     maps users to queues with the same name as user, <user2> is mapped 
     to queue name same as <primary group> respectively. The mappings will be 
     evaluated from left to right, and the first valid mapping will be used.
   </description>
 </property>

nsabharwal · ‎11-23-2015

@Brad Bukacek J

Blog : https://www.linkedin.com/pulse/yarn-queues-hadoop-neeraj-sabharwal

Capacity sheduler view has an option for user mapping

format

Cloudera Community

Support Questions

Importtsv tool - Specify a YARN Queue?