About hosako

hosako · ‎02-11-2016

I think at this moment, controlling priority by capacity and preemption would be the only way to start and finish high priority jobs faster, am I correct? Ideally, I would like to set some priority *per* a job/application, but I found https://issues.apache.org/jira/browse/YARN-1963 . So I guess this is not possible. I also found http://blog.sequenceiq.com/blog/2014/03/14/yarn-capacity-scheduler/ which queue names are "highPriority" and "lowPriority", but if my reading is correct, this is not actually setting any priority but because the high has more capacity, the job finish faster. Until YARN-1963 is released, I would like to always start jobs in highPrority queue before any jobs in lowPriority queue, and if possible, I would like low priority jobs to wait until high priority jobs finishes. Any advice/hint is welcome.

hosako · ‎02-04-2016

If any rule matches, does it stop processing other rules? Stop or not wouldn't be issue in most of the case but in case someone put completely wrong rules...

hosako · ‎02-03-2016

Looks like if I wanted to use HCat, the column names need to match...

hosako · ‎02-02-2016

so... answer is it won't be fixed or middle of restoring process?

hosako · ‎01-25-2016

After reading above, I'm just curious to know what would happen in the following scenario: 1) Create queues (ex: Rack1, Rack2, Rack3...) 2) Create (exclusive=true) Node Labels and assign to queues per my physical rack layout 3) Didn't set up HDFS rack-awareness (so that replication won't care about rack) 4) Submit a job to the queue "Rack1" but all blocks for this data are in DataNodes in different rack (ex: Rack2) Would YARN AM try to create a remote container in a NameNode in Rack2? Or keep using container in Rack1 but fetch the data from a remote DataNode?

hosako · ‎01-21-2016

I read the 2nd link and couldn't find DockerFIle. Did I miss something? I just wanted to compare with mine to improve...

hosako · ‎01-19-2016

I would like to read http://hortonworks.com/kb/how-to-connect-tableau-to-hortonworks-sandbox/ which link is in http://hortonworks.com/hadoop-tutorial/making-things-tick-with-tableau/ Does anyone know what happened to this link?

hosako · ‎01-14-2016

If I don't mind some down time, can I skip decommission / recommission process? I have 12 DataNode locations and 12 disks. I would like to replace only one disk. I can schedule maintenance window.

hosako · ‎01-13-2016

Is there any possibility that this headless keytab is used when spark submits a job (to YARN or hive, maybe?) to identify itself? Not for ambari to start Spark service, maybe?

hosako · ‎01-07-2016

Is "/mnt/hadoop/storm" or "hadoop/storm" decided by the value of "storm.local.dir" parameter?

Online	Offline
Last Visited	‎05-16-2018 07:44 AM

Member Since	‎09-24-2015 11:58 PM
Last Visited	‎05-16-2018 07:44 AM
Posts	144
Kudos received	70

Cloudera Community

Re: How to GET Get Service Definition APIs in HDP ...

Re: How can I enable Kerberos debug logging for Sp...

Re: Install HUE on a separate Node

Re: In HDP 2.3.2 tez.container.max.java.heap.fract...

Re: How to delete/remove storage policy ?

How can I set priority to queues with Capacity Sch...

Re: Auth-to-local Rules Syntax

Re: java.lang.NullPointerException at org.apache....

Re: Missing link for Tableau tutorial

Re: Yarn Node Labels

Re: Docker - Installing HDP using Ambari and Creat...

Missing link for Tableau tutorial

Re: Replacing Disks on Datanode Hosts

Re: Why does Ambari use headless keytab for Spark ...

Re: Solutions for Storm Nimbus Failure