Member since: 09-24-2015

144 Posts · 72 Kudos Received · 8 Solutions

        My Accepted Solutions
| Title | Views | Posted | 
|---|---|---|
| | 1805 | 08-15-2017 08:15 AM |
| | 7335 | 01-24-2017 06:58 AM |
| | 2170 | 08-03-2016 06:45 AM |
| | 3957 | 06-01-2016 10:08 PM |
| | 3254 | 04-07-2016 10:30 AM |

03-02-2016 09:37 AM · 1 Kudo

If you shut down the OS, all tasks running on that node will be stopped too, so you don't need to worry about recovery. You might kill the application masters running on that node, though. There is no graceful shutdown of a NodeManager that waits for running applications to finish as of yet (AFAIK; if someone knows better, let me know). YARN depends on applications to handle task or AM failures gracefully. https://issues.apache.org/jira/browse/YARN-914
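If you do want to take the NodeManager down yourself before the OS shutdown, a minimal sketch could look like this (the HDP-style path and the yarn service user are assumptions; adjust for your install):

```
# Stop the NodeManager daemon before shutting the OS down
# (assumed HDP 2.x layout and 'yarn' service user)
su -l yarn -c "/usr/hdp/current/hadoop-yarn-nodemanager/sbin/yarn-daemon.sh stop nodemanager"

# Then shut down the OS; any containers/AMs still on this node are stopped
# either way, and YARN expects the applications to recover from that.
shutdown -h now
```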
						
					
08-12-2016 05:31 AM

Now, to set up the host and install HDP, all you need is "./start_hdp.sh -a".
It automatically sets up the latest HDP on your Ubuntu 14.04 box (16.04 is not supported).
For access, it starts a proxy on port 28080, so you can change your browser's proxy setting to use Ubuntu_IP:28080.
Or, if the Ubuntu host and your PC are on the same network, just adding a route to the containers works (e.g. "route add -net 172.17.100.0/24 Ubuntu_IP" on your Mac).
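As a rough sketch of the two access options (the placeholders <Ubuntu_IP> and <container_hostname> are mine, and 8080 is just Ambari's usual port):

```
# Option 1: use the proxy the script starts on the Ubuntu host (port 28080);
# shown here with curl, in practice you point the browser's proxy at it
curl -x http://<Ubuntu_IP>:28080 http://<container_hostname>:8080/

# Option 2: if your machine and the Ubuntu host share a network,
# route the container subnet through the Ubuntu host instead
sudo route add -net 172.17.100.0/24 <Ubuntu_IP>      # macOS
sudo ip route add 172.17.100.0/24 via <Ubuntu_IP>    # Linux
```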
						
					
02-11-2016 01:23 AM · 2 Kudos

In the Capacity Scheduler you could set up a high-priority queue at 90% of the cluster with an extension (maximum capacity) to 100%, and a low-priority queue at 10% with an extension (maximum capacity) to 100%. In this case, jobs in the first queue would always get 90% of the cluster if they need it, and the second queue would only get a tiny amount of the cluster whenever the high-priority queue has queries. The low-priority queue would still be able to monopolize the cluster if it has very long-running tasks, but you could fix that with preemption (or by making sure tasks in your cluster don't run for too long, which they shouldn't anyway).
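As a sketch, the corresponding Capacity Scheduler settings could look like this in property=value form (for example in Ambari's capacity-scheduler config); the queue names highprio and lowprio are just illustrative:

```
# Illustrative only: two root queues, 90%/10% guaranteed, both allowed to grow to 100%
yarn.scheduler.capacity.root.queues=highprio,lowprio

yarn.scheduler.capacity.root.highprio.capacity=90
yarn.scheduler.capacity.root.highprio.maximum-capacity=100

yarn.scheduler.capacity.root.lowprio.capacity=10
yarn.scheduler.capacity.root.lowprio.maximum-capacity=100
```

Preemption is typically enabled separately (yarn.resourcemanager.scheduler.monitor.enable=true in yarn-site.xml) and is what lets the high-priority queue claw back capacity from long-running low-priority containers.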
						
					
02-02-2016 07:42 PM · 1 Kudo

So... is the answer that it won't be fixed, or that it's in the middle of the restoring process?
						
					
01-13-2017 12:59 AM

Just one question: AMBARI-12896 won't encrypt/obfuscate the password stored in Ranger's XML file, will it?
						
					
12-16-2015 10:21 PM

Thank you! I will play with it.
						
					
12-10-2015 04:14 AM · 1 Kudo

Hajime, the above scripts are for the YARN container and MapReduce memory settings. If you are trying to configure the memory of the NodeManager process itself, that shouldn't need more than 2 GB to 4 GB. If you are seeing OutOfMemory errors there, I suggest you turn on verbose GC for the NodeManager process and review the GC logs.
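A minimal sketch of what that could look like in yarn-env.sh (JDK 7/8 style GC flags; the heap size and log path are just examples):

```
# yarn-env.sh: size the NodeManager heap and turn on verbose GC logging for it
export YARN_NODEMANAGER_HEAPSIZE=4096   # MB; 2-4 GB is usually plenty for the NM itself
export YARN_NODEMANAGER_OPTS="$YARN_NODEMANAGER_OPTS -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -Xloggc:/var/log/hadoop-yarn/yarn-nodemanager-gc.log"
```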
						
					
11-26-2015 10:33 AM · 2 Kudos

@Hajime This makes sense:

hive.exec.reducers.bytes.per.reducer

Default Value: 1,000,000,000 (about 1 GB) prior to Hive 0.14.0; 256,000,000 (256 MB) in Hive 0.14.0 and later
Added In: Hive 0.2.0; default changed in 0.14.0 with HIVE-7158 (and HIVE-7917)

Size per reducer. The default prior to Hive 0.14.0 is 1 GB; that is, if the input size is 10 GB then 10 reducers will be used. In Hive 0.14.0 and later the default is 256 MB; that is, if the input size is 1 GB then 4 reducers will be used.

Points to note:
- hive.exec.reducers.max should be set to a number that is less than the available reduce slots on the cluster.
- Hive calculates the number of reducers based on hive.exec.reducers.bytes.per.reducer. Consider setting this higher based on the workloads and the demand for reducers on the cluster.

https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties
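For instance, a hedged sketch of overriding this per query from the shell (the table, column, and the cap of 100 are made up for the example):

```
# Illustrative only: 256 MB per reducer plus a cap; with ~1 GB of input this
# plans roughly 1,000,000,000 / 256,000,000 ≈ 4 reducers
hive -e "
  SET hive.exec.reducers.bytes.per.reducer=256000000;
  SET hive.exec.reducers.max=100;
  SELECT col, COUNT(*) FROM some_table GROUP BY col;
"
```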
						
					