Member since
06-10-2016
5
Posts
1
Kudos Received
0
Solutions
07-07-2016
12:23 PM
We're using a capacity scheduler on YARN with several queues. One of the queues is reserved for Spark notebooks (like jupyter/zeppelin). Many of our users leave their notebooks open for days on end. They are not using the resources they claimed (CPU and memory) most of the time. What would be a good configuration for this use case? Is it possible to configure YARN/Spark in such a way that inactive notebooks do not hinder other users?
... View more
Labels:
- Labels:
-
Apache Spark
-
Apache YARN
-
Apache Zeppelin
06-17-2016
07:50 AM
@sseethana: Thanks for the info! Is any of this functionality already available in a released Hadoop distribution of Apache or Hortonworks? If so, is there any documentation or a getting started guide?
... View more
06-15-2016
12:45 PM
@sseethana: You seem to have been working a lot on YARN-3611. Can you give an update on the current efforts and status of running Docker on YARN?
... View more
06-13-2016
08:44 AM
Thanks for the JIRA's Alex! The changes to them are over 1 year old. Do you think DCE will be supported on a kerberized cluster? DCE seems like the way to go for running isolated jobs in production while retaining data locality. Any thoughts on why these JIRA's aren't picked up?
... View more
06-10-2016
01:18 PM
1 Kudo
The hadoop documentation states that DCE does not support a cluster with secure mode (Kerberos): https://hadoop.apache.org/docs/r2.7.2/hadoop-yarn/hadoop-yarn-site/DockerContainerExecutor.html Are people working on this? Is there a way around this limitation?
... View more
Labels:
- Labels:
-
Apache YARN
-
Docker