Member since 10-01-2015
Posts: 3933
Kudos Received: 1150
Solutions: 374
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 3485 | 05-03-2017 05:13 PM |
| | 2875 | 05-02-2017 08:38 AM |
| | 3123 | 05-02-2017 08:13 AM |
| | 3089 | 04-10-2017 10:51 PM |
| | 1578 | 03-28-2017 02:27 AM |
06-16-2016
02:37 AM
1 Kudo
@Chris Nauroth thank you very much, looking forward to your Hadoop Summit sessions.
05-04-2016
08:59 AM
Hi Artem, as we discussed, min.user.id (min_user_id is the name used only in Ambari) and container-executor.cfg are referenced only by LinuxContainerExecutor (LCE). By default, DefaultContainerExecutor is used. More details here (the doc is about Hadoop 2.7.2, but this part applies to 2.7.1 as well). Besides secure clusters, LCE can also be used in non-secure ones to enable CGroups.
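To check which value a node is actually using, a small helper like the one below can read min.user.id out of a container-executor.cfg file. This is only a sketch: the function name `get_min_user_id` is made up for illustration, and it assumes the file uses plain `key=value` lines with no spaces around the `=`.

```shell
# Print the min.user.id value from a container-executor.cfg-style file.
# Assumes plain key=value lines; commented-out lines (#min.user.id=...) are
# ignored because their key does not match exactly.
# The file path is passed in by the caller; no default location is assumed.
get_min_user_id() {
  awk -F= '$1 == "min.user.id" { print $2 }' "$1"
}
```

Call it with the path to the file on your NodeManager host, for example `get_min_user_id /path/to/container-executor.cfg`. Remember that the setting only matters when LCE is the configured executor.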
05-04-2016
01:21 AM
Changing min_user_id in Ambari requires restarting the following: App Timeline Server, NodeManager, ResourceManager and YARN Client. Since I had Oozie enabled, that had stale configs too.
05-12-2016
04:16 PM
Thanks, @Massimiliano Nigrelli, for the information; it is helpful.
05-03-2016
02:30 AM
2 Kudos
I'm a long-time user of Apache Bigtop. My experience with Hadoop and Bigtop predates Ambari. I started using Bigtop with version 0.3. I remember pulling the bigtop.repo file and installing Hadoop, Pig and Hive for some quick development. Bigtop made that convenient and easy. Bigtop has matured since then and there are now multiple ways to deploy it. You can still pull the repo and install manually, but there are better ways now with Vagrant and Docker. I won't rehash how to deploy Bigtop using Docker as it was beautifully described here. Admittedly, I'm running it on a Mac and was not able to provision a cluster using Docker. I did not try on anything other than OS X. This post is about Vagrant. Let's get started:
Install VirtualBox and Vagrant.
Download the 1.1.0 release:
wget http://www.apache.org/dist/bigtop/bigtop-1.1.0/bigtop-1.1.0-project.tar.gz
Uncompress the tarball:
tar -xvzf bigtop-1.1.0-project.tar.gz
Change directory to bigtop-1.1.0/bigtop-deploy/vm/vagrant-puppet-vm:
cd bigtop-1.1.0/bigtop-deploy/vm/vagrant-puppet-vm
Here you can review the README, but to keep it short: edit vagrantconfig.yaml for any additional customization, such as VM memory, OS, number of CPUs, components (e.g. hadoop, spark, tez, hama, solr) and the number of VMs you'd like to provision. This last part is the killer feature: you can provision a sandbox with multiple nodes, not just a single VM. The same is true of the Docker provisioner, but I can't confirm that for you. Feel free to read the README in bigtop-1.1.0/bigtop-deploy/vm/vagrant-puppet-docker for that approach.
Then start provisioning your custom sandbox:
vagrant up
Wait 5-10 minutes, then use standard Vagrant commands to interact with your custom sandbox:
vagrant ssh bigtop1
Now just create your local user's HDFS home directory and off you go:
sudo -u hdfs hdfs dfs -mkdir /user/vagrant
sudo -u hdfs hdfs dfs -chown -R vagrant:hdfs /user/vagrant
For your convenience, add the Bigtop machine(s) to /etc/hosts.
Now, you're probably wondering why you would use Bigtop over the regular Sandbox. Well, the Sandbox has been getting pretty resource heavy and has a lot of components. I like to provision a small cluster with just a few components like hadoop, spark, yarn and pig. Bigtop makes this possible and runs easily within a memory-strapped VM. One downside is that in the latest Bigtop release Spark is at 1.5.0, while the Hortonworks Sandbox is at 1.6.0, and the story is the same with other components. There are version gaps, and if you can look past them, you have a quick way to prototype without much fuss! This is by no means meant to steal thunder from the excellent Ambari quick start guide; it is meant to demonstrate yet another approach from a rich ecosystem of Hadoop tools.
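For the /etc/hosts step, a helper along these lines can generate the entries for a multi-node sandbox. It is a rough sketch, not part of Bigtop: the function name `bigtop_hosts` is hypothetical, the VM names follow the bigtop1 pattern used above, and the address scheme is supplied by you, so check which IPs your VMs actually received before appending anything.

```shell
# Emit /etc/hosts lines for N Bigtop VMs named bigtop1..bigtopN.
# Arguments: VM count, network prefix, first host octet.
# The addresses are placeholders supplied by the caller, not values read
# from Vagrant; verify them against your running VMs.
bigtop_hosts() {
  count=$1
  prefix=$2
  start=$3
  i=1
  while [ "$i" -le "$count" ]; do
    printf '%s.%s\tbigtop%s\n' "$prefix" "$((start + i - 1))" "$i"
    i=$((i + 1))
  done
}
```

For example, `bigtop_hosts 3 10.10.10 11` prints three lines for bigtop1 through bigtop3 starting at 10.10.10.11 (an assumed address range). Review the output first, then append it with `sudo tee -a /etc/hosts`.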
06-02-2016
01:37 PM
1 Kudo
@Artem Ervits, context sharing in Spark just got better with the latest Tech Preview of Zeppelin, which is integrated with Livy: https://hortonworks.com/hadoop-tutorial/apache-zeppelin-hdp-2-4-2/. Livy acts as a job server and also enables multi-user scenarios, allowing users to latch on to an existing session.
05-04-2016
01:47 PM
I was adding the wrong jar resource. Thanks, everyone, for the suggestions. If you want to help further, I'm now encountering this issue: https://community.hortonworks.com/questions/31438/getting-all-nulls-when-selecting-from-a-hive-json.html
02-23-2018
11:50 AM
@Kuldeep Kulkarni Please add the "deploy JCE policies" steps as prerequisites. I tried without JCE and it fails for me. Let me know if I am missing anything.
04-22-2016
02:33 PM
@Francis Apel Awesome! Glad to hear it and thanks for letting everyone know it worked.
07-08-2016
04:27 PM
1 Kudo
@Artem Ervits This is a long-standing bug in Ambari setup. For an external Postgres database, the script is: Ambari-DDL-Postgres-CREATE.sql