Member since
07-06-2016
11
Posts
0
Kudos Received
0
Solutions
06-21-2017
05:44 PM
Hi folks, We have some cloud applications which connect to HA hortonworks backend. They connect to the usual suspects: hdfs, hbase, kafka, zk and the backend is kerberized. The cloud applications run under docker. What is the best practice to connect here? Currently we make each of the cloud hosts part of the ambari cluster and mount the /etc/hadoop/conf config folders into the containers.. adding to the classpath. This seems rather kludgy and not very portable, but it allows the apps to instantiate Configuration(..) for hdfs. Any better recommendations here or other approaches we could take? What I'd be hoping for is better portability i.e.: 1) the docker hosts to be vanilla docker hosts, possibly even in an os that might be less-suited to ambari like e.g. coreOS 2) the ability for us to scale out to nodes in other clouds without having to add them to ambari first 3) Still avail of the HA features of hadoop namenodes. Apologies if this is a dumb question.. thanks Mark.
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Docker
09-06-2016
05:12 PM
To answer the original question, Confluent avro schema registry is probably the gold-standard here I think if it's kafka you are dealing with. There is a docker version if you want to try it quickly. It's not perfect but it's well ahead of the atlas approach for now I think since it does schema validation, compatibility checks, etc. If there is another approach coming out (e.g. atlas-backed soln) it might be nice if the apis were compatible,.. just sayin!
... View more
07-25-2016
04:40 PM
We are using HDP2.4 (for now!). Kafka admin support in ambari is quite nice, but it doesn't cover the ancilliary tools that come with kafka. E.g. mirrormaker, kafka-connect, etc. I haven't downloaded the 2.5 tech preview yet - are there plans to incorporate config and start/stopping of these ancilliary services into ambari?
... View more
Labels:
- Labels:
-
Apache Ambari
-
Apache Kafka
07-13-2016
04:43 PM
that's kafka connect, not kafka streams.. think you can get away with HDP 2.4 for that.
... View more
07-13-2016
04:31 PM
@Sean Byrne if you aren't set on nifi, you could also consider other approaches for this - e.g. kafka-connect (http://kafka.apache.org/documentation.html#connect), camus or gobblin. These are all designed to do exactly what you say - probably kafka connect is the most "official" of them.
... View more
07-07-2016
08:16 AM
Brilliant article - both the answer and the explanation I was looking for - thanks for posting!
... View more
07-07-2016
08:14 AM
Super answer and article, thanks Joy!
... View more
07-07-2016
08:13 AM
thanks Constantin, I have working DNS both ways via designate service.
... View more
07-07-2016
08:12 AM
Joy and Constantin thanks so much for your quick responses. You were exactly right Joy - I am deploying to openstack, and the public_hostname as reported by ambari-agent was the item in my case. I have working DNS with designate so the hosts file entries should not have been necessary (I was trying to avoid this to more easily scale the cluster). Thanks a lot again, delighted to get this sorted!
... View more
07-06-2016
04:50 PM
I am scripting an ambari install to openstack and it works great, however one annoying aspect is the quicklinks are retrieving the cloud-init domain name despite the fact that it's been changed everywhere - kernel, etc/hostname, hostnamectl, etc. I.e. the links use the domain "novalocal" i.e. storm-server.novalocal:8772 instead of storm-server.mydomain.com:8772. Ambari is retrieving the correct domain name elsewhere, just the links are wrong in the ui. Any idea how it's getting the domainname so that I can ensure that it's changed there before I go installing ambari?
... View more
Labels:
- Labels:
-
Apache Ambari