Member since
09-18-2015
191
Posts
81
Kudos Received
40
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1993 | 08-04-2017 08:40 AM | |
5311 | 05-02-2017 01:18 PM | |
1067 | 04-24-2017 08:35 AM | |
1080 | 04-24-2017 08:21 AM | |
1296 | 06-01-2016 08:54 AM |
05-09-2022
06:21 AM
Similar to this, I have a use case to compare Ansible Code with the Ambari Configs. The reason we are doing this is that we found several inconsistencies w.r.t to Ansible code and Ambari configs. But comparing both is a big task as there are many playbooks where we have Hadoop code so checking all the code base a heck. Any other option to do the comparison.....
... View more
02-09-2021
01:02 PM
I used the hive-driver to access Kerberos enabled Hive running on HDP (HortonWorks Data Platform, now CDP). It worked well. The sample code and the Hive docker instance helped to get it tested before trying on corporate instance.
... View more
01-04-2021
08:30 AM
@kalhan While it is possible to have a single ZK cluster to support multiple services, It is the recommendation that NiFi have its own dedicated ZK cluster. NiFi cluster stability is dependent on ZK and many of the NiFi processors that can be used depend on on Cluster state which is also stored in ZK. IF ZK becomes overburdened it can affect overall stability and performance of NiFi. If you found any of the answers provided on this query helped you, please select "accept solution" on each of them. Thank you, Matt Hope this helps.
... View more
08-10-2020
06:12 AM
Hi All, is there specific method to follow for installing ambari on python3..any one installed on python3 base
... View more
09-13-2018
04:52 PM
In this episode we welcome Phil Radley, Chief Data Architect at BT to talk about the Big Data deployment at BT. https://roaringelephant.org/2018/09/11/episode-105-big-data-at-british-telecom-with-phillip-radley/
Play in new window | Download (Duration: 1:06:32 — 45.9MB) Phillip Radley (Linkedin) Chief Data Architect @ BT https://home.bt.com/
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
09-13-2018
04:44 PM
In this Big Data News episode, we discuss an article with guidelines
on how you should arrange your data gathering projects with the customer
in mind. Dave brings a matrix of visualization products. https://roaringelephant.org/2018/09/04/episode-104-roaring-news/
Play in new window | Download (Duration: 36:55 — 25.6MB) The five Cs: Five framing guidelines to help you think about building data products.
https://www.oreilly.com/ideas/the-five-cs?utm_medium=social&utm_source=twitter.com&utm_campaign=awareness&utm_content=radar+content The Chartmaker Directory
http://chartmaker.visualisingdata.com/
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
08-30-2018
03:51 PM
Matteo and Sijie from Streamlio reached out to us and let us know
they had an update on Apache Pulsar. It turned out they had a lot to
talk about so we cut the interview in two parts. the first of which was
published in episode 101. Here is the second part with information on
version 2.0 and the future of the Apache Pulsar project. https://roaringelephant.org/2018/08/28/episode-103-apache-pulsar-version-2-0-with-matteo-and-sijie-from-streamlio/
Play in new window | Download (Duration: 43:31 — 30.1MB) The
first subject taken on by Sijie is Pulsar Functions, followed by Matteo
talking about the new schema registry and Topic Compaction. With a new
major version being released, users will probably want to upgrade so we
asked the guys about the upgrade path. The rest of the episode, Matteo
and Sijie share what they can regarding the future Pulsar Roadmap. Matteo Merli (https://www.linkedin.com/in/matteomerli/)
Co-Founder – Software Engineer Sijie Guo (https://www.linkedin.com/in/samuelguo/)
Co-Founder Apache Pulsar (incubating)
https://pulsar.apache.org/ Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
Labels:
08-30-2018
03:49 PM
Big Data News at the end of the summer is not easy to find, but we
did end up with three topics to discuss: from isolating GPUs in Hadoop
3.x to replicating big data (to the cloud) and quick tips from Adam’s
blog. https://roaringelephant.org/2018/08/21/episode-102-roaring-news/
Play in new window | Download (Duration: 22:07 — 15.4MB) First Class GPUs support in Apache Hadoop 3.1, YARN & HDP 3.0
https://hortonworks.com/blog/gpus-support-in-apache-hadoop-3-1-yarn-hdp-3/ Replicating big datasets in the cloud
https://medium.com/hotels-com-technology/replicating-big-datasets-in-the-cloud-c0db388f6ba2 https://dataworkssummit.com/berlin-2018/session/tools-and-approaches-for-migrating-big-datasets-to-the-cloud/ https://www.slideshare.net/Hadoop_Summit/tools-and-approaches-for-migrating-big-datasets-to-the-cloud Quick Tip: The easiest way to grab data out of a web page in Python
https://medium.com/@ageitgey/quick-tip-the-easiest-way-to-grab-data-out-of-a-web-page-in-python-7153cecfca58 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
08-30-2018
03:47 PM
Matteo and Sijie from Streamlio reached out to us and let us know
they had an update on Apache Pulsar. It turned out they had a lot to
talk about so we cut the interview in two parts and here is the first
part where they introduce Apache Pulsar, go in depth on the correct
deployment scaling of a stable Pulsar cluster and clarify Pulsars “at
least once vs exactly once” strategy. Part two will go in more depth on
what’s new. Stay tuned! https://roaringelephant.org/2018/08/14/episode-101-apache-pulsar-update-with-matteo-and-sijie-from-streamlio/
Play in new window | Download (Duration: 1:05:48 — 45.4MB) Matteo Merli (https://www.linkedin.com/in/matteomerli/)
Co-Founder – Software Engineer Sijie Guo (https://www.linkedin.com/in/samuelguo/)
Co-Founder Apache Pulsar (incubating)
https://pulsar.apache.org/ Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
Labels:
08-30-2018
03:43 PM
https://roaringelephant.org/2018/08/07/episode-100-celebrating-our-centennial/ 100
Big Data episodes! We made it, in no small part thanks to our audience:
you are who keeps us going! In this episode we celebrate our centennial
by going over the history of Hadoop releases, highlighting the most
noteworthy events along the way. Join us down the twisty paths of our
memory lanes! Play in new window | Download (Duration: 1:07:19 — 46.5MB) The blockchain related Linkedin post Jhon liked The sources for this episode:
http://hadoop.apache.org/releases.html https://en.wikipedia.org/wiki/Apache_Hadoop Debate over which company had contributed more to Hadoop:
http://hortonworks.com/blog/reality-check-contributions-to-apache-hadoop/ Thank you for being part of the ride and now on to episode 200!
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
... View more
Labels: