Member since
09-18-2015
191
Posts
81
Kudos Received
40
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1060 | 08-04-2017 08:40 AM |
| | 3448 | 05-02-2017 01:18 PM |
| | 700 | 04-24-2017 08:35 AM |
| | 675 | 04-24-2017 08:21 AM |
| | 800 | 06-01-2016 08:54 AM |
09-13-2018
04:52 PM
In this episode we welcome Phil Radley, Chief Data Architect at BT, to talk about the Big Data deployment at BT. https://roaringelephant.org/2018/09/11/episode-105-big-data-at-british-telecom-with-phillip-radley/
Play in new window | Download (Duration: 1:06:32 — 45.9MB) Phillip Radley (Linkedin) Chief Data Architect @ BT https://home.bt.com/
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- roaringelephantpodcast
09-13-2018
04:44 PM
In this Big Data News episode, we discuss an article with guidelines
on how you should arrange your data gathering projects with the customer
in mind. Dave brings a matrix of visualization products. https://roaringelephant.org/2018/09/04/episode-104-roaring-news/
Play in new window | Download (Duration: 36:55 — 25.6MB) The five Cs: Five framing guidelines to help you think about building data products.
https://www.oreilly.com/ideas/the-five-cs?utm_medium=social&utm_source=twitter.com&utm_campaign=awareness&utm_content=radar+content The Chartmaker Directory
http://chartmaker.visualisingdata.com/
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-30-2018
03:51 PM
Matteo and Sijie from Streamlio reached out to us and let us know
they had an update on Apache Pulsar. It turned out they had a lot to
talk about so we cut the interview in two parts, the first of which was
published in episode 101. Here is the second part with information on
version 2.0 and the future of the Apache Pulsar project. https://roaringelephant.org/2018/08/28/episode-103-apache-pulsar-version-2-0-with-matteo-and-sijie-from-streamlio/
Play in new window | Download (Duration: 43:31 — 30.1MB) The
first subject taken on by Sijie is Pulsar Functions, followed by Matteo
talking about the new schema registry and Topic Compaction. With a new
major version being released, users will probably want to upgrade so we
asked the guys about the upgrade path. The rest of the episode, Matteo
and Sijie share what they can regarding the future Pulsar Roadmap. Matteo Merli (https://www.linkedin.com/in/matteomerli/)
Co-Founder – Software Engineer Sijie Guo (https://www.linkedin.com/in/samuelguo/)
Co-Founder Apache Pulsar (incubating)
https://pulsar.apache.org/ Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- Data Ingestion & Streaming
- FAQ
- Kafka
- podcast
- pulsar
- roaringelephantpodcast
08-30-2018
03:49 PM
Big Data News at the end of the summer is not easy to find, but we
did end up with three topics to discuss: from isolating GPUs in Hadoop
3.x to replicating big data (to the cloud) and quick tips from Adam’s
blog. https://roaringelephant.org/2018/08/21/episode-102-roaring-news/
Play in new window | Download (Duration: 22:07 — 15.4MB) First Class GPUs support in Apache Hadoop 3.1, YARN & HDP 3.0
https://hortonworks.com/blog/gpus-support-in-apache-hadoop-3-1-yarn-hdp-3/ Replicating big datasets in the cloud
https://medium.com/hotels-com-technology/replicating-big-datasets-in-the-cloud-c0db388f6ba2 https://dataworkssummit.com/berlin-2018/session/tools-and-approaches-for-migrating-big-datasets-to-the-cloud/ https://www.slideshare.net/Hadoop_Summit/tools-and-approaches-for-migrating-big-datasets-to-the-cloud Quick Tip: The easiest way to grab data out of a web page in Python
https://medium.com/@ageitgey/quick-tip-the-easiest-way-to-grab-data-out-of-a-web-page-in-python-7153cecfca58 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-30-2018
03:47 PM
Matteo and Sijie from Streamlio reached out to us and let us know
they had an update on Apache Pulsar. It turned out they had a lot to
talk about so we cut the interview in two parts and here is the first
part where they introduce Apache Pulsar, go in depth on the correct
deployment scaling of a stable Pulsar cluster and clarify Pulsar’s “at
least once vs exactly once” strategy. Part two will go in more depth on
what’s new. Stay tuned! https://roaringelephant.org/2018/08/14/episode-101-apache-pulsar-update-with-matteo-and-sijie-from-streamlio/
Play in new window | Download (Duration: 1:05:48 — 45.4MB) Matteo Merli (https://www.linkedin.com/in/matteomerli/)
Co-Founder – Software Engineer Sijie Guo (https://www.linkedin.com/in/samuelguo/)
Co-Founder Apache Pulsar (incubating)
https://pulsar.apache.org/ Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- Data Ingestion & Streaming
- FAQ
- Kafka
- podcast
- pulsar
- roaringelephantpodcast
- streaming
08-30-2018
03:43 PM
https://roaringelephant.org/2018/08/07/episode-100-celebrating-our-centennial/ 100
Big Data episodes! We made it, in no small part thanks to our audience:
you are the ones who keep us going! In this episode we celebrate our centennial
by going over the history of Hadoop releases, highlighting the most
noteworthy events along the way. Join us down the twisty paths of our
memory lanes! Play in new window | Download (Duration: 1:07:19 — 46.5MB) The blockchain related Linkedin post Jhon liked The sources for this episode:
http://hadoop.apache.org/releases.html https://en.wikipedia.org/wiki/Apache_Hadoop Debate over which company had contributed more to Hadoop:
http://hortonworks.com/blog/reality-check-contributions-to-apache-hadoop/ Thank you for being part of the ride and now on to episode 200!
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-30-2018
03:36 PM
The
Roaring Elephant podcast was a guest at the Codemotion conference in
Amsterdam a little while ago. This episode contains the audio of the
talk we did on the State of Big Data. https://roaringelephant.org/2018/07/31/episode-99-the-state-of-big-data/
Play in new window | Download (Duration: 45:28 — 31.5MB) Our
talk was definitely light on slideware, but if you want to see the video
cast of our presentation, you can find it on the Codemotion YouTube
channel: Codemotion Amsterdam 2018: The State of Big Data by Roaring Elephant podcast
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-30-2018
03:32 PM
In
this episode of Big Data Roaring News, Dave laments another
announcement of Hadoop’s demise and exposes A.I. imposters. Jhon has
articles comparing Ranger with Sentry and Apache NiFi reaching the ripe
age of 1.7, with a MiNiFi-charged practical demo to prove the point. https://roaringelephant.org/2018/07/24/episode-98-roaring-news/
Play in new window | Download (Duration: 22:16 — 15.5MB) Hadoop’s star dims in the era of cloud object data storage and stream computing
https://siliconangle.com/blog/2018/07/09/hadoops-star-dims-era-cloud-object-data-storage-stream-computing/ The rise of “pseudo-ai” how tech firms quietly use humans to do bots work
https://www.theguardian.com/technology/2018/jul/06/artificial-intelligence-ai-humans-bots-tech-companies Apache Ranger Vs Sentry
https://www.linkedin.com/pulse/apache-ranger-vs-sentry-mythily-rajavelu/ How to build an IIoT system using Apache NiFi, MiNiFi, C2 Server, MQTT and Raspberry Pi
https://medium.freecodecamp.org/building-an-iiot-system-using-apache-nifi-mqtt-and-raspberry-pi-ce1d6ed565bc Apache Nifi Version 1.7.0 released: https://cwiki.apache.org/confluence/display/NIFI/Release+Notes
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-30-2018
03:29 PM
Episode 97 – ODPi: A new world for data governance https://roaringelephant.org/2018/07/17/episode-97-odpi-a-new-world-for-data-governance/ In
this episode, we welcome back John Mertic one more time. It was quite
obvious that John had lots more to talk about at the end of our last
interview with him. ODPi has recently reinvented itself, moving away
from a strict distribution standards body towards data governance and
reference specifications.
Play in new window | Download (Duration: 1:07:57 — 46.9MB) John Mertic Director of Program Management for ODPi, R Consortium, and Open Mainframe Project https://www.linkedin.com/in/jmertic/ ODPi website links:
https://www.odpi.org/ https://www.odpi.org/blog/2018/04/04/the-state-of-open-source-and-big-data-three-years-later https://www.odpi.org/projects/data-governance-pmc https://www.odpi.org/events
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- Atlas
- FAQ
- governance
- Governance & Lifecycle
- help
- roaringelephantpodcast
08-17-2018
01:32 PM
1 Kudo
Episode 96 – Roaring news https://roaringelephant.org/2018/07/10/episode-96-roaring-news/ In
this edition of Roaring news, Ward Bekker returns to discuss what is
happening in the world of Big Data. Ward brings news on GPUs in
supercomputers and how Big Data could be wrong about you. Dave and Jhon
found articles on Big data growth visualizations and GDPR.
Play in new window | Download (Duration: 46:05 — 31.9MB) 10 Charts that will change your perspective of Big Data’s Growth
https://www.forbes.com/sites/louiscolumbus/2018/05/23/10-charts-that-will-change-your-perspective-of-big-datas-growth/#1ea595702926 New GPU-Accelerated Supercomputers Change the Balance of Power on the TOP500
https://www.top500.org/news/new-gpu-accelerated-supercomputers-change-the-balance-of-power-on-the-top500/ GDPR: A Call to Remove Technical Debt from Data Science
https://medium.com/@kjarmul/gdpr-a-call-to-remove-technical-debt-from-data-science-c103a01c3102 Everything big data claims to know about you could be wrong
http://news.berkeley.edu/2018/06/18/big-data-flaws/ Our thanks to Ward for adding some variety to this News episode. Ward Bekker (Linkedin) Pre-Sales Solutions Engineer II @ Hortonworks Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-17-2018
01:30 PM
1 Kudo
Episode 95 – DataWorks Summit in San Jose with Ward Bekker https://roaringelephant.org/2018/07/03/episode-95-dataworks-summit-in-san-jose-with-ward-bekker/ Since neither Dave nor Jhon was able to attend the DataWorks Summit
in San Jose a couple of weeks ago, we have a guest, Ward Bekker, who
was happy to join and educate us on the subject.
Play in new window | Download (Duration: 1:52:50 — 77.7MB) In
this episode we discuss the daily keynotes and Ward’s selection of
sessions at the Summit ranging from the new things in Yarn 3.0,
Materialized views in Hive and much more. Ward Bekker (Linkedin) Pre-Sales Solutions Engineer II @ Hortonworks Some of the sessions and topics discussed are: Apache Hadoop State of the union
https://dataworkssummit.com/san-jose-2018/session/apache-hadoop-yarn-state-of-the-union-2/ What is new in Apache Hive
https://dataworkssummit.com/san-jose-2018/session/what-is-new-in-apache-hive/ Running distributed TensorFlow in production
https://dataworkssummit.com/san-jose-2018/session/running-distributed-tensorflow-in-production-challenges-and-solutions-on-yarn-3-0-2/ Just the sketch: advanced streaming analytics in Apache Metron
https://dataworkssummit.com/san-jose-2018/session/just-the-sketch-advanced-streaming-analytics-in-apache-metron/ Containers and Big Data
https://dataworkssummit.com/san-jose-2018/session/containers-and-big-data/ Catch a hacker in realtime: Live visuals of bots and bad guys
https://dataworkssummit.com/san-jose-2018/session/catch-a-hacker-in-realtime-live-visuals-of-bots-and-bad-guys/ HDFS tiered storage
https://dataworkssummit.com/san-jose-2018/session/hdfs-tiered-storage/ Geospatial data platform at Uber
https://dataworkssummit.com/san-jose-2018/session/geospatial-data-platform-at-uber/ What’s the Hadoop-la about Kubernetes?
https://dataworkssummit.com/san-jose-2018/session/whats-the-hadoop-la-about-kubernetes/
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-16-2018
12:57 PM
Episode 94 – Roaring news https://roaringelephant.org/2018/06/26/episode-94-roaring-news/ In
this week’s edition of Roaring Big Data News, Dave talks about
modernizing Hadoop and a billion Java errors. Jhon has an article on
improving your learning data sets. We finish with a discussion about the
newly released HDP 2.6.5 with an emphasis on the deprecation notices
and Yarn Containers.
Play in new window | Download (Duration: 37:40 — 26.1MB) Dave
Modernizing Hadoop: Reaching the plateau of productivity
https://www.zdnet.com/article/modernizing-hadoop-reaching-the-plateau-of-productivity/ 1 billion Java errors, here’s what causes 97% of them
https://blog.takipi.com/we-crunched-1-billion-java-logged-errors-heres-what-causes-97-of-them/ https://blog.takipi.com/the-top-10-exceptions-types-in-production-java-applications-based-on-1b-events/ Jhon
Why you need to improve your training data, and how to do it
https://petewarden.com/2018/05/28/why-you-need-to-improve-your-training-data-and-how-to-do-it/amp/ Announcing the General Availability of Hortonworks Data Platform (HDP) 2.6.5, Apache Ambari 2.6.2 and SmartSense 1.4.5
https://hortonworks.com/blog/announcing-general-availability-hortonworks-data-platform-hdp-2-6-5-apache-ambari-2-6-2-smartsense-1-4-5/ Component Versions
https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_release-notes/content/comp_versions.html Deprecation Notices
https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.5/bk_release-notes/content/deprecated_items.html YARN Containers
Trying out Containerized Applications on Apache Hadoop YARN 3.1
https://hortonworks.com/blog/trying-containerized-applications-apache-hadoop-yarn-3-1/ Containerized Apache Spark on YARN in Apache Hadoop 3.1
https://hortonworks.com/blog/containerized-apache-spark-yarn-apache-hadoop-3-1/
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-16-2018
12:08 PM
Episode 93 – Apache Kylin: Extreme OLAP Engine for Big Data https://roaringelephant.org/2018/06/19/episode-93-apache-kylin-olap-cubes-in-hadoop/ In
this episode Apache Kylin PMC member Dong Li joins us to explain how Apache
Kylin can deploy Analytical OLAP cubes in your Big Data environment. http://kylin.apache.org/
Play in new window | Download (Duration: 46:14 — 32.0MB) Dong Li Technical Partner & Senior Architect of Kyligence (linkedin) PMC Member of Apache Kylin http://en.kyligence.io/ Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- kylin
- podcast
- roaringelephantpodcast
07-16-2018
02:12 PM
Roaring Elephant Podcast – Episode 92 – Roaring news https://roaringelephant.org/2018/06/12/episode-92-roaring-news/ Another
week, another edition of Roaring Big Data News. This time, Dave talks
about driving teens and Jhon takes a detailed look at an Eventbrite data
pipeline article.
Play in new window | Download (Duration: 46:08 – 31.9MB) Dave
Driver monitoring isn’t just for teens; adults can benefit, too
https://arstechnica.com/cars/2018/05/buicks-smart-driver-explains-why-my-gas-mileage-sucks-and-my-editors-doesnt/ Jhon
Looking under the hood of the Eventbrite data pipeline!
https://www.eventbrite.com/engineering/looking-under-the-hood-of-the-eventbrite-data-pipeline/
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
06-13-2018
11:55 PM
Roaring Elephant Podcast - Episode 91 – ODPi is back and better than ever! https://roaringelephant.org/2018/06/05/episode-91-odpi-is-back-and-better-than-ever/ In
this episode, we welcome back John Mertic, director of Program
Management for ODPi, R Consortium, and the Open Mainframe Project. It’s
been almost two years since we checked in with John and the ODPi
initiative and as John mentions in the interview, a lot has changed in
Hadoop…
Play in new window | Download (Duration: 1:08:00 — 46.9MB) John Mertic Director of Program Management for ODPi, R Consortium, and Open Mainframe Project https://www.linkedin.com/in/jmertic/ ODPi website links:
https://www.odpi.org/ https://www.odpi.org/blog/2018/04/04/the-state-of-open-source-and-big-data-three-years-later https://www.odpi.org/projects/data-governance-pmc https://www.odpi.org/events
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
06-13-2018
11:28 PM
Roaring Elephant Podcast - Episode 90 – Roaring news https://roaringelephant.org/2018/05/29/episode-90-roaring-news/ In
this week’s Roaring News episode, Dave brings up the resilience of
Apache Community open source projects and plays some Doom. Jhon has some
practical Apache NiFi guides and the emergence of multi-model NoSQL
databases.
Play in new window | Download (Duration: 38:09 — 26.4MB) DataWorks Summit Berlin video recordings are up:
https://www.youtube.com/user/HadoopSummit/playlists Find Dave on his Australian road-trip:
http://bit.ly/aus-nz-ibm-hwx-tour Dave
DataTorrent, Stream Processing Startup, Folds (Apache Apex)
https://www.datanami.com/2018/05/08/datatorrent-stream-processing-startup-folds/ DOOM!
https://arxiv.org/abs/1804.09154 https://www.technologyreview.com/s/611072/ai-generates-new-doom-levels-for-humans-to-play/ https://www.youtube.com/watch?v=K32FZ-tjQP4 Bonus doom news: https://www.rockpapershotgun.com/2018/03/28/dodge-fireballs-forever-in-a-neural-nets-doom-nightmare/ https://worldmodels.github.io/ Jhon
Accessing Feeds from EtherDelta on Trades, Funds, Buys and Sells (Apache NiFi)
https://community.hortonworks.com/articles/191146/accessing-feeds-from-etherdelta-on-trades-funds-bu.html?es_p=6741162 NiFi Processing and Flow with Couchbase Server
https://blog.couchbase.com/nifi-processing-flow-couchbase-server/ The new era of the Multi-Model Database
https://www.zdnet.com/article/the-new-era-of-the-multi-model-database/ Seven Databases in Seven Weeks, Second Edition – A Guide to Modern Databases and the NoSQL Movement
https://pragprog.com/book/pwrdata/seven-databases-in-seven-weeks-second-edition
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-25-2018
11:28 AM
https://roaringelephant.org/2018/05/22/episode-89/ With the San Jose edition of the DataWorks Summit only a month away,
we go over the sessions that are available in the agenda today and offer
our top picks. If you’re going, or if you will be watching the replays
online, we hope to guide you on your selection of sessions.
DataWorks Summit San Jose 2018
Play in new window | Download (Duration: 1:12:20 — 49.9MB) And here is the dashboard we created with statistics on the San Jose sessions, for your enjoyment: https://aka.ms/DWS2018SJ The agenda is still in flux so we will be updating the dashboard regularly.
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:57 PM
https://roaringelephant.org/2018/05/15/episode-88-roaring-news/ Returning to our more regular schedule, we have a Roaring News
episode today. Dave has articles on multi-cloud readiness, Big Data
being a pariah, and Google Duplex, while Jhon came up with synthetic data,
data engineers versus data scientists, and a neural network sharing cake recipes.
Play in new window | Download (Duration: 35:07 – 24.4MB) Dave
Less than 10% ready for multi cloud
http://www.cloudpro.co.uk/cloud-essentials/hybrid-cloud/7451/idc-less-than-10-of-organisations-are-ready-for-multi-cloud Tech companies distancing themselves from Big Data
https://qz.com/1262102/tech-companies-are-distancing-themselves-from-big-data/ Google Duplex
https://ai.googleblog.com/2018/05/duplex-ai-system-for-natural-conversation.html Jhon
The Rise of Synthetic Data to Help Developers Create and Train AI Algorithms Quickly and Affordably
https://insidebigdata.com/2018/05/08/rise-synthetic-data-help-developers-create-train-ai-algorithms-quickly-affordably/ Data engineers vs. data scientists
https://www.oreilly.com/ideas/data-engineers-vs-data-scientists?utm_medium=social&utm_source=twitter.com&utm_campaign=awareness&utm_content=radar+content+datascience We asked a neural network to bake us a cake. The results were…interesting.
https://www.popsci.com/neural-network-bakes-a-cake Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:54 PM
https://roaringelephant.org/2018/05/08/episode-87-druid-high-performance-column-oriented-distributed-data-store-part-2/ This is the second part of an interview with Fangjin Yang, co-founder
and CEO at Imply and committer/PMC member for the Druid project. Druid:
a high-performance, column-oriented, distributed data store which has
entered the Hadoop environment with the recent integration with Apache
and, since Druid has been around for a while, we are grateful to FJ
for spending some time with our listeners.
Play in new window | Download (Duration: 31:53 — 22.1MB) Fangjin Yang Cofounder and CEO at Imply (linkedin)
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:51 PM
https://roaringelephant.org/2018/05/01/episode-86-druid-a-high-performance-column-oriented-distributed-data-store-part-1/ This
is the first part of an interview with Fangjin Yang, co-founder and CEO
at Imply and committer/PMC member for the Druid project. Druid: a
high-performance, column-oriented, distributed data store which has
entered the Hadoop environment with the recent integration with Apache
and, since Druid has been around for a while, we are grateful to FJ
for spending some time with our listeners.
Play in new window | Download (Duration: 31:57 — 22.2MB) Fangjin Yang Cofounder and CEO at Imply (linkedin)
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:48 PM
https://roaringelephant.org/2018/04/24/episode-85-dataworks-summit-community-showcase-exhibitor-soundbites/ This
is the final part of our coverage of the DataWorks Summit Berlin 2018.
Normally we would not have had an episode this week, since we were in
Berlin last week, but we had lightning interviews with the vendors in
the Community Expo Area and used that coverage to make this episode.
Play in new window | Download (Duration: 30:34 — 21.0MB) So
less of “Dave & Jhon” and more “ecosystem tech” snippets this time.
Even though this does stray a bit from our usual content, we still hope
it is useful. This was recorded in a hotel room and on the expo
floor so the audio quality is not up to our usual standards, we hope
you’ll forgive us! Here is a timestamped list of the lightning interviews:
- 02:41 Hortonworks https://hortonworks.com/
- 06:28 Alation https://alation.com/
- 08:45 Arcadia Data https://www.arcadiadata.com/
- 11:12 Attunity https://www.attunity.com/
- 13:10 BlueMetrix https://www.bluemetrix.com/
- 15:27 BMW https://www.bmw.com
- 18:04 IBM https://www.ibm.com
- 19:54 Microsoft https://www.microsoft.com
- 22:15 Nutanix https://www.nutanix.com/
- 23:26 Syncsort https://www.syncsort.com
- 24:54 Synerscope http://www.synerscope.com/
- 27:05 Talend https://www.talend.com
- 27:59 Teradata https://www.teradata.com/
- 29:02 Interview End
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:44 PM
https://roaringelephant.org/2018/04/19/episode-84-dataworks-summit-berlin-day-2-recap/ And with the end of day two of the 2018 DataWorks Summit in Berlin
comes the end of this year’s Europe Summit. But never fear, we have an
extra 90 minutes of DataWorks goodness for you to consume on your way
home.
Play in new window | Download (Duration: 1:30:26 — 62.3MB) No
real editing on this one, recording in a hotel room so audio quality
may not be up to our usual standards, we hope you’ll forgive us! Enjoy!
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
05-18-2018
04:42 PM
https://roaringelephant.org/2018/04/18/episode-83-dataworks-summit-berlin-day-1-recap/ Another year, another European DataWorks Summit, and yes, another
daily recap show from Jhon and Dave. We walk through the keynotes and
sessions we attended and give our thoughts and views. This should be
useful for anyone who wasn’t able to attend or those seeking to peek
into sessions they couldn’t make.
Play in new window | Download (Duration: 1:23:45 — 57.8MB) No
real editing on this one, recording in a hotel room so audio quality
may not be up to our usual standards, we hope you’ll forgive us! Enjoy!
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
04-11-2018
10:35 AM
https://roaringelephant.org/2018/04/10/episode-82-dataworks-summit-berlin-2018-preview/ Next
week is DataWorks Summit Berlin week! Your two hosts will be in
attendance and in this episode we go over the agenda and plan which
sessions we want to attend and why. Peppered throughout we add further
insights and experiences from previous years.
Play in new window | Download (Duration: 47:38 — 33.0MB) Unfortunately, Dave’s network was a little unstable and there are a couple audio glitches in this episode. For
some session statistics or if you can use some help deciding what
sessions you want to attend, you can use the dashboard we created:
DWS2018 Berlin dashboard (http://aka.ms/DWS2018) Click the screenshot above or go to http://aka.ms/DWS2018
to access the dashboard. It is a dynamic report: clicking on graph
elements (bars or pie slices) will apply filters on all the
visualizations and the session list. Use control-click to combine
filters. The Summit agenda is still seeing some small changes here
and there. We will try and keep the dashboard up to date, but make sure
you double check with the official agenda! At some point the dashboard will disappear because it is no longer relevant. For future reference, here is a large version of the screenshot.
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
04-11-2018
10:33 AM
https://roaringelephant.org/2018/04/03/episode-81-roaring-news/ In
this installment of Big Data News, we talk about the recent Facebook
leak, how everybody is still doing it wrong (according to some at least)
and installing Hadoop “the old-fashioned way”. Also briefly covered is
Elastic’s X-Pack, now even more “open” than before, but still rather
closed it would seem.
Play in new window | Download (Duration: 26:19 — 18.3MB)
Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
04-09-2018
08:28 AM
Unfortunately I have nothing to do with training; however, I have reached
out to the training team and someone should get back to you.
03-28-2018
08:01 AM
1 Kudo
NOTE: This was recorded before everything kicked off with Facebook and Cambridge Analytica. Interesting timing. https://roaringelephant.org/2018/03/27/episode-80-big-data-tracking/ Last June, Wolfie Christl published a 93 page report Corporate
Surveillance in Everyday Life using big data tracking. Apart from the
massive pdf that can be downloaded on the net, an extensive summary can
be found on the Cracked Labs website. In this episode we go over the content and give our views on the subject. Podcast: Play in new window | Download (Duration: 51:25 – 35.6MB) If you want to follow along with us while we are discussing the different points in the online article, here is the link: http://crackedlabs.org/en/corporate-surveillance Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.
- Find more articles tagged with:
- FAQ
- help
- podcast
- roaringelephantpodcast
08-04-2017
08:44 AM
1 Kudo
Hi there @Arnault Droz could you perhaps provide a link to the sample tutorial you're following? This might help others to see what's going on and help you. Many thanks and good luck!
08-04-2017
08:40 AM
2 Kudos
Hi @Alberto Ramon, three questions in one! Just as a hint, in the future you may get quicker responses if you break your questions down to a single question per post. Anyway, to answer your question, Metastore HA is more of an Active/Standby type pattern. From the documentation: "Failover Scenario: A
Hive metastore client always uses the first URI to connect with the
metastore server. If the metastore server becomes unreachable, the
client randomly picks up a URI from the list and attempts to connect
with that." For more information please look here: https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_hadoop-high-availability/content/ha-hive-use-and-failover.html I would not recommend that you use the Metastore HA outside of its intended usage; there could be unforeseen consequences. Hive Metastore HA is compatible with Ranger and is compatible with Kerberised clusters. Hope that helps!
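To make the failover behaviour quoted above a bit more concrete, here is a rough Python sketch of the selection logic as I read it from the documentation. It is not the actual Hive client code: the function name `pick_metastore_uri`, the `is_reachable` callback and the hostnames are all made up for illustration, and trying the remaining URIs in shuffled order is my assumption about how "randomly picks up a URI" plays out over repeated failures.

```python
import random


def pick_metastore_uri(uris, is_reachable, rng=random):
    """Return the URI a client would end up using, or None if none respond.

    uris         -- ordered list of metastore URIs (hypothetical values)
    is_reachable -- callback standing in for a real connection attempt
    rng          -- source of randomness, injectable for testing
    """
    if not uris:
        return None
    # The client always tries the first URI in the list first.
    if is_reachable(uris[0]):
        return uris[0]
    # Assumption: on failure, try the remaining URIs in random order.
    remaining = list(uris[1:])
    rng.shuffle(remaining)
    for uri in remaining:
        if is_reachable(uri):
            return uri
    return None


if __name__ == "__main__":
    # Hypothetical hosts; 9083 is the usual metastore Thrift port.
    uris = [
        "thrift://ms1.example.com:9083",
        "thrift://ms2.example.com:9083",
    ]
    # Simulate the first metastore being down.
    print(pick_metastore_uri(uris, lambda u: u != uris[0]))
```

The point is simply that every client leans on the first URI while it is up, which is why this behaves like Active/Standby rather than load balancing.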