Member since 
    
	
		
		
		09-18-2015
	
	
	
	
	
	
	
	
	
	
	
	
	
	
			
      
                191
            
            
                Posts
            
        
                81
            
            
                Kudos Received
            
        
                40
            
            
                Solutions
            
        My Accepted Solutions
| Title | Views | Posted | 
|---|---|---|
| 2693 | 08-04-2017 08:40 AM | |
| 6534 | 05-02-2017 01:18 PM | |
| 1422 | 04-24-2017 08:35 AM | |
| 1467 | 04-24-2017 08:21 AM | |
| 1780 | 06-01-2016 08:54 AM | 
			
    
	
		
		
		09-13-2018
	
		
		04:52 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 In this episode we welcome Phil Radley, Chief Data Architect at BT to talk about the Big Data deployment at BT.       https://roaringelephant.org/2018/09/11/episode-105-big-data-at-british-telecom-with-phillip-radley/  
 
 
 
 Play in new window | Download (Duration: 1:06:32 — 45.9MB)     Phillip Radley (Linkedin)  Chief Data Architect @ BT  https://home.bt.com/   
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		09-13-2018
	
		
		04:44 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 In this Big Data News episode, we discuss an article with guidelines 
on how you should arrange your data gathering projects with the customer
 in mind. Dave brings a matrix of visualization products.  https://roaringelephant.org/2018/09/04/episode-104-roaring-news/       
 
 
 
 Play in new window | Download (Duration: 36:55 — 25.6MB)  The five Cs: Five framing guidelines to help you think about building data products.
  https://www.oreilly.com/ideas/the-five-cs?utm_medium=social&utm_source=twitter.com&utm_campaign=awareness&utm_content=radar+content      The Chartmaker Directory
  http://chartmaker.visualisingdata.com/      
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		08-30-2018
	
		
		03:51 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Matteo and Sijie from Streamlio reached out to us and let us know 
they had an update on Apache Pulsar. It turned out they had a lot to 
talk about so we cut the interview in two parts. the first of which was 
published in episode 101. Here is the second part with information on 
version 2.0 and the future of the Apache Pulsar project.  https://roaringelephant.org/2018/08/28/episode-103-apache-pulsar-version-2-0-with-matteo-and-sijie-from-streamlio/     
 
 
 
 Play in new window | Download (Duration: 43:31 — 30.1MB) The
 first subject taken on by Sijie is Pulsar Functions, followed by Matteo
 talking about the new schema registry and Topic Compaction. With a new 
major version being released, users will probably want to upgrade so we 
asked the guys about the upgrade path. The rest of the episode, Matteo 
and Sijie share what they can regarding the future Pulsar Roadmap.   Matteo Merli (https://www.linkedin.com/in/matteomerli/)
  Co-Founder – Software Engineer    Sijie Guo (https://www.linkedin.com/in/samuelguo/)
  Co-Founder    Apache Pulsar (incubating)
  
  https://pulsar.apache.org/       Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
	
					
			
		
	
	
	
	
				
		
	
	
			
    
	
		
		
		08-30-2018
	
		
		03:49 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Big Data News at the end of the summer is not easy to find, but we 
did end up with three topics to discuss: from isolating GPUs in Hadoop 
3.x to replicating big data (to the cloud) and quick tips from Adam’s 
blog.  https://roaringelephant.org/2018/08/21/episode-102-roaring-news/     
 
 
 
 Play in new window | Download (Duration: 22:07 — 15.4MB)  First Class GPUs support in Apache Hadoop 3.1, YARN & HDP 3.0
  https://hortonworks.com/blog/gpus-support-in-apache-hadoop-3-1-yarn-hdp-3/    Replicating big datasets in the cloud
  https://medium.com/hotels-com-technology/replicating-big-datasets-in-the-cloud-c0db388f6ba2  https://dataworkssummit.com/berlin-2018/session/tools-and-approaches-for-migrating-big-datasets-to-the-cloud/  https://www.slideshare.net/Hadoop_Summit/tools-and-approaches-for-migrating-big-datasets-to-the-cloud    Quick Tip: The easiest way to grab data out of a web page in Python
  https://medium.com/@ageitgey/quick-tip-the-easiest-way-to-grab-data-out-of-a-web-page-in-python-7153cecfca58     Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		08-30-2018
	
		
		03:47 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 Matteo and Sijie from Streamlio reached out to us and let us know 
they had an update on Apache Pulsar. It turned out they had a lot to 
talk about so we cut the interview in two parts and here is the first 
part where they introduce Apache Pulsar, go in depth on the correct 
deployment scaling of a stable Pulsar cluster and clarify Pulsars “at 
least once vs exactly once” strategy. Part two will go in more depth on 
what’s new. Stay tuned!  https://roaringelephant.org/2018/08/14/episode-101-apache-pulsar-update-with-matteo-and-sijie-from-streamlio/       
 
 
 
 Play in new window | Download (Duration: 1:05:48 — 45.4MB)  Matteo Merli (https://www.linkedin.com/in/matteomerli/)
  Co-Founder – Software Engineer    Sijie Guo (https://www.linkedin.com/in/samuelguo/)
  Co-Founder    Apache Pulsar (incubating)
  
  https://pulsar.apache.org/       Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
	
					
			
		
	
	
	
	
				
		
	
	
			
    
	
		
		
		08-30-2018
	
		
		03:43 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							
 https://roaringelephant.org/2018/08/07/episode-100-celebrating-our-centennial/  100
 Big Data episodes! We made it, in no small part thanks to our audience:
 you are who keeps us going! In this episode we celebrate our centennial
 by going over the history of Hadoop releases, highlighting the most 
noteworthy events along the way. Join us down the twisty paths of our  
memory lanes!     Play in new window | Download (Duration: 1:07:19 — 46.5MB)       The blockchain related  Linkedin post Jhon liked  The sources for this episode:
  http://hadoop.apache.org/releases.html  https://en.wikipedia.org/wiki/Apache_Hadoop    Debate over which company had contributed more to Hadoop:
  http://hortonworks.com/blog/reality-check-contributions-to-apache-hadoop/     Thank you for being part of the ride and now on to episode 200!   
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
	
					
			
		
	
	
	
	
				
		
	
	
			
    
	
		
		
		08-30-2018
	
		
		03:36 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 The
 Roaring Elephant podcast was a guest at the Codemotion conference in 
Amsterdam a little while ago. This episode contains the audio of the 
talk we did on the State of Big Data.  https://roaringelephant.org/2018/07/31/episode-99-the-state-of-big-data/        
 
 
 
 Play in new window | Download (Duration: 45:28 — 31.5MB) Our
 talk was dfinitely light on slideware, but if you want to see the video
 cast of our presentation, you can find it on the Codemotion youtube 
channel:Codemotion Amsterdam 2018: The State of Big Data by Roaring Elephant podcast   
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		08-30-2018
	
		
		03:32 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							 In
 this episode of Big Data Roaring News, Dave laments another 
announcement of Hadoop’s demise and exposes A.I. imposters. Jhon has 
articles comparing Ranger with Sentry and Apache Nifi reaching the ripe 
age of 1.7 with a Minifi charged practical demo to prove the point.  https://roaringelephant.org/2018/07/24/episode-98-roaring-news/       
 
 
 
 Play in new window | Download (Duration: 22:16 — 15.5MB)  Hadoop’s star dims in the era of cloud object data storage and stream computing
  
  https://siliconangle.com/blog/2018/07/09/hadoops-star-dims-era-cloud-object-data-storage-stream-computing/      The rise of “pseudo-ai” how tech firms quietly use humans to do bots work
  https://www.theguardian.com/technology/2018/jul/06/artificial-intelligence-ai-humans-bots-tech-companies    Apache Ranger Vs Sentry
  https://www.linkedin.com/pulse/apache-ranger-vs-sentry-mythily-rajavelu/    How to build an IIoT system using Apache NiFi, MiNiFi, C2 Server, MQTT and Raspberry Pi
  https://medium.freecodecamp.org/building-an-iiot-system-using-apache-nifi-mqtt-and-raspberry-pi-ce1d6ed565bc  Apache Nifi Version 1.7.0 released: https://cwiki.apache.org/confluence/display/NIFI/Release+Notes      
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
			
    
	
		
		
		08-30-2018
	
		
		03:29 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
				
		
	
		
					
							
 Episode 97 – ODPi: A new world for data governance  https://roaringelephant.org/2018/07/17/episode-97-odpi-a-new-world-for-data-governance/  In
 this episode, we welcome back John Mertic one more time. It was quite 
obvious that John had lots more to talk about at the end of our last 
interview with him. ODPi has recently reinvented itself, moving away 
from a strict distribution standards body towards data governance and 
reference specifications.       
 
 
 
 Play in new window | Download (Duration: 1:07:57 — 46.9MB)   John Mertic  Director of Program Management for ODPi, R Consortium, and Open Mainframe Project  https://www.linkedin.com/in/jmertic/   ODPi website links:
  https://www.odpi.org/  https://www.odpi.org/blog/2018/04/04/the-state-of-open-source-and-big-data-three-years-later  https://www.odpi.org/projects/data-governance-pmc  https://www.odpi.org/events      
 Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		
		
			
				
						
							Labels:
						
						
		
	
					
			
		
	
	
	
	
				
		
	
	
			
    
	
		
		
		08-17-2018
	
		
		01:32 PM
	
	
	
	
	
	
	
	
	
	
	
	
	
	
		
	
				
		
			
					
	
		1 Kudo
		
	
				
		
	
		
					
							
 Episode 96 – Roaring news  https://roaringelephant.org/2018/07/10/episode-96-roaring-news/  In
 this edition of Roaring news, Ward Bekker returns to discuss what is 
happening in the world of Big Data. Ward brings news on GPUs in 
supercomputers and how Big Data could be wrong about you. Dave and Jhon 
found articles on Big data growth visualizations and GDPR.     
 
 
 
 Play in new window | Download (Duration: 46:05 — 31.9MB)  10 Charts that will change your perspective of Big Data’s Growth
  https://www.forbes.com/sites/louiscolumbus/2018/05/23/10-charts-that-will-change-your-perspective-of-big-datas-growth/#1ea595702926    New GPU-Accelerated Supercomputers Change the Balance of Power on the TOP500
  https://www.top500.org/news/new-gpu-accelerated-supercomputers-change-the-balance-of-power-on-the-top500/    GDPR: A Call to Remove Technical Debt from Data Science
  https://medium.com/@kjarmul/gdpr-a-call-to-remove-technical-debt-from-data-science-c103a01c3102    Everything big data claims to know about you could be wrong
  http://news.berkeley.edu/2018/06/18/big-data-flaws/     Our thanks to Ward for adding some variety to this News episode.      Ward Bekker (Linkedin)  Pre-Sales Solutions Engineer II @ Hortonworks  Please use the Contact Form on this blog or our twitter feed to send us your questions, or to suggest future episode topics you would like us to cover.      
						
					
					... View more
				
			
			
			
			
			
			
			
			
			
		 
         
					
				













