Code Repositories
Find and share code repositories
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.
Repo Description

This demo illustrates retail analytics using an online retail dataset containing transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. This dataset is used to demonstrate an end-to-end retail analytic use case on the Hortonworks Data Platform distribution:

  • Data ingestion and cleansing using Apache Pig,
  • SQL on Hadoop using Hive
  • Analytics and visualization using SparkSQL on Apache Zeppelin
  • Market Basket Analysis using SparkMLLib on Apache Zeppelin
Repo Info
Github Repo URL
Github account name zoharsan
Repo name RetailAnalytics
Don't have an account?
Coming from Hortonworks? Activate your account here
Version history
Revision #:
1 of 1
Last update:
‎08-17-2016 02:17 PM
Updated by:
Top Kudoed Authors