Code Repositories
Find and share code repositories
Contributor
Repo Description

This demo illustrates retail analytics using an online retail dataset containing transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store online retail. This dataset is used to demonstrate an end-to-end retail analytic use case on the Hortonworks Data Platform distribution:

  • Data ingestion and cleansing using Apache Pig,
  • SQL on Hadoop using Hive
  • Analytics and visualization using SparkSQL on Apache Zeppelin
  • Market Basket Analysis using SparkMLLib on Apache Zeppelin
Repo Info
Github Repo URL https://github.com/zoharsan/RetailAnalytics
Github account name zoharsan
Repo name RetailAnalytics
1,187 Views