I am walking through HDP Tutorial 2 (hadoop-tutorial-getting-started-with-hdp), Section 6 (data-reporting-with-zeppelin), and I am having trouble getting the query in step 8 (build-charts-using-zeppelin) to finish:
SELECT a.driverid, a.riskfactor, b.city, b.state
FROM riskfactor a, geolocation b where a.driverid=b.driverid
Zeppelin can start and finish both of the following queries quickly and easily:
SELECT b.city, b.state FROM geolocation b
SELECT a.driverid, a.riskfactor FROM riskfactor a
However, I have been running the select from geolocation and riskfactor for over 24 hours, and it still has not completed. I'm running the HDP Sandbox in VMware Workstation on a Dell ultrabook with a 2.3Ghz i5 proc and 16GB RAM. Any ideas why this query isn't completing?
... View more