Brandon Wilson has a great article that shows how to use the "CACHE TABLE" command in Tableau. More recent drivers have since come out, however, and you can now connect directly to the Spark Thrift Server using a Spark SQL driver. This walkthrough uses HDP 2.5 and SimbaSparkOdbc.

First, pull up a Tableau connection and select the Thrift Server. I also had to open port 10015 in VirtualBox.
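Before pointing Tableau at the server, it can help to confirm that the port is actually reachable from the host machine. The following is a minimal sketch, not part of the original setup: it assumes the sandbox VM is forwarded to 127.0.0.1 and that the Thrift Server listens on port 10015 as mentioned above. The helper name `port_is_open` is hypothetical, and a successful TCP connection only shows that something is listening, not that it is the Thrift Server.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection performs the TCP handshake; the socket is closed
        # again as soon as the with-block exits.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable host names.
        return False


if __name__ == "__main__":
    # Assumed values: VM forwarded to localhost, Thrift Server on port 10015.
    host, port = "127.0.0.1", 10015
    state = "reachable" if port_is_open(host, port) else "not reachable"
    print(f"{host}:{port} is {state}")
```

If the check reports that the port is not reachable, the VirtualBox port forwarding rule is a reasonable first thing to look at.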

Next, if you don't have the driver installed, Tableau will send you to a page where you can download a Spark SQL driver. Inside that package, choose this driver.

Once you establish a valid connection, you will see that Tableau flags each connection based on its driver. Below you will see the Hive connection from Brandon's article and now the new Spark connection.