I have data in my Hive database in 2 tables, and I want to connect Tableau to these 2 tables to build my reports.
We have a business requirement, which can't be changed, to truncate the tables quite often and regenerate the reports from the new data. (The key data in a few columns remains the same, but the other columns keep changing and we want to visualize those changes.)
We have a Hortonworks cluster. We used Hive ODBC to connect to the tables, and it all works fine except for the performance.
When we used Spark ODBC and connected through the Spark Thrift server, performance was far better than with Hive ODBC.
But this has a problem: whenever we truncate the tables and load the new data, Tableau fails with the error below:
[Microsoft][SparkODBC] (35) Error from server: error code: '0' error message:
After loading the data again and refreshing the reports, it still refers to one of the OLD HDFS paths of the table data and doesn't work.
Interestingly, I can see the table and query the data with the Hive CLI, through the Hive view in Ambari, and with Hive ODBC in Tableau, but it fails consistently with the above error over the Tableau --> SparkODBC --> SparkThrift --> Hive connection.
I'm quite sure that if we removed the partitioning it would work, but as the data grows, partitioning becomes necessary.
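One thing I have not ruled out is that the Spark Thrift server is holding on to cached metadata (file locations) for the tables. Spark SQL has a REFRESH TABLE statement that invalidates its cached entries for a table, so below is a minimal sketch of what I could run after each reload. This is an assumption on my part, not a confirmed fix: the table names are hypothetical, and cursor stands for any Python DB-API style cursor that is already connected to the Spark Thrift server (how it is obtained is left out).

```python
def refresh_statements(tables):
    """Build one Spark SQL REFRESH TABLE statement per table name."""
    return ["REFRESH TABLE {}".format(name) for name in tables]


def refresh_tables(cursor, tables):
    """Send the refresh statements through a DB-API style cursor.

    `cursor` is assumed to be connected to the Spark Thrift server,
    not to HiveServer2, since the stale paths are seen on the Spark side.
    Returns the statements that were executed, in order.
    """
    statements = refresh_statements(tables)
    for statement in statements:
        cursor.execute(statement)
    return statements


# Hypothetical table names, for illustration only.
TABLES = ["mydb.report_a", "mydb.report_b"]
```

The idea would be to call refresh_tables(cursor, TABLES) right after the truncate and load finish and before refreshing the reports in Tableau.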
Has anyone faced similar problems with SparkODBC? Please share suggestions.