Sort of newbie question. If I have a very large table in HIVE. How can I split that up into smaller tables, aggregate the data and then present that aggregated data into tableau. What is the best approach. Could i split the data needed from the large HIVE table and load it into impala? For faster search results of data ?
I am not quite sure what the end goal is here but for hive tables that have vast amounts of data and if processing them is a concern, you could use partitions on the hive table. You could find some meaning way to divide the data into smaller files (example: by data or some range of values for columns etc).