Created on 09-29-2022 07:34 AM
If you made it this far, you probably already know why you're here. Connecting Dbeaver to an Impala virtual warehouse isn't difficult, but there are a few gotchas that can make it frustrating. So let's conquer those obstacles.
Provision an Impala Virtual Warehouse
Download the JDBC Driver
From your virtual warehouse tile, click on the kebab icon in the upper right. You'll find all sorts of fun options under there, but we're primarily interested in the Download JDBC/ODBC Driver option, which will download the Impala jar to your local machine. You can leave it in your Downloads folder, or move it to wherever you like to store your jars. It will be named similar to this:
impala_driver_jdbc_odbc.zip
You'll need to unzip it, which will create a new folder named impala_driver_jdbc_odbc. Inside that folder will be two additional folders, we're interested in the JDBC folder, named something like this:
ClouderaImpala_JDBC-2.6.23.1028
Within that folder will be the actual Impala drivers, named for JDBC versions 4.1 and 4.2. More info on these versions can be found here. You don't need to unzip these any further.
ClouderaImpalaJDBC41-2.6.23.1028.zip
ClouderaImpalaJDBC42-2.6.23.1028.zip
Copy the JDBC URL
Again, from the kebab icon in your virtual warehouse tile, copy the JDBC URL. This URL has all the necessary information to make the connection, and should be of the form:
jdbc:impala://coordinator-cnelson2-impala-vdw.dw-se-sandboxx-aws.a465-9q4k.cloudera.site:443/default;AuthMech=12;transportMode=http;httpPath=cliservice;ssl=1;auth=browser
Next, we will create a new connection within DBeaver.
Once the connection is created, you can navigate the database, tables, columns, etc, as well as query your data. Congratulations, you did it (and there was much rejoicing).
Tips
You might also be interested in my eerily similar article on connecting Dbeaver to a Hive Virtual Warehouse.