Is there a recommended way to connect a Node.js application to Impala? I cannot find and drivers in npmjs for it.
We are now in 2015, is there any update from Cloudera about this topic ?
Maybe a nodejs module is on its way ?
I'm also looking for the same. If anyone found a solution, please provide a sample application to read data from impala in nodejs app.
I've never used node.js before but there seem to be two options:
1) As Harsh suggested, thrift can be used to generate an impala client. There is a generic thrift example is at https://thrift.apache.org/tutorial/nodejs . The best impala specific example I could find is a python test case but it should be very similar to what would be done in js https://github.com/cloudera/Impala/blob/cdh5-2.3.0_5.5.0/tests/hs2/test_fetch.py#L127 .
2) An easier option may be to use https://www.npmjs.com/package/jdbc + http://www.cloudera.com/content/www/en-us/downloads/connectors/impala/jdbc/2-5-29.html .
I've created a node module (Impala Client) but haven't published yet. I will make some changes then publish it soon. If you need it as module immediately mail me but if you only need the code that gathers data from HDFS using Impala just mimic the code in the test folder. The code in the test.js file is enough, I have tested it and it's working fine.