Created 10-16-2015 12:22 PM
I am trying to install Cloudera live on AWS with Tableau. The stack creation is complete. I see 6 instances running on my account. I did not receive any email with instructuctions on how to access Cloudera. Can someone suggest how I can check if the installation is complete
Mark
Created 10-27-2015 03:27 PM
Created 10-30-2015 08:46 AM
Created 10-20-2015 01:34 PM
Hi Sean:
With Putty:
I am able to connect through Putty with ec2-user as userid. I ran the script and I get an error:
-bash: import-all-tables: command not found
Please explain why I need to change userid,pwd,db_name and mysql driver location. I am using cloudera live on AWS and I want to use the existing databases.
With Tableau:
I am able to connect using Impala but not through Hive. With Impala, when I connect, I don't see any schema.
I am using Hive Server 2 in ODBC configuration. It doesn't connect to the server. Please tell me the userid/pwd to connect through Hive Server 2.
Mark
Created 10-20-2015 01:43 PM
You must have 'sqoop' before 'import-all-tables'. The full command in the tutorial is as follows:
sqoop import-all-tables \ -m 3 \ --connect jdbc:mysql://cloudera1:3306/retail_db \ --username=retail_dba \ --password=cloudera \ --compression-codec=snappy \ --as-parquetfile \ --warehouse-dir=/user/hive/warehouse \ --hive-import
>> Please explain why I need to change userid,pwd,db_name and mysql driver location. I am using cloudera live on AWS and I want to use the existing databases.
I'm not sure what steps in the tutorial you're referring to here.
>> With Impala, when I connect, I don't see any schema.
The Sqoop command will import some data. If you haven't already imported data, you should not see any schema in Impala.
>> Please tell me the userid/pwd to connect through Hive Server 2.
There isn't a password set up for Hive Server 2. You may find this thread helpful: http://community.cloudera.com/t5/Cloudera-Live-End-to-end/Cannot-connect-to-Hive-thru-JDBC-Connectio...
Created 10-20-2015 02:15 PM
Thanks for the reply.
I tried with Sqoop in front and I get the error:
-bash: sqoop: command not found
It looks like I am not in the right environment or the master node I am connecting to doesn't have sqoop installed.
Please check and let me know.
Mark
Created 10-20-2015 02:19 PM
Created 10-20-2015 03:01 PM
Hi Sean,
I appreciate your feedback. With regards to Putty, I am not good at Unix. All I did was to connect to
ec2-52-91-172-186.compute-1.amazonaws.com using mykeypair.ppk. I used ec2-user as the userid. when I used /opt/cloudera/parcels/CDH/bin/sqoop at the prompt, I get an error - no such file or directory. I am not good at Unix and I need your help with commands if you want me to check something.
With regards to Tableau, I tried to create a odbc connection using Cloudera odbc driver for Hive. I did not use JDBC driver. Can you confirm if I should use ODBC or JDBC for connecting to manager node?
Mark
Created 10-20-2015 03:23 PM
Created 10-21-2015 05:13 AM
Hi Sean,
With regards to Sqoop:
I added the new service Sqoop Client 1 and it seems to be running. I went back to putty and ran the script again. I still get the same error.
-bash: Sqoop: Command not found
Is there any other way I can test if Sqoop is running?
Thanks for your help.
Mark
Created 10-21-2015 11:03 AM
Hi Sean,
I think I finally found out where the problem was. I was not connecting to the Manager node in Putty. I just did that and the script is running. I will let you know once it finishes. I am hoping new tables will be created and I can query through hive or impala. Thanks for your help.
I still need to work through Tableau. Please let me know if you find the solution on the right driver/connectivity parameters I should use.
Mark
Created 10-21-2015 01:05 PM
Hi Sean
I could get everything on the server to work and I finished all tutorial exercises.
The only outstanding issue is connectivity from Tableau. I am connecting from a windows machine using remote desktop. I am not sure why I even need odbc driver on my machine. It looks like there is something with connectivity parameters that is not correct. I would appreciate your help.
Mark
Created 10-21-2015 01:13 PM
So the Impala ODBC driver is installed on the Windows server that hosts Tableau Desktop. The Hive ODBC driver is separate. You can download a Windows installer for it here: http://www.cloudera.com/content/www/en-us/downloads/connectors/hive/odbc/2-5-16.html.html.
You can also find out more about Tableau and ODBC drivers here: http://kb.tableau.com/articles/knowledgebase/hadoop-hive-connection