I am trying to install Cloudera Live on AWS with Tableau. The stack creation is complete, and I see 6 instances running on my account. I did not receive any email with instructions on how to access Cloudera. Can someone suggest how I can check whether the installation is complete?
Did you solve the issue Mark? If so, please share the solution in case it can assist others. :)
Thanks for the reply. The Cloudera Live install went through fine. I could connect to all three environments (Hue, Manager, and Navigator). I have tried querying through Hive and Impala, and both work. Now I am trying to use Sqoop to transfer data from MySQL to Hadoop, and I need help with the Manager Node IP:port, user ID, and password. I will be using PuTTY to connect.
Thanks for your reply. I used the IP address (220.127.116.11) and user ID (ec2-user) through PuTTY. I get this error message:
"Disconnected: No supported authentication methods available (server sent: publickey,gssapi-keyex,gssapi-with-mic)"
Can you help me with this issue?
Regarding Tableau, I can access the tool and log into the system. Then I select "Cloudera Hadoop" as the server and enter 18.104.22.168 for the server, with port 10000.
I select "HiveServer" for Type. Authentication is greyed out, so I cannot enter a user ID or password. I click OK and get a window with this error message:
"An error occurred while communicating with the Cloudera Hadoop data source '22.214.171.124'"
I would appreciate it if you could let me know where I am making a mistake in the workflow.
Regarding PuTTY, have you read through EC2's documentation on connecting to Linux instances from Windows? http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-connect-to-instance-linux.html#using-putty. It seems you need to go through a process of converting the .pem file (the key you selected when deploying the CloudFormation template) to a PuTTY-specific .ppk format, and then configure your connection to use that file for authentication.
As for the issue connecting from Tableau, I would recommend you try using the Private IP for the Manager Node to connect to Hive. If you're using the public IP, a bunch of firewall rules get applied, and they will block access to Hive since the service is not secured by default in Live clusters. However, from inside the cluster, all access to private IPs should be open. Also note that Hive Server 2 is running on the Manager Node: this is distinct from Impala (which the Tableau tutorial in Cloudera Live has you connect to), which is running on all of the Worker Nodes instead.
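Before changing anything in Tableau, it may help to confirm that the Hive port is actually reachable from the machine running Tableau. The following is a minimal sketch, not an official Cloudera tool: `can_connect` is a hypothetical helper name, and it only checks that a TCP connection opens. It says nothing about whether the ODBC driver or authentication settings are correct.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds.

    Hypothetical helper for illustration: it tests network reachability only,
    not the Hive protocol, the driver, or credentials.
    """
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

For example, calling `can_connect("<manager-private-ip>", 10000)` from the Tableau instance, with the Manager Node's private IP substituted in, should return `True` if the firewall rules allow the connection. If it returns `False` for the private IP as well, the problem is likely at the network level rather than in Tableau.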
Hope that helps!
Thanks for your reply. I tried using PuTTY and I can connect now. I still need to run the script to move tables from MySQL to HDFS.
With Tableau, I am still getting the same error, shown below. I am using Cloudera Hadoop as the server. I am also using the private IP (10.0.0.81) and left the port at 10000.
Please let me know if I need to make any other changes.
The drivers necessary to connect to this database are not properly installed.
To connect to this database, perform the following steps:
Detailed Error Message:
1. Please refer to https://www.cloudera.com/content/www/en-us/developers/get-started-with-hadoop-tutorial/exercise-1.ht... for ingest (MySQL to HDFS).
It is straightforward: you only need to change the database name, user ID, password, and MySQL driver location, and you should be good.
2. For the ODBC driver: did you install it from the link below?
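For step 1 above, the values that need changing can be seen in one place by assembling the Sqoop command programmatically. This is only a sketch under assumptions: `build_sqoop_import` is a hypothetical helper, the host, database, and user names are placeholders, and the exact options used in the tutorial may differ. The flags shown (`import-all-tables`, `--connect`, `--username`, `-P`, `--warehouse-dir`, `-m`) are standard Sqoop options.

```python
from typing import List


def build_sqoop_import(host: str, database: str, user: str,
                       warehouse_dir: str = "/user/hive/warehouse",
                       port: int = 3306) -> List[str]:
    """Return an argv list for `sqoop import-all-tables`.

    Hypothetical helper for illustration; all argument values are placeholders
    to be replaced with the details of your own MySQL server.
    """
    return [
        "sqoop", "import-all-tables",
        "-m", "1",  # use a single mapper per table
        "--connect", f"jdbc:mysql://{host}:{port}/{database}",
        "--username", user,
        "-P",  # prompt for the password instead of passing it on the command line
        "--warehouse-dir", warehouse_dir,  # parent HDFS directory for imported tables
    ]


if __name__ == "__main__":
    # Placeholder values; substitute your own host, database, and user.
    print(" ".join(build_sqoop_import("my-mysql-host", "my_db", "my_user")))
```

The printed command can be pasted into the PuTTY session on the Manager Node, or the list can be passed to `subprocess.run` if you prefer to launch it from Python.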