Member since
12-06-2022
29
Posts
2
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
1382 | 06-08-2023 11:41 PM |
05-17-2023
03:39 AM
I encountered the same error. After a few days of trying I decided to give up. Just go to Hbase config on Cloudera Manager UI and turn "hbase.security.authentication" to "simple" and "hbase.thrift.security.qop" to "none"
... View more
05-15-2023
05:04 AM
1 Kudo
@quangbilly79 NO, you should not be adding gateway to every node. This gateway should only be installed on the edge/utility nodes, where you give access to external systems and users. These gateway nodes then are able to reach rest of the service(s) nodes.
... View more
05-14-2023
07:53 PM
I successfully installed it on 3 nodes. Normally you only need to install everything on 1 node (things like java, python you have to install on 3 nodes first of course). When go to the CM UI website, you can add another node and Cloudera will automatically install everything for you. In case you want to install things manually. Install all 3 packages "cloudera-manager-daemons", "cloudera-manager-agent", "cloudera-manager-server" on your main node, and for other nodes only install "cloudera-manager-daemons" "cloudera-manager-agent" and start these agent services. After that, you will see that two nodes are "managed" on the CM UI, meaning that you can skip the "Install Agent" step (since you've already installed "cloudera-manager-agent" and start it)
... View more
04-27-2023
08:16 PM
Thank you so much, this work for me!
... View more
03-30-2023
04:31 AM
Hi @quangbilly79 Cloudera will support YARN and Kubernets deployment mode and it will not support Standalone mode (In standalone mode you can access the Spark Master using 7077 port). In order to check which node driver is launched and which node is executor is launched you need to go to Spark UI or Spark History Server UI of that application. From there go to Executors tab. You can see list of executors. In the second table you find executor id. Where the executor id is 'driver' that is the one Driver Node and remaining all are executors.
... View more
02-28-2023
09:04 PM
Hello @quangbilly79 Thanks for using Cloudera Community. The "Spark Master" refers to the Resource Manager responsible for allocating resources. Since you are using YARN, Your Team needs to use "--master yarn". The usage of "--master spark://<IP Address>:7077" is for Spark Standalone Cluster, which isn't the Case for your team. To your Observation concerning the "Driver Instance" & "Worker Instance" being added via "Add Role Instance", there is no such Option as YARN is the Resource Manager, which shall allocate the resources for Spark Driver & Executors. Review [1] for the usage of "--master" as well. Hope the above answers your Team's queries. Regards, Smarak [1] https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit
... View more
01-02-2023
10:03 PM
Hello @quangbilly79 Thanks for using Cloudera Community. Based on your Post, you may consider "Kafka Gateway" as the Client for Kafka, which are setup on the Hosts wherein the same is added as per Cloudera Manager "Assign Roles". A Client/Gateway is familiar with the Service (Kafka in this Case) & all Client/Service Configs are available for the Client/Gateway without any manual intervention. Any changes made to the Service or Client Configs is pushed to the Service/Client Configuration by Cloudera Manager. Imagine a Scenario wherein you wish to run "hdfs dfs -ls" on a HDFS FileSystem. Simply running the Command won't work unless the Host wherein the Command "hdfs dfs -ls" is being run knows the Setup (HDFS FileSystem, NameNode, Port, Protocol). Review [1] for an Example. Adding an HDFS Gateway ensures User doesn't need to manually configure a Client/Gateway with Cloudera Manager doing the needful. Similarly, Kafka Gateway operates. Else, Customer need to manually configure the Client/Gateway Setup. Hope the above answer your query concerning the Gateway Role. Regards, Smarak [1] https://www.ibm.com/docs/en/spectrum-scale-bda?topic=hdfs-clients-configuration
... View more
12-28-2022
07:39 PM
Hi. Our company has already had Kafka and Zookeeper Instances on Cloudera. But it lacks of some useful functions that Confluent has for handling streaming data (mostly from Kafka Connect) so we want to use Confluent with Kafka instance from Cloudera. But I don’t know where to start. I read some guides on the Confluent homepage (link) but it’s for local installation with its own Zookeeper and Kafka. So I wonder is there any way to “integrate” Confluent with our Kafka/Zookeeper from Cloudera? Does anyone has already done this before and shown me how to do it? We're using Cloudera 6.2.0, I think it comes with Apache Kafka 2.1.0
... View more
Labels:
- Labels:
-
Apache Kafka
12-23-2022
09:19 PM
@quangbilly79 It should be /opt/cloudera/parcels/SPARK2/lib/
... View more
12-18-2022
10:41 PM
I encountered the same problem, any solutions?
... View more
- « Previous
-
- 1
- 2
- Next »