About quangbilly79

quangbilly79 · ‎05-17-2023

Where do you get/access to this kind of UI? I'm stuck with the user-group thingy when working with Sentry too.

quangbilly79 · ‎05-17-2023

Turn out there will be two icons if you need to "redeploy client conf" If the blue icon below appears, means you have to tick the "redeploy client conf" button to restart the whole cluster If only this orange icon appears, mean you don't need to do that

quangbilly79 · ‎05-17-2023

I encountered the same error. After a few days of trying I decided to give up. Just go to Hbase config on Cloudera Manager UI and turn "hbase.security.authentication" to "simple" and "hbase.thrift.security.qop" to "none"

steven-matison · ‎05-15-2023

@quangbilly79 NO, you should not be adding gateway to every node. This gateway should only be installed on the edge/utility nodes, where you give access to external systems and users. These gateway nodes then are able to reach rest of the service(s) nodes.

quangbilly79 · ‎05-14-2023

I successfully installed it on 3 nodes. Normally you only need to install everything on 1 node (things like java, python you have to install on 3 nodes first of course). When go to the CM UI website, you can add another node and Cloudera will automatically install everything for you. In case you want to install things manually. Install all 3 packages "cloudera-manager-daemons", "cloudera-manager-agent", "cloudera-manager-server" on your main node, and for other nodes only install "cloudera-manager-daemons" "cloudera-manager-agent" and start these agent services. After that, you will see that two nodes are "managed" on the CM UI, meaning that you can skip the "Install Agent" step (since you've already installed "cloudera-manager-agent" and start it)

quangbilly79 · ‎04-27-2023

Thank you so much, this work for me!

RangaReddy · ‎03-30-2023

Hi @quangbilly79 Cloudera will support YARN and Kubernets deployment mode and it will not support Standalone mode (In standalone mode you can access the Spark Master using 7077 port). In order to check which node driver is launched and which node is executor is launched you need to go to Spark UI or Spark History Server UI of that application. From there go to Executors tab. You can see list of executors. In the second table you find executor id. Where the executor id is 'driver' that is the one Driver Node and remaining all are executors.

smdas · ‎02-28-2023

Hello @quangbilly79 Thanks for using Cloudera Community. The "Spark Master" refers to the Resource Manager responsible for allocating resources. Since you are using YARN, Your Team needs to use "--master yarn". The usage of "--master spark://<IP Address>:7077" is for Spark Standalone Cluster, which isn't the Case for your team. To your Observation concerning the "Driver Instance" & "Worker Instance" being added via "Add Role Instance", there is no such Option as YARN is the Resource Manager, which shall allocate the resources for Spark Driver & Executors. Review [1] for the usage of "--master" as well. Hope the above answers your Team's queries. Regards, Smarak [1] https://spark.apache.org/docs/latest/submitting-applications.html#launching-applications-with-spark-submit

smdas · ‎01-02-2023

Hello @quangbilly79 Thanks for using Cloudera Community. Based on your Post, you may consider "Kafka Gateway" as the Client for Kafka, which are setup on the Hosts wherein the same is added as per Cloudera Manager "Assign Roles". A Client/Gateway is familiar with the Service (Kafka in this Case) & all Client/Service Configs are available for the Client/Gateway without any manual intervention. Any changes made to the Service or Client Configs is pushed to the Service/Client Configuration by Cloudera Manager. Imagine a Scenario wherein you wish to run "hdfs dfs -ls" on a HDFS FileSystem. Simply running the Command won't work unless the Host wherein the Command "hdfs dfs -ls" is being run knows the Setup (HDFS FileSystem, NameNode, Port, Protocol). Review [1] for an Example. Adding an HDFS Gateway ensures User doesn't need to manually configure a Client/Gateway with Cloudera Manager doing the needful. Similarly, Kafka Gateway operates. Else, Customer need to manually configure the Client/Gateway Setup. Hope the above answer your query concerning the Gateway Role. Regards, Smarak [1] https://www.ibm.com/docs/en/spectrum-scale-bda?topic=hdfs-clients-configuration

Kartik_Agarwal · ‎12-23-2022

@quangbilly79 It should be /opt/cloudera/parcels/SPARK2/lib/

Online	Offline
Last Visited	‎09-23-2025 07:10 PM

Member Since	‎12-06-2022 05:51 PM
Last Visited	‎09-23-2025 07:10 PM
Posts	31
Kudos received	1

Cloudera Community

Re: Is there any chance to use Spark 3 on CDH 6.x ...

Re: SWICHDATABASE privilege missing error in HUE. ...

Re: Should I tick on the "Redeploy Client Configur...

Re: Api Error: Unable to authenticate in Hue while...

Re: Is there any problem if I just add Gateway rol...

Re: Do I need to install Cloudera Manager (CDH) on...

Re: spark-shell command not finding correct Java J...

Re: How to know which Node is Driver Node, which N...

Re: Spark "Master Node" and "Worker Node" in Cloud...

Re: What is Kafka Gateway and Kafka MirrorMaker wh...

Re: Where is the Jar folder for Spark in Cloudera?