Support Questions

Find answers, ask questions, and share your expertise

Problem with Cloudera agent installation on CDP Private Cloud

avatar
New Contributor

Hi, 

 

we have a problem with agent installation on hosts which we want to add to the cluster.

On the Install Agents page, the process fails with an error "Failed to copy installation files.

 

Screenshot 2022-03-21 at 11.50.32.png Examining cloudera-scm-server.log we got this (The process fails when trying to copy files).

 

2022-03-21 10:55:01,075 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfiguratorProgress: Transitioning from MAKE_TEMP_DIR (PT0.197S) to COPY_FILES

2022-03-21 10:55:01,081 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfigurator: Using default key bundle URL

2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfiguratorProgress: Setting COPY_FILES as failed and done state

2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:net.schmizz.sshj.transport.TransportImpl: Disconnected - BY_APPLICATION

2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:com.cloudera.cmf.model.HostInstallArgs: Deprecated option for unlimited strength JCE. Value set to False.

2022-03-21 10:55:05,863 INFO CommandPusher-1:com.cloudera.server.cmf.CommandPusherThread: Acquired lease lock on DbCommand:128

2022-03-21 10:55:05,866 ERROR CommandPusher-1:com.cloudera.cmf.command.flow.WorkOutputs: CMD id: 128 Failed to complete installation on host.

 

On the node where the agent should be installed (auth.log). The ssh connection opens and authenticates, but it disconnects a second later.

 

Mar 21 10:55:00 ubuntu-worker1 sshd[1782]: Accepted password for x from 10.56.23.11 port 36258 ssh2

Mar 21 10:55:00 ubuntu-worker1 sshd[1782]: pam_unix(sshd:session): session opened for user x by (uid=0)

Mar 21 10:55:00 ubuntu-worker1 systemd-logind[784]: New session 9 of user x.

Mar 21 10:55:01 ubuntu-worker1 sshd[1865]: Received disconnect from 10.56.23.11 port 36258:11: 

Mar 21 10:55:01 ubuntu-worker1 sshd[1865]: Disconnected from user troia 10.56.23.11 port 36258

Mar 21 10:55:01 ubuntu-worker1 sshd[1782]: pam_unix(sshd:session): session closed for user troia

Mar 21 10:55:01 ubuntu-worker1 systemd-logind[784]: Session 9 logged out. Waiting for processes to exit.

 

What could be the issue here? We are using ubuntu 20.04.4 servers, cloudera manager 7.4.4, and CDP private cloud base 7.1.7.

 

On the same configuration the agent installation was successful with Cloudera manager 7.5.4 and CPD private cloud base 7.1.7, but it failed when installing parcels. We have later found out that for ubuntu 20.04 those versions are not compatible, so we have rebuilt the cluster.

 

Thank you for your reply.

 

1 ACCEPTED SOLUTION

avatar
New Contributor

HI,

 

thank you for your reply. I have found the solution.

 

The problem was related to one of the known issues of Cloudera manager 7.4.4:

 

OPSAPS-61523 

Failure when installing Cloudera Manager AgentsWhen installing the Cloudera Manager Agent package on new hosts through either the Add Hosts wizard or Add Cluster wizard, if you select Cloudera Repository during the Select Repository step, the Agent Installation fails with message "Failed to Copy Installation Files".Select Custom Repository instead and enter the URL and your license credentials in the following format:

 

https://[username]:[password]@archive.cloudera.com/p/cm7/7.4.4 

 

 

 

View solution in original post

2 REPLIES 2

avatar
Super Guru

@asomrak ,

 

Please have a look at the individual nodes and look for a directory called /tmp/scm_prepare_node.*

Inside that directory there should be a log file called scm_prepare_node.log, which could tell you more about the error.

 

If you can't figure out the root cause from that log's content, please share the file here so that we can take a look.

 

Cheers,

André

--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.

--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
New Contributor

HI,

 

thank you for your reply. I have found the solution.

 

The problem was related to one of the known issues of Cloudera manager 7.4.4:

 

OPSAPS-61523 

Failure when installing Cloudera Manager AgentsWhen installing the Cloudera Manager Agent package on new hosts through either the Add Hosts wizard or Add Cluster wizard, if you select Cloudera Repository during the Select Repository step, the Agent Installation fails with message "Failed to Copy Installation Files".Select Custom Repository instead and enter the URL and your license credentials in the following format:

 

https://[username]:[password]@archive.cloudera.com/p/cm7/7.4.4