Created 03-21-2022 04:12 AM
Hi,
we have a problem with agent installation on hosts which we want to add to the cluster.
On the Install Agents page, the process fails with an error "Failed to copy installation files.
Examining cloudera-scm-server.log we got this (The process fails when trying to copy files).
2022-03-21 10:55:01,075 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfiguratorProgress: Transitioning from MAKE_TEMP_DIR (PT0.197S) to COPY_FILES
2022-03-21 10:55:01,081 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfigurator: Using default key bundle URL
2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:com.cloudera.server.cmf.node.NodeConfiguratorProgress: Setting COPY_FILES as failed and done state
2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:net.schmizz.sshj.transport.TransportImpl: Disconnected - BY_APPLICATION
2022-03-21 10:55:01,111 INFO NodeConfiguratorThread-7-0:com.cloudera.cmf.model.HostInstallArgs: Deprecated option for unlimited strength JCE. Value set to False.
2022-03-21 10:55:05,863 INFO CommandPusher-1:com.cloudera.server.cmf.CommandPusherThread: Acquired lease lock on DbCommand:128
2022-03-21 10:55:05,866 ERROR CommandPusher-1:com.cloudera.cmf.command.flow.WorkOutputs: CMD id: 128 Failed to complete installation on host.
On the node where the agent should be installed (auth.log). The ssh connection opens and authenticates, but it disconnects a second later.
Mar 21 10:55:00 ubuntu-worker1 sshd[1782]: Accepted password for x from 10.56.23.11 port 36258 ssh2
Mar 21 10:55:00 ubuntu-worker1 sshd[1782]: pam_unix(sshd:session): session opened for user x by (uid=0)
Mar 21 10:55:00 ubuntu-worker1 systemd-logind[784]: New session 9 of user x.
Mar 21 10:55:01 ubuntu-worker1 sshd[1865]: Received disconnect from 10.56.23.11 port 36258:11:
Mar 21 10:55:01 ubuntu-worker1 sshd[1865]: Disconnected from user troia 10.56.23.11 port 36258
Mar 21 10:55:01 ubuntu-worker1 sshd[1782]: pam_unix(sshd:session): session closed for user troia
Mar 21 10:55:01 ubuntu-worker1 systemd-logind[784]: Session 9 logged out. Waiting for processes to exit.
What could be the issue here? We are using ubuntu 20.04.4 servers, cloudera manager 7.4.4, and CDP private cloud base 7.1.7.
On the same configuration the agent installation was successful with Cloudera manager 7.5.4 and CPD private cloud base 7.1.7, but it failed when installing parcels. We have later found out that for ubuntu 20.04 those versions are not compatible, so we have rebuilt the cluster.
Thank you for your reply.
Created 03-23-2022 04:39 AM
HI,
thank you for your reply. I have found the solution.
The problem was related to one of the known issues of Cloudera manager 7.4.4:
OPSAPS-61523
Failure when installing Cloudera Manager AgentsWhen installing the Cloudera Manager Agent package on new hosts through either the Add Hosts wizard or Add Cluster wizard, if you select Cloudera Repository during the Select Repository step, the Agent Installation fails with message "Failed to Copy Installation Files".Select Custom Repository instead and enter the URL and your license credentials in the following format:
https://[username]:[password]@archive.cloudera.com/p/cm7/7.4.4
Created 03-21-2022 03:19 PM
@asomrak ,
Please have a look at the individual nodes and look for a directory called /tmp/scm_prepare_node.*
Inside that directory there should be a log file called scm_prepare_node.log, which could tell you more about the error.
If you can't figure out the root cause from that log's content, please share the file here so that we can take a look.
Cheers,
André
--
Was your question answered? Please take some time to click on "Accept as Solution" below this post.
If you find a reply useful, say thanks by clicking on the thumbs up button.
Created 03-23-2022 04:39 AM
HI,
thank you for your reply. I have found the solution.
The problem was related to one of the known issues of Cloudera manager 7.4.4:
OPSAPS-61523
Failure when installing Cloudera Manager AgentsWhen installing the Cloudera Manager Agent package on new hosts through either the Add Hosts wizard or Add Cluster wizard, if you select Cloudera Repository during the Select Repository step, the Agent Installation fails with message "Failed to Copy Installation Files".Select Custom Repository instead and enter the URL and your license credentials in the following format:
https://[username]:[password]@archive.cloudera.com/p/cm7/7.4.4