Member since: 02-02-2021
Posts: 116
Kudos Received: 2
Solutions: 5
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 745 | 08-13-2021 09:44 AM |
| | 3692 | 04-27-2021 04:23 PM |
| | 1369 | 04-26-2021 10:47 AM |
| | 923 | 03-29-2021 06:01 PM |
| | 2746 | 03-17-2021 04:53 PM |
05-09-2023
04:23 PM
Hi experts, I am fairly new to CDP and Cloudera's platform, which my current employer uses. When interviewers ask which CDH version I use and I answer "CDH 7.2.16", they ask, "Are you sure?" Is there a CDP version that is distinct from an on-prem CDH version? Does the versioning differ between on-prem and CDP? I would like to understand this better so I can answer correctly in the future. Thanks,
Labels: Training
10-21-2022
04:20 PM
Hi experts, I am trying to use the NiFi GetHDFS processor to read a file from my CDP cluster in Azure and then a PutFile processor to download that file to my local filesystem. My NiFi is a standalone server, separate from my CDP cluster. Currently I am seeing this error in nifi-app.log:
2022-10-21 18:18:29,631 ERROR [Timer-Driven Process Thread-5] o.apache.nifi.processors.hadoop.GetHDFS GetHDFS[id=fab61e35-0183-1000-2eb0-4d511c15db51] Failed to properly initialize Processor. If still scheduled to run, NiFi will attempt to initialize and run the Processor again after the 'Administrative Yield Duration' has elapsed. Failure is due to Unable to load custom token provider class. org.apache.hadoop.fs.azurebfs.contracts.exceptions.TokenAccessProviderException: Unable to load custom token provider class.
Any help is much appreciated. Thanks,
06-23-2022
12:54 PM
Hi experts, what is the best way to troubleshoot an application or job that is taking longer than usual, for example a 5-minute job that takes an hour or more to complete? What should I look at first? Could someone walk me through the process? Thanks,
Labels: Apache YARN
06-23-2022
12:53 PM
Hi experts, is it possible to tell from the application logs alone that an application is waiting for available cluster resources, assuming other running jobs are currently using them? If so, what does that look like in the application logs, and what should I look for? Thanks,
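As a complement to reading application logs, the scheduler's own view can be inspected. The sketch below is not a log-based answer: it assumes the YARN ResourceManager REST API is reachable without authentication, and the `rm_url` value is a hypothetical placeholder. It lists applications in the `ACCEPTED` state, which is where an application typically sits while it waits for its ApplicationMaster container to be allocated. An application that is already `RUNNING` but waiting for more containers would not show up here.

```python
import json
import urllib.request


def waiting_applications(payload):
    """Return the apps in ACCEPTED state from a parsed /ws/v1/cluster/apps response."""
    # The RM returns {"apps": null} when nothing matches, so guard both levels.
    apps = (payload.get("apps") or {}).get("app") or []
    return [app for app in apps if app.get("state") == "ACCEPTED"]


def fetch_waiting_applications(rm_url, timeout=10):
    """Query the ResourceManager (rm_url is an assumption, e.g. http://host:8088)."""
    url = f"{rm_url.rstrip('/')}/ws/v1/cluster/apps?states=ACCEPTED"
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return waiting_applications(json.load(response))


def summarize(apps):
    """Format one line per waiting application: id, queue, name."""
    return [f"{a.get('id')}\t{a.get('queue')}\t{a.get('name')}" for a in apps]
```

On a Kerberos-secured cluster the plain `urlopen` call would need to be replaced with an authenticated client; that part is left out of this sketch.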
Labels: Apache YARN
01-12-2022
01:45 PM
Hi experts, our Hadoop cluster has an old version of log4j, and we would like to know how to upgrade it properly. Can we just replace the log4j jar file with a newer version? This is one of the log4j files currently in our Hadoop cluster: /usr/hdp/2.6.1.0-129/hadoop/client/log4j-1.2.17.jar Any help is much appreciated. Thanks,
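Before deciding how to upgrade, it can help to inventory which log4j jars and versions are actually present. The following is a minimal sketch for that inventory step only; it does not say whether swapping a jar is safe. The function names are hypothetical, the file-name pattern is an assumption based on names like `log4j-1.2.17.jar`, and the scan root is taken from the path quoted in the post.

```python
import re
from pathlib import Path

# Matches names such as log4j-1.2.17.jar or log4j-core-2.17.1.jar.
LOG4J_JAR = re.compile(r"^(log4j(?:-[a-z0-9]+)*?)-(\d+(?:\.\d+)*)\.jar$")


def parse_log4j_jar(filename):
    """Return (artifact, version) for a log4j jar file name, else None."""
    match = LOG4J_JAR.match(filename)
    if match is None:
        return None
    return match.group(1), match.group(2)


def find_log4j_jars(root):
    """Walk `root` and return sorted (path, artifact, version) tuples."""
    root_path = Path(root)
    if not root_path.is_dir():
        return []
    found = []
    for path in root_path.rglob("log4j*.jar"):
        parsed = parse_log4j_jar(path.name)
        if parsed is not None:
            found.append((str(path), parsed[0], parsed[1]))
    return sorted(found)


if __name__ == "__main__":
    # Scan root taken from the path in the post; adjust for your install.
    for jar_path, artifact, version in find_log4j_jars("/usr/hdp/2.6.1.0-129"):
        print(f"{version}\t{artifact}\t{jar_path}")
```

Jars whose names do not follow the simple `artifact-version.jar` shape are skipped by this pattern, so the output is a starting point rather than a complete audit.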
Labels: Apache Hadoop
12-16-2021
12:40 PM
Hi @willx, is there a way to tell whether a Hadoop path is a volume or a directory?
12-15-2021
04:47 PM
Hi experts, can someone please explain the difference between volumes and folders in Hadoop? Thanks,
Labels: Apache Hadoop
11-18-2021
08:28 AM
Is there any way to have Ambari skip the "distro-select" yum installation? Thanks,
11-17-2021
03:01 PM
Hi experts, I am having issues installing Hadoop 3.x with Ambari. Ambari is currently displaying the error below:
Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stack-hooks/before-INSTALL/scripts/hook.py", line 37, in <module>
    BeforeInstallHook().execute()
  File "/usr/lib/ambari-agent/lib/resource_management/libraries/script/script.py", line 352, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stack-hooks/before-INSTALL/scripts/hook.py", line 33, in hook
    install_packages()
  File "/var/lib/ambari-agent/cache/stack-hooks/before-INSTALL/scripts/shared_initialization.py", line 37, in install_packages
    retry_count=params.agent_stack_retry_count)
  File "/usr/lib/ambari-agent/lib/resource_management/core/base.py", line 166, in __init__
    self.env.run()
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 160, in run
    self.run_action(resource, action)
  File "/usr/lib/ambari-agent/lib/resource_management/core/environment.py", line 124, in run_action
    provider_action()
  File "/usr/lib/ambari-agent/lib/resource_management/core/providers/packaging.py", line 30, in action_install
    self._pkg_manager.install_package(package_name, self.__create_context())
  File "/usr/lib/ambari-agent/lib/ambari_commons/repo_manager/yum_manager.py", line 219, in install_package
    shell.repository_manager_executor(cmd, self.properties, context)
  File "/usr/lib/ambari-agent/lib/ambari_commons/shell.py", line 753, in repository_manager_executor
    raise RuntimeError(message)
RuntimeError: Failed to execute command '/usr/bin/yum -y install distro-select', exited with code '1', message: 'Error: Nothing to do
'
Any help is much appreciated. Thanks,
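When sifting through several agent failures like the one above, it can be handy to pull the failed command, exit code, and message out of the final `RuntimeError` line. The following is a small sketch that assumes the message keeps the exact shape shown in this traceback; `parse_failure` is a hypothetical helper, not part of Ambari, and treating the last token of the command as the package name is an assumption that only holds for simple `yum install` commands.

```python
import re

# Pattern mirrors the RuntimeError text shown in the traceback above.
FAILURE = re.compile(
    r"Failed to execute command '(?P<command>[^']*)', "
    r"exited with code '(?P<code>-?\d+)', "
    r"message: '(?P<message>.*)'",
    re.DOTALL,  # the message may contain a trailing newline before the quote
)


def parse_failure(text):
    """Extract command, exit code, message and package from the error text."""
    match = FAILURE.search(text)
    if match is None:
        return None
    command = match.group("command")
    return {
        "command": command,
        "exit_code": int(match.group("code")),
        "message": match.group("message").strip(),
        # Assumption: the package is the last token of the install command.
        "package": command.split()[-1] if command.split() else None,
    }


if __name__ == "__main__":
    error = (
        "RuntimeError: Failed to execute command "
        "'/usr/bin/yum -y install distro-select', exited with code '1', "
        "message: 'Error: Nothing to do\n'"
    )
    print(parse_failure(error))
```

With yum, "Nothing to do" is commonly what is printed when no enabled repository offers the requested package, so checking whether any configured repository actually provides `distro-select` is a reasonable next step.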
Labels: Apache Ambari, Apache Hadoop
10-16-2021
09:00 AM
1 Kudo
@Faizan_Ali Thanks for the explanation, that makes sense. So while an application is running, its container logs are written to the local directory "${yarn.nodemanager.log-dirs}/application_${appid}", and after the application completes, the logs are aggregated into yarn.nodemanager.remote-app-log-dir.
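To make the local half of that concrete, here is a minimal sketch that expands a `yarn.nodemanager.log-dirs` value (which may be a comma-separated list) into the candidate local directories for one application. The function name and the example directories are hypothetical; the real values come from your yarn-site.xml.

```python
def local_app_log_dirs(log_dirs, app_id):
    """Return candidate local log directories for one application.

    log_dirs: value of yarn.nodemanager.log-dirs, possibly comma-separated.
    app_id:   application ID, with or without the "application_" prefix.
    """
    if not app_id.startswith("application_"):
        app_id = f"application_{app_id}"
    # Drop empty entries and surrounding whitespace from the configured list.
    dirs = [d.strip() for d in log_dirs.split(",") if d.strip()]
    return [f"{d.rstrip('/')}/{app_id}" for d in dirs]


if __name__ == "__main__":
    # Example directories are placeholders, not defaults from any distribution.
    for path in local_app_log_dirs("/hadoop/yarn/log,/data1/yarn/log", "1634370000000_0001"):
        print(path)
```

Once the application has finished and aggregation has run, the aggregated logs are usually read with `yarn logs -applicationId <application ID>` rather than by browsing the remote directory directly.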