Member since
10-25-2019
16
Posts
8
Kudos Received
4
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1681 | 05-21-2020 06:39 AM |
| | 1730 | 05-21-2020 05:05 AM |
| | 8766 | 05-17-2020 11:58 AM |
| | 7840 | 05-05-2020 01:22 AM |
01-09-2026
11:52 PM
➤ It sounds like you are encountering a common issue in HDFS where metadata overhead and block minimums cause a large discrepancy between your actual data size and your disk utilization. While 650 files at 4 MB each is only about 2.6 GB of data, the way HDFS manages these files on your physical disks (especially in smaller or test clusters) can lead to unexpected storage consumption.

➤ Root Causes of the 100% Utilization

1. Reserved space and "Non-DFS Used": HDFS does not have access to the entire disk. By default, Hadoop reserves a portion of each disk for the OS and non-Hadoop data (defined by dfs.datanode.du.reserved). If you are running on small disks (e.g., 20 GB–50 GB), the combination of your data, logs, and reserved space can quickly hit the 100% threshold.

2. Local filesystem block overhead: Even though your HDFS block size is 4 MB, the underlying OS filesystem (EXT4 or XFS) uses its own block size (usually 4 KB). The metadata for 650 individual files, their checksum (.meta) files, and the edit logs on the NameNode create a "death by a thousand cuts" scenario on small disks.

3. Log accumulation: Check /var/log/hadoop or your configured log directory. In HDFS 3.3.5, if a cluster is struggling with space, the DataNodes and NameNode generate massive amounts of heartbeat and "Disk Full" logs, which consume the remaining non-DFS space and push the disk to 100%.

➤ How to Tackle the Situation

Step 1: Identify where the space is going. Run the following command to see whether the space is taken by HDFS data or other files:

$ hdfs dfsadmin -report

DFS Used: space taken by your 650 files.
Non-DFS Used: space taken by logs, the OS, and other applications. If this is high, your logs are the culprit.

Step 2: Clear logs and temporary data. If "Non-DFS Used" is high, clear out the Hadoop log directory:

# Example path
rm -rf /var/log/hadoop/hdfs/*.log.*
rm -rf /var/log/hadoop/hdfs/*.out.*

Step 3: Adjust the disk-check thresholds. By default, a DataNode stops working once its disk is about 95% full. If you are in a test environment and need to squeeze out more space, you can lower the reserved space in hdfs-site.xml:

<property>
  <name>dfs.datanode.du.reserved</name>
  <value>1073741824</value>
</property>

Step 4: Combine small files (long-term fix). HDFS is designed for large files; 650 files of 4 MB are considered "small files".
The problem: every file, regardless of size, takes up roughly 150 bytes of RAM on the NameNode and creates separate metadata entries.
The solution: use the getmerge command or a MapReduce/Spark job to combine these 650 files into 2 or 3 larger files (e.g., 1 GB each).
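For Step 4, here is a minimal sketch of one way to merge the small files with getmerge, assuming a hypothetical source directory /data/incoming and target directory /data/merged (neither path comes from the original question):

# Concatenate every file under the (hypothetical) source directory into one local file
hdfs dfs -getmerge /data/incoming /tmp/merged_incoming.dat

# Write the merged result back to HDFS as a single large file
hdfs dfs -put /tmp/merged_incoming.dat /data/merged/merged_incoming.dat

# After verifying the merged copy, remove the original small files to free blocks and NameNode metadata
hdfs dfs -rm -r -skipTrash /data/incoming

# Clean up the local temporary copy
rm /tmp/merged_incoming.dat

Note that getmerge simply concatenates the files, so this approach only makes sense for formats (plain text, CSV, and the like) where concatenation keeps the data valid.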
09-17-2020
11:33 PM
Use hadoop as the root password (you may be asked to change it).
07-22-2020
09:24 PM
The Ambari Files View (same problem with the Hue File Browser) is not the right tool if you want to upload (very) big files. It runs inside a JVM, and uploading big files uses more memory: you will hit the maximum available memory very quickly and cause performance issues for other users while you are uploading. By the way, it's possible to add additional Ambari servers hosting views to improve performance (they can be dedicated to certain teams/projects). For very big files, prefer CLI tools: scp to an edge node with a large filesystem followed by hdfs dfs -put, or distcp, or use an object store accessible from your Hadoop cluster with good network bandwidth.
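As a rough sketch of the CLI approach described above (the hostnames and paths below are invented for illustration, not taken from any real cluster):

# 1. Copy the large file to an edge node that has a big enough local filesystem
scp /local/path/bigfile.dat user@edge-node.example.com:/data/staging/

# 2. From the edge node, push it into HDFS
hdfs dfs -put /data/staging/bigfile.dat /user/myproject/

# 3. Or, for cluster-to-cluster copies, use distcp
hadoop distcp hdfs://source-nn:8020/user/myproject/bigfile.dat hdfs://target-nn:8020/user/myproject/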
05-21-2020
06:39 AM
2 Kudos
apt-get installation doesn't seem to install any bitcoin package, and the same goes for the Python package manager (pip ...). It's probably a mistake in the Dockerfile. In any case, the Docker image is old, and the GitHub repo doesn't seem to exist any more.
05-21-2020
05:05 AM
1 Kudo
Cloudera CDP is based on a Cloudera Runtime version plus a Cloudera Manager version that is compatible with that runtime: https://docs.cloudera.com/cdpdc/7.0/release-guide/topics/cdpdc-release-notes-links.html At the time of writing, CDP DC 1.0 uses Cloudera Runtime 7.0.3 and Cloudera Manager 7.0.3. The Cloudera Runtime component versions are meant to keep a consistent set of Hadoop component versions that work together. It will also make it easier to migrate from CDH/HDP if your service/component versions are the same as, or close to, the runtime component versions of CDP. If I'm not wrong, there is currently only one CDP DC version (1.0), with minor updates of CM and Cloudera Runtime component versions: https://docs.cloudera.com/cloudera-manager/7.0.3/release-notes/topics/cm-release-notes.html https://docs.cloudera.com/runtime/7.0.3/release-notes/topics/rt-runtime-component-versions.html
05-20-2020
06:12 AM
Thanks, enabling G1GC helped, along with reducing the JVM heap settings in the bootstrap conf file. Thanks again.
05-14-2020
03:53 AM
Hello @rvillanueva, You can check how many threads a user is currently using by running ps -L -u <username> | wc -l. If the user's limits are hit (ulimit -u for max processes/threads, ulimit -n for open files), the user can't spawn any more threads. The most likely reasons in this case are: the same user is running other jobs and holding open files/threads on the node where it tries to launch/spawn the container, or system threads were not excluded from the count. Check which applications are running and what their current open file counts are. Also check the application log (application_XXX), if available, to see in which phase the exception is thrown and on which node the issue occurs.
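For illustration, a few commands that can help correlate thread counts with the user's limits (the username yarn below is only an example, not from the original thread):

# Count the threads currently running for a given user
ps -L -u yarn | wc -l

# Run the following as the user in question (e.g., via su or sudo):
ulimit -u   # max user processes, which also bounds the number of threads
ulimit -n   # max open files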
05-13-2020
08:27 AM
@satishjan1 The initial question is asking about setting the hostname. The information you reference is telling you to do that, but for a different operating system. My first response was telling you how to do it for RHEL. For your next question: you do not have to set the hostname in /etc/sysconfig/network; you have to do it the way required for your operating system (see above). The hostname must be set and must persist after a reboot. If you do not set the hostname before installing the cluster, you will have no end of problems with services and components later on down the road.
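For reference, a minimal example of setting a persistent hostname on RHEL/CentOS 7 and later (the FQDN below is only a placeholder):

# Set the hostname so it persists across reboots
hostnamectl set-hostname node1.example.com

# Verify the short name and the FQDN
hostname
hostname -f

# On RHEL/CentOS 6, the equivalent is the HOSTNAME= line in /etc/sysconfig/network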
05-12-2020
02:30 AM
2 Kudos
Hi, I was able to solve the issue by running the ambari-server setup command again and selecting option 4 instead of the embedded DB. That solved it, and now I'm able to start the service without any issue. Thanks, GophalRaj
05-05-2020
01:22 AM
1 Kudo
@Mondi Just update dfs.cluster.administrators with the admin usernames you want in the HDFS config (and restart the HDFS, YARN, MR2, ... services).
Example: dfs.cluster.administrators = hdfs,ops
You can also use an HDFS administrators group (only one administrator group) via dfs.permissions.superusergroup.
Example: dfs.permissions.superusergroup = operations
To verify the config has been updated once the services are restarted:
hdfs getconf -confKey dfs.cluster.administrators
or
hdfs getconf -confKey dfs.permissions.superusergroup
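As a hedged usage example, once the services have been restarted, a user covered by the admin list or superuser group above should be able to run admin-only HDFS commands (the username opsadmin is hypothetical):

# A user listed in dfs.cluster.administrators, or belonging to the
# dfs.permissions.superusergroup group, can now run superuser-only commands, e.g.:
sudo -u opsadmin hdfs dfsadmin -report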