Member since
04-14-2020
2174
Posts
4
Kudos Received
4
Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 559 | 09-22-2020 12:19 AM
 | 1215 | 07-07-2020 04:56 AM
 | 745 | 05-15-2020 12:20 AM
 | 8871 | 05-14-2020 04:29 AM
03-23-2022
01:21 AM
1 Kudo
Oh, this is an issue from a long time ago. The root cause was that the charset on the new machines was not UTF-8. Just make sure the charset on all machines is UTF-8, and it works.
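To check whether a host is the odd one out, a quick sketch (the `localectl` line is RHEL/CentOS-specific and shown as an example only):

```shell
# Print the active charset settings; every host should report UTF-8
# in its LANG (and LC_ALL, if set).
locale | grep -E '^(LANG|LC_ALL)='

# On systemd-based distros such as RHEL 7 you could set it persistently with:
# localectl set-locale LANG=en_US.UTF-8
```

Run this on each machine and compare the output across the cluster.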
04-25-2021
11:07 PM
Hi @SimoneMasc , Thank you for reaching out to the Cloudera Community! Please review the documentation below on the basic requirements (OS/DB/Java/Network/Platform, etc.): https://docs.cloudera.com/cdp/latest/release-guide/topics/cdpdc-requirements-supported-versions.html Also see https://docs.cloudera.com/cdp-private-cloud/latest/data-migration/topics/cdp-data-migration-machine-learning-to-cdp.html which describes data migration.
12-04-2020
03:21 AM
Hello @Madhur Thanks a lot for the reply. I can confirm that the operating system is rhel7. The base url used was a configuration setting passed down but we have used it for other clusters without issues. I will nonetheless check with the client to make sure it is correct. Concerning the link to the bug report, the upgrade was done for Ambari 2.6.2.2 while the mentioned bug was fixed in version 2.6.0.0. Also, the scenarios presented in the bug are a bit different in our case. Thanks a lot for the help
11-03-2020
07:30 AM
Thanks. What I am experiencing is that the complete file, if it is 300GB, has to be assembled before upload to S3, which requires 300GB of either memory or disk. DistCp does not create a part file per block, and I have not witnessed any file splitting being done. Multipart uploads require you to get an upload ID, upload many part files with a numeric extension, and at the end ask S3 to put them back together; I do not see any of this being done. I admit I do not know much about all this, and it could be happening out of my sight.
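For context, a hedged sketch of how this is normally tuned: when DistCp writes through the S3A connector, S3A itself performs the multipart upload transparently (buffering parts to disk by default), so the whole file is never assembled in one place. The bucket name and paths below are placeholders:

```shell
# Sketch only: S3A splits each file into multipart uploads above the
# multipart threshold; parts are buffered (to disk by default) and
# uploaded as they fill, then S3 reassembles them server-side.
hadoop distcp \
  -Dfs.s3a.multipart.size=128M \
  -Dfs.s3a.multipart.threshold=128M \
  -Dfs.s3a.fast.upload.buffer=disk \
  hdfs:///data/bigfile \
  s3a://my-bucket/data/
```

If you see 300GB accumulating on local disk, it may be the per-part disk buffers rather than the whole file being staged; the S3A documentation covers the buffering modes in detail.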
09-30-2020
03:51 AM
@rblough Thank you for the continued support.

1) The detailed output showed that there are 603,723 blocks in total. Looking at the HDFS UI, the DataNodes report having 586,426 blocks each.
2) The command is being run as the hdfs user.
3) hdfs fsck / -openforwrite says that there are 506,549 blocks in total.

The discrepancy in block count still seems to be there. Below are the summaries of the different fsck outputs.

hdfs fsck / -files -blocks -locations -includeSnapshots

Status: HEALTHY
 Number of data-nodes: 3
 Number of racks: 1
 Total dirs: 64389
 Total symlinks: 0

Replicated Blocks:
 Total size: 330079817503 B (Total open files size: 235302 B)
 Total files: 625308 (Files currently being written: 129)
 Total blocks (validated): 603723 (avg. block size 546740 B) (Total open file blocks (not validated): 122)
 Minimally replicated blocks: 603723 (100.0 %)
 Over-replicated blocks: 0 (0.0 %)
 Under-replicated blocks: 0 (0.0 %)
 Mis-replicated blocks: 0 (0.0 %)
 Default replication factor: 3
 Average block replication: 3.0
 Missing blocks: 0
 Corrupt blocks: 0
 Missing replicas: 0 (0.0 %)
 Blocks queued for replication: 0

Erasure Coded Block Groups:
 Total size: 0 B
 Total files: 0
 Total block groups (validated): 0
 Minimally erasure-coded block groups: 0
 Over-erasure-coded block groups: 0
 Under-erasure-coded block groups: 0
 Unsatisfactory placement block groups: 0
 Average block group size: 0.0
 Missing block groups: 0
 Corrupt block groups: 0
 Missing internal blocks: 0
 Blocks queued for replication: 0

FSCK ended at Wed Sep 30 12:23:06 CEST 2020 in 23305 milliseconds

hdfs fsck / -openforwrite

Status: HEALTHY
 Number of data-nodes: 3
 Number of racks: 1
 Total dirs: 63922
 Total symlinks: 0

Replicated Blocks:
 Total size: 329765860325 B
 Total files: 528144
 Total blocks (validated): 506549 (avg. block size 651004 B)
 Minimally replicated blocks: 506427 (99.975914 %)
 Over-replicated blocks: 0 (0.0 %)
 Under-replicated blocks: 0 (0.0 %)
 Mis-replicated blocks: 0 (0.0 %)
 Default replication factor: 3
 Average block replication: 2.9992774
 Missing blocks: 0
 Corrupt blocks: 0
 Missing replicas: 0 (0.0 %)
 Blocks queued for replication: 0

Erasure Coded Block Groups:
 Total size: 0 B
 Total files: 0
 Total block groups (validated): 0
 Minimally erasure-coded block groups: 0
 Over-erasure-coded block groups: 0
 Under-erasure-coded block groups: 0
 Unsatisfactory placement block groups: 0
 Average block group size: 0.0
 Missing block groups: 0
 Corrupt block groups: 0
 Missing internal blocks: 0
 Blocks queued for replication: 0

FSCK ended at Wed Sep 30 12:28:06 CEST 2020 in 11227 milliseconds
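A quick way to see where the gap comes from is to compare the block totals that each fsck variant reports side by side (this is a sketch and needs a running cluster; snapshot-only and open-for-write blocks are counted differently by each mode):

```shell
# Compare the "Total blocks (validated)" line across fsck modes.
# The default run skips snapshot copies; -includeSnapshots adds them;
# -openforwrite changes how in-flight files are counted.
hdfs fsck /                   | grep 'Total blocks'
hdfs fsck / -includeSnapshots | grep 'Total blocks'
hdfs fsck / -openforwrite     | grep 'Total blocks'
```

If the -includeSnapshots total is the outlier, the "extra" blocks are likely snapshot references rather than physical blocks on the DataNodes.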
09-22-2020
12:19 AM
Hello @Mondi , When you install the CDP Trial Version, it includes an embedded PostgreSQL database and is not suitable for a production environment. Please check this information for more details. Also, see how to end the trial or upgrade the trial version, and Managing Licenses.
08-24-2020
01:01 AM
Hello Madhur, Thanks for your response. However, we can see the same issue in Ambari 2.7.4, which we installed just last week to overcome this issue. Can you please help in this regard? Thanks, KK
08-19-2020
06:42 AM
Hi @rohit19, As this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also give you the opportunity to share details specific to your environment that could help others provide a more accurate answer to your question.
07-08-2020
11:59 PM
@SeanU This level of detailed log scanning and alert functionality is not available. The existing service role logs, for which rules can be set, will not contain every application exception, since that detailed information lives in the application logs. You can check the available JobHistory Server and ResourceManager logs to see whether the information logged during application runtime serves your purpose.
07-07-2020
04:56 AM
2 Kudos
Hi @shrikant_bm , Whenever the Active NameNode server goes down, its associated daemon goes down with it. HA works the same way whether the Active NameNode daemon or the whole server goes down: ZKFC will not receive the heartbeat, the ZooKeeper session will expire, and the other NameNode is notified that a failover should be triggered. To answer your question: yes, in both of the cases you mentioned, HA should work.
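To watch the failover happen, you can poll the HA state from the command line (the service IDs nn1/nn2 below are examples; substitute the ones defined in your hdfs-site.xml):

```shell
# Show the HA state of every configured NameNode (Hadoop 3.x).
hdfs haadmin -getAllServiceState

# Or query one NameNode by its nameservice ID (nn1/nn2 are placeholders).
hdfs haadmin -getServiceState nn1
hdfs haadmin -getServiceState nn2
```

After killing the active NameNode (daemon or host), the standby should report "active" within the ZooKeeper session timeout.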
06-19-2020
06:43 AM
@Udhav It appears your MySQL/MariaDB is not running after the restart. Try this:

service mariadb start
chkconfig mariadb on

Also, I would refrain from using "localhost" or "127.0.0.1"; this opens the door to networking and permission issues. Always use an FQDN, for example c3701.ambari.apache.org, mapped to the public IP address of each node via /etc/hosts. That FQDN should also be the hostname of each node. If this answer resolves your issue or allows you to move forward, please choose to ACCEPT this solution and close this topic. If you have further dialogue on this topic, please comment here or feel free to private-message me. If you have new questions related to your use case, please create a separate topic and feel free to tag me in your post. Thanks, Steven @ DFHZ
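A quick sanity check for the FQDN advice above (a sketch; the expected name is whatever you put in /etc/hosts):

```shell
# The node should resolve to a real FQDN, not localhost.
hostname -f

# And that name should map to the node's routable IP, not 127.0.0.1.
getent hosts "$(hostname -f)"
```

If `getent` returns a 127.x address for the FQDN, fix the /etc/hosts ordering before pointing Ambari or the database at it.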
06-05-2020
06:51 AM
Hi @galzoran , Kindly check whether you are hitting the known issue below. https://my.cloudera.com/knowledge/How-to-delete-growing-Cloudera-Manager-database-AUDITS-table?id=75896
05-25-2020
11:31 PM
Hello @sarm , what are the impacts of changing a service account password in a Kerberized cluster? Service accounts such as hdfs, hbase, and spark rely on keytabs rather than passwords. Their principals look like any other user principal, but they depend on valid keytabs being in place. If the passwords for these service accounts expire or change, you will need to regenerate their keytabs once the password is updated. You can regenerate keytabs in Ambari by going to the Kerberos screen and pressing the "Regenerate Keytabs" button; this also automatically distributes the keytabs where they are needed. Note that it is always best to restart the cluster when you do this. NOTE: for a smoother process, change the password for one service account, restart its service, observe whether there is any impact, and only then proceed to the other service accounts. To answer your question: changing the passwords of the service accounts will not affect running services, since the passwords are not used to start the services. Passwords are not required during service startup or during the lifetime of the process.
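After regenerating, you can verify a keytab actually works before restarting anything. The keytab path, principal, and realm below are examples only; substitute the ones from your cluster:

```shell
# List the principals and key versions stored in the keytab.
klist -kt /etc/security/keytabs/hdfs.headless.keytab

# Try to obtain a ticket with it; failure here means the keytab no longer
# matches the (changed) account and must be regenerated.
kinit -kt /etc/security/keytabs/hdfs.headless.keytab hdfs-mycluster@EXAMPLE.COM
klist
```

A mismatch between the key version number (kvno) in the keytab and in the KDC is the usual symptom of a password change without keytab regeneration.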
05-15-2020
12:20 AM
1 Kudo
Hello @cyborg , Thank you for reaching out to the Community! There are two ways to place a node in maintenance mode.

1) Suppress alerts only: Select the host --> Actions for Selected > Begin Maintenance (Suppress Alerts/Decommission). The Begin Maintenance (Suppress Alerts/Decommission) dialog box opens, with the role instances running on the host displayed at the top. Deselect the Decommission Host(s) option and click Begin Maintenance. To exit maintenance: Select the host --> Actions for Selected > End Maintenance, deselect the Recommission Host(s) option, and click End Maintenance. This re-enables alerts for the host.

With this option, events are still logged; only the alerts those events would otherwise generate are suppressed. You can see a history of all the events recorded for entities while they were in maintenance mode. This is useful when you need to take actions in your cluster (make configuration changes and restart various elements) and do not want to see the alerts those actions would generate. For more details, refer to https://docs.cloudera.com/documentation/enterprise/5-14-x/topics/cm_mc_maint_mode.html#cmug_topic_14_1

2) Suppress alerts and decommission: Select the host --> Actions for Selected > Begin Maintenance (Suppress Alerts/Decommission), then select Decommission Host(s). If the selected host runs a DataNode role, you can specify whether to replicate under-replicated data blocks to other DataNodes to maintain the cluster's replication factor; if the host is not running a DataNode role, you will only see the Decommission Host(s) option. Click Begin Maintenance. The Host Decommission Command dialog box opens and displays the progress of the command. To exit maintenance: Select the host --> Actions for Selected > Recommission Host(s), choose whether to bring the hosts online and start all roles now or start the roles later, and click End Maintenance.

The second option lets you perform minor maintenance on cluster hosts, such as adding memory or changing network cards or cables, where a maintenance window is expected.

In your case: to take a single node down for a few hours, with no under-replicated blocks and a replication factor greater than 1, you can simply suppress alerts and follow the first path you described in the question.
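The suppress-alerts flow can also be scripted against the Cloudera Manager REST API. This is a sketch under assumptions: the host name, credentials, CM host, and API version are all placeholders (check `GET /api/version` on your CM for the right version string):

```shell
# Put a host into maintenance mode (alerts suppressed) via the CM API.
curl -u admin:admin -X POST \
  "http://cm-host.example.com:7180/api/v19/hosts/node1.example.com/commands/enterMaintenanceMode"

# And take it back out when the work is done.
curl -u admin:admin -X POST \
  "http://cm-host.example.com:7180/api/v19/hosts/node1.example.com/commands/exitMaintenanceMode"
```

This is handy when you are cycling through many hosts and clicking through the dialog for each one is impractical.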
05-14-2020
06:02 AM
Hi @Madhur Appreciate your assistance. I am using CM; where would this setting be in CM for making the change at the cluster level, and can you confirm these values have to be passed in seconds? Could you also provide steps or a document outlining how to change this while submitting Spark jobs?
05-14-2020
03:53 AM
Hello @rvillanueva , You can check how many threads a user is running with ps -L -u <username> | wc -l. If the user's open-files limit (ulimit -n, run as that user) is hit, the user cannot spawn any more threads. The most likely causes in this case are: the same user running other jobs with open files on the node where it tries to launch/spawn the container, or system threads being excluded from the count. Check which applications are running and what their current open files are. Also check the application log (application_XXX), if available, to see in which phase the exception is thrown and on which node the issue occurs.
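Putting those two checks together, a small sketch (run it as, or substitute, the user that owns the failing container):

```shell
# Count this user's threads and show the shell's open-files limit
# side by side; if threads approaches the relevant ulimit, new
# containers will fail to spawn on this node.
user=$(whoami)
threads=$(ps -L -u "$user" --no-headers 2>/dev/null | wc -l)
nofile=$(ulimit -n)
echo "user=$user threads=$threads open-files-limit=$nofile"
```

Note that `ulimit -n` reports the limit of the current shell; the limit that matters is the one inherited by the NodeManager and its containers, which may differ.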
05-13-2020
12:47 PM
Hi @Yong , you can try setting ACLs on HDFS. Below is the doc that gives more details. https://docs.cloudera.com/runtime/7.0.3/hdfs-acls/topics/hdfs-acls-features.html
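As a minimal sketch of what that looks like in practice (the path and user below are hypothetical, and `dfs.namenode.acls.enabled` must be true on the NameNode):

```shell
# Grant one extra user read/execute access to a directory without
# changing its POSIX owner/group/other bits.
hdfs dfs -setfacl -m user:yong:r-x /data/shared

# Inspect the resulting ACL entries.
hdfs dfs -getfacl /data/shared
```

The `-m` flag modifies (adds or updates) entries; `-x` removes a named entry, and `-b` strips all extended ACL entries back to the plain permission bits.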