Member since
03-23-2015
1288
Posts
114
Kudos Received
98
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3349 | 06-11-2020 02:45 PM | |
5062 | 05-01-2020 12:23 AM | |
2859 | 04-21-2020 03:38 PM | |
3562 | 04-14-2020 12:26 AM | |
2360 | 02-27-2020 05:51 PM |
11-14-2019
08:17 PM
@deekshant To debug Namenode issue you need to check below - 1. Check active namenode[NN] logs [for time when it got reboot] 2. Check active NN zkfc logs [same time - if you see any issue] 3. Check for standby NN logs at same time if you see any error 4. Check for standby NN zkfc logs for any error at same timestamp 5. Check for Active NN .out file for any warnings/error 6. Check for system logs "/var/log/message" for any issue at particular moment of time. You will find error in one of the above file. accordingly you can go for RCA. Do revert if you need further help.
... View more
11-14-2019
04:10 PM
@Timothyw0 , No, you can't update or delete in DF. You have to use filter/transform DF and create a new DF. Cheers Eric
... View more
11-12-2019
10:48 PM
1 Kudo
Hi @ckk, I believe that we started supporting Sqoop import into S3A since CDH6.1 https://docs.cloudera.com/documentation/enterprise/6/6.1/topics/admin_sqoop_s3_import.html and you need to use "--target-dir". Can you share the full Sqoop command you used when using "--target-dir"? Can you also attach "--verbose" output file for review? Thanks Eric
... View more
11-07-2019
08:43 AM
Exactly I wanted! Thank you @EricL
... View more
11-05-2019
08:21 PM
Hi @wret_1311, Thanks for your response and I appreciate for confirming the solution. I'm glad, it helped you 🙂
... View more
10-31-2019
09:25 PM
Hi @DataMike, Yes, once CM is down, you can't update the configuration unless you "hack" into CM's backend database, which is also down in our scenario. So if you setup MySQL in such a way that CM will connect to LB / VIP host, which will route to the MySQL host that you want to, so in the event of one MySQL instance is down, simply update LB to only go to the working one. This avoids the needs to update the CM config. Hope that can help. Cheers Eric
... View more
10-24-2019
12:08 AM
thanks, i suppose one could just move that to wherever you want the home to be. this was part of a hack deployment, so ultimately not needed.
... View more
10-14-2019
04:46 PM
@Plop564 I am not an expert in Spark, but my understand is below: 1. I will have 100 output files >>> this depends how many partitions you have in your original DF. "coalesce" can only reduce number of partitions, so if you have less than 100 partitions before, then it won't do anything, as "coalesce" does not do shuffling. If you want to guarantee number of output files, I believe "repartition" function is better. 2. Each single CSV file is locally sorted, I mean by the "date" column ascending >>> Yes 3. Files are globally sorted, I mean CSV part-0000 have "date" inferior to CSV part-0001, CSV part-0001 have "date" inferior to CSV part-0002 and so on .. >>> I believe it is also Yes, but will wait for other Spark experts to confirm. Cheers Eric
... View more