Member since
01-16-2018
540
Posts
33
Kudos Received
82
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
89 | 01-18-2023 12:10 AM | |
59 | 01-16-2023 01:54 AM | |
144 | 01-13-2023 01:59 AM | |
143 | 01-13-2023 01:35 AM | |
84 | 01-02-2023 10:03 PM |
01-24-2023
11:30 PM
Hello @Ellengogo I am not able to find the exact Case reference as this Post is few months old. Typically, such issues are caused by Resource Constraint, wherein the Engine Pod (Created when a Workbench Session is started) gets terminated owing to Resource Constraint. If the issue is persistent, a Support Case would be ideal as we require the review of the Kubernetes Output pertaining to the Engine Pod along with Resource Profile & other related artefacts. Regards, Smarak
... View more
01-18-2023
12:10 AM
Hello @Girija Since we haven't heard back from you, We shall mark the Post as Solved. If you happen to have any further ask, Feel free to Update the Post. In Summary, Internally, I wasn't able to replicate the issue being faced by you as I was able to create the ConfigSet using "_default" ConfigSet as baseConfig. Customer can use the below solrctl command to create a ConfigSet with Solr KeyTab: solrctl config --create Test_Config _default -p configSetProp.immutable=false Assuming the above Command fails, Running the solrctl command with "--trace" after "solrctl" & before "config" would print the trace logging & assist in troubleshooting the issue faced by your team. Regards, Smarak
... View more
01-17-2023
11:03 PM
Hello @pankshiv1809 Since we haven't heard from your side concerning the Post, We are marking the Post as Solved. If you have any further ask, Feel free to update the Post & we shall get back to you accordingly. Regards, Smarak
... View more
01-17-2023
10:46 PM
Hello @mingtian Note that Debug Logging won't ensure the Balancer would perform Region Movement, rather the same would confirm if Balancer is running yet the same isn't moving any Region owing to CostFactor. Example: I ensured 1 RegionServer didn't had any Region by RegionMovement & triggered a Balancer, which showed [1] & trigger a Region Movement (Note "Found A Solution That Moves 1 Region"). After the 1st Balancer is Completed, I triggered a 2nd Balancer, which printed [2], wherein the DEBUG report "Skipping Load Balancing". I believe your Team would see [2] i.e. Balancer is Skipping any Load Balancing owing to Cost Factor. As such, Your Team can consider the fact that HBase is rejecting Region Movement owing to the fact that any new Region-Movement is "Costlier" than Current Region Placement. Tweaking [3] Cost Parameters including setting " hbase.master.loadbalance.bytable" to "true" should help trigger a Balancer for your Team. Regards, Smarak [1] 2023-01-18 06:38:33,290 INFO org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Finished computing new moving plan. Computation took 95 ms to try 7200 different iterations. Found a solution that moves 1 regions; Going from a computed imbalance of 0.4961380973335763 to a new imbalance of 0.020487264673311183. funtionCost=RegionCountSkewCostFunction : (multiplier=500.0, imbalance=0.0); PrimaryRegionCountSkewCostFunction : (not needed); MoveCostFunction : (multiplier=7.0, imbalance=0.3333333333333333, need balance); ServerLocalityCostFunction : (multiplier=25.0, imbalance=0.0); RackLocalityCostFunction : (multiplier=15.0, imbalance=0.0); TableSkewCostFunction : (multiplier=35.0, imbalance=0.0); RegionReplicaHostCostFunction : (not needed); RegionReplicaRackCostFunction : (not needed); ReadRequestCostFunction : (multiplier=5.0, imbalance=1.0, need balance); WriteRequestCostFunction : (multiplier=5.0, imbalance=1.0, need balance); MemStoreSizeCostFunction : (multiplier=5.0, imbalance=0.0); StoreFileCostFunction : (multiplier=5.0, imbalance=0.0); [2] 2023-01-18 06:39:05,365 INFO org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer: Cluster wide - skipping load balancing because weighted average imbalance=0.013858568086431631 <= threshold(0.025). If you want more aggressive balancing, either lower hbase.master.balancer.stochastic.minCostNeedBalance from 0.025 or increase the relative multiplier(s) of the specific cost function(s). functionCost=RegionCountSkewCostFunction : (multiplier=500.0, imbalance=0.0); PrimaryRegionCountSkewCostFunction : (not needed); MoveCostFunction : (multiplier=7.0, imbalance=0.0); ServerLocalityCostFunction : (multiplier=25.0, imbalance=0.0); RackLocalityCostFunction : (multiplier=15.0, imbalance=0.0); TableSkewCostFunction : (multiplier=35.0, imbalance=0.0); RegionReplicaHostCostFunction : (not needed); RegionReplicaRackCostFunction : (not needed); ReadRequestCostFunction : (multiplier=5.0, imbalance=0.6685715976063684, need balance); WriteRequestCostFunction : (multiplier=5.0, imbalance=1.0, need balance); MemStoreSizeCostFunction : (multiplier=5.0, imbalance=0.0); StoreFileCostFunction : (multiplier=5.0, imbalance=0.0); [3] StochasticLoadBalancer (Apache HBase 3.0.0-alpha-4-SNAPSHOT API)
... View more
01-16-2023
09:09 PM
Hello @Ryan_2002 Thanks for engaging Cloudera Community. First of all, Thank You for the detailed description of the Problem. I believe your ask is Valid, yet reviewing the same over a Community Post isn't a suitable approach. Feasible for you to engage Cloudera Support to allow our Team to work with you, with the suitability of Screen-Sharing Session as well as Logs exchange, both of which aren't feasible in Community. That would greatly expedite the review of your ask. Regards, Smarak
... View more
01-16-2023
01:57 AM
Hello @mingtian Hope you are doing well. We wish to follow-up with you & check if the DEBUG Logging assisted in confirming the reasoning for Balancer Algorithm deciding against Region-Movement. If Yes, Kindly let us know if your Q in the Post has been answered or any further Q remains. Regards, Smarak
... View more
01-16-2023
01:55 AM
Hello @pankshiv1809 Hope you are doing well. We wish to follow-up on the Post & confirm whether your Team was requesting information into Dynamic Allocation to allow Spark adjust resources based on Workload requirement. Regards, Smarak [1] Job Scheduling - Spark 3.3.1 Documentation (apache.org)
... View more
01-16-2023
01:54 AM
Hello @panb We hope your Query has been addressed by us & shall mark the Post as Resolved. In Summary, Your Team needs to meet the requirement as stated in [1], which doesn't differentiate in Processor Type & I believe your Team is referring to Hygon Dhyana Processor. Note that we have shared the Hardware requirement is shared for CDP v7.1.8 as CDH isn't recommended now owing to End-Of-Life. As a Best Practise, I shall suggest engaging with Cloudera Account Team associated with Customer to perform any due diligence with respect to Supportability & Best Practices prior to onboarding Use-Cases into any new Platform, wherein Supportability is doubted by your Team. Regards, Smarak [1] Hardware Requirements | CDP Private Cloud (cloudera.com)
... View more
01-13-2023
01:59 AM
Hello @pankshiv1809 Thanks for using Cloudera Community. Based on your Post, Assuming [1] would help i.e. Using Dynamic Allocation to allow Spark adjust resources based on Workload requirement. Regards, Smarak [1] Job Scheduling - Spark 3.3.1 Documentation (apache.org)
... View more
01-13-2023
01:56 AM
Hello @panb Thanks for using Cloudera Community. As far as I am aware, your Team needs to meet the requirement as stated in [1], which doesn't differentiate in Processor Type & I believe your Team is referring to Hygon Dhyana Processor. Note that we have shared the Hardware requirement is shared for CDP v7.1.8 as CDH isn't recommended now owing to End-Of-Life. Regards, Smarak [1] Hardware Requirements | CDP Private Cloud (cloudera.com)
... View more
01-13-2023
01:47 AM
Hello @mingtian Thanks for using Cloudera Community. Based on your Post, We would suggest enabling DEBUG Logging for HMaster (Via HMaster UI To Avoid Any Restart) & trigger the Balancer. Generally, Balancer Algorithm may be deciding against running any Region-Alignment owing to Cost Factor [1]. The HMaster Debug Log would print such Balancer information for your review, upon which the Params discussed in [1] can be tuned to force Balancer, yet the Default Params are generally persisted for most Use-Cases. Note that Balancer Job isn't to merely fit Equal Regions per RegionServer. Balancer consider various Cost as defined by [1] to proceed with Region-Alignment. Regards, Smarak [1] StochasticLoadBalancer (Apache HBase 3.0.0-alpha-4-SNAPSHOT API)
... View more
01-13-2023
01:35 AM
1 Kudo
Hello @Ryan_2002 Thanks for using Cloudera Community. To your Q, the Driver Cap is the Engine/Resource Profile & the Executor's Resource Usage is defined by the SparkSession or "spark-defaults.conf" file within the Project wherein the Workbench Session is being created. Your Team can review the Pods in the User's Namespace & see the same i.e. upon a Workbench Session Creation, an Engine Pod is started with "Limits" set toEngine/Resource Profile Settings. After SparkSession is initialised, additional Pods are generated within the User's Namespace based on the Execution's Configs passed via SparkSession or "spark-defaults.conf" file. You may configure the Executor's Configs as per your usage yet the same depends on the CML Workspace AutoScale Range & InstanceType. Say, an InstanceType supporting 8 vCPU & Executors requesting 8 vCPU won't work. Similarly, AutoScale Max of 5 yet requesting Executors collectively utilising the Resource Limit of 5 Nodes. Hope the above helps answer your Post's queries. If Yes, Kindly mark the Post as Solved. If No, Feel free to share your concerns & we shall address accordingly. Regards, Smarak
... View more
01-02-2023
10:03 PM
Hello @quangbilly79 Thanks for using Cloudera Community. Based on your Post, you may consider "Kafka Gateway" as the Client for Kafka, which are setup on the Hosts wherein the same is added as per Cloudera Manager "Assign Roles". A Client/Gateway is familiar with the Service (Kafka in this Case) & all Client/Service Configs are available for the Client/Gateway without any manual intervention. Any changes made to the Service or Client Configs is pushed to the Service/Client Configuration by Cloudera Manager. Imagine a Scenario wherein you wish to run "hdfs dfs -ls" on a HDFS FileSystem. Simply running the Command won't work unless the Host wherein the Command "hdfs dfs -ls" is being run knows the Setup (HDFS FileSystem, NameNode, Port, Protocol). Review [1] for an Example. Adding an HDFS Gateway ensures User doesn't need to manually configure a Client/Gateway with Cloudera Manager doing the needful. Similarly, Kafka Gateway operates. Else, Customer need to manually configure the Client/Gateway Setup. Hope the above answer your query concerning the Gateway Role. Regards, Smarak [1] https://www.ibm.com/docs/en/spectrum-scale-bda?topic=hdfs-clients-configuration
... View more
12-27-2022
10:30 PM
Hello @sachin_saju Thanks for using Cloudera Community. You have 2 ask in the Post: 1. How to configure different Storage Policies with Cold & Hot Data, 2. Applying different Compression Algorithm in 1 Column Family. For Q2, I believe the same isn't feasible i.e. Compression Algorithm can be set at CF level. Review [1] for the Compression Algorithm recommendation around Hot & Cold type data. For Q1, I assume you are referring to HDFS Storage Policy. If Yes, the same is configured uniformly i.e. I am not sure if we can apply different HDFS Storage Policy for different data within the same CF. In HBase, We generally recommend SSD [2] for WAL, else the HBase Data relies on HDFS Storage Policy used. Alternatively, Use BackUp-Restore [3] for having a "Cold" Version of Data, which can be restored as per requirement. Regards, Smarak [1] https://hbase.apache.org/book.html#data.block.encoding.types [2] https://docs.cloudera.com/cdp-private-cloud-base/7.1.8/configuring-hbase/topics/hbase-configure-storage-policy-wal.html [3] https://hbase.apache.org/book.html#br.overview
... View more
12-27-2022
10:19 PM
1 Kudo
Hello @sachin_saju Thanks for using Cloudera Community. Your queries concerning the Read Path is discussed between a fellow Community User & myself in [1]. Kindly review the same & let us know if the same answer the queries around Read Path. In Summary, Read Path relies on a Merge of BlockCache & MemStore prior to returning the Output to the End-User, thereby avoiding any Inconsistent Read. Refer [2] for few Diagram around the same to help explain the Read Merge Path. Concerning Doubt # 3, Our Community User asked a similar Q in [3]. I haven't reviewed this Use-Case internally around Hit/Miss Ratio in the UI to answer the same. Henceforth, I shall let our fellow HBase Engineers to answer [3], which may answer your Q3 as well. Barring Q3, Let me know if your first 2 queries are addressed by [1] & [2]. Regards, Smarak [1] https://community.cloudera.com/t5/Support-Questions/Is-it-possible-for-inconsistent-read-in-Hbase-with-Memcache/m-p/359452#M238123 [2] https://nag-9-s.gitbook.io/hbase/hbase-architecture/hbase-read-merge [3] https://community.cloudera.com/t5/Support-Questions/Create-cache-miss-scenario-in-HBase-with-HDP-2-6-5/m-p/359795
... View more
12-25-2022
05:50 AM
Hello @Serhii This is an Old Post, yet I am answering the same as there are few changes with CDP recent release & ensuring Community awareness. With CDP v7.1.6 [1] allows Accumulo to be installed via Cloudera Manager. The Installation is documented via [1] & requires a Separate Parcel to be installed before attempting to add Accumulo via Cloudera Manager. Having said that, Feel free to engage with Cloudera Account Team for Customer as the investment into Accumulo isn't as par with other similar counterpart to review any long term engagement with Accumulo for meeting Customer's Use-Case. Regards, Smarak [1] https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/opdb-accumulo-installation/topics/opdb-accumulo-install.html
... View more
12-23-2022
07:04 AM
Hello @Girija Internally, I wasn't able to replicate the issue being faced by you. I was able to create the ConfigSet using "_default" ConfigSet as baseConfig. I am assuming the issue is specific to your Environment & your team should use the CLI to better diagnose such issue. Your team can use the below solrctl command to create a ConfigSet with Solr KeyTab: solrctl config --create Test_Config _default -p configSetProp.immutable=false Assuming the above Command fails, Running the solrctl command with "--trace" after "solrctl" & before "config" would print the trace logging & assist in troubleshooting the issue faced by your team. Regards, Smarak
... View more
12-21-2022
07:04 AM
1 Kudo
Hello @SagarCapG Confirmed that Phoenix v5.1.0 has the Fix for " !primarykeys" to show the Primary Key linked with a Phoenix Table. Upon checking our Product Documentation, CDP v7.1.6 introduces Phoenix v5.1.0 [1]. As such, I am surprised your Team has Phoenix v5.0.0 with CDP v7.1.7, wherein Official v7.1.7 Doc [2] says Phoenix v5.1.1.7.1.7.0-551 is used. Since the Issue is fixed in Phoenix v5.1.x & CDP v7.1.6 onwards ship Phoenix v5.1.x, Kindly engage Cloudera Support to allow Support to review your Cluster for identifying the reasoning for CDP v7.1.7 using Phoenix v5.0.0. Or, Upgrade to Phoenix v5.1.x (If Customer is managing Phoenix outside of CDP) to use "!primarykeys" functionality. Regards, Smarak [1] What's New in Apache Phoenix | CDP Private Cloud (cloudera.com) [2] Cloudera Runtime component versions | CDP Private Cloud
... View more
12-20-2022
11:35 PM
Hello @brajeshreddy Since the Issue isn't replicated with CML release internally & your team have engaged Cloudera Support for further assistance, We shall Close the Post now. For our fellow Community Users, the Steps to perform the Team's Name Modification is shared above. Assuming the same isn't working for Customers, Ensure you are connected as MLAdmin & Caching is ruled out as well. Regards, Smarak
... View more
12-20-2022
08:28 AM
Hello @sekhar1 Since we haven't heard back from your side concerning the Post, We shall mark the Post as Resolved with the Action Plan to review if Traffic from your Machine isn't allowed to or from the Security Group linked with the VPC wherein the CML Workspace instances are deployed. Check if the Kubernetes Pods associated with the CML Workspace Kubernetes Cluster are Up/Running. If Yes, Such Exception should be reviewed from a Network Standpoint only. You may reach out to your Customer's AWS/Platform Team to review the Traffic between your Machine & the VPC within which the CML Workspace is deployed. Assuming your team fixed the issue outside of any Customer's Network concerns, We would appreciate your feedback to ensure our fellow Community Users can benefit from your experience. Regards, Smarak
... View more
12-20-2022
08:23 AM
Hello @brajeshreddy Thanks for using Cloudera Community. On a CML v 2.0.34-b116, I was able to rename the Team's Name via UI as shared below via [1] & [2]. Upon Clicking On Team's Name, I get the UI [3], which allows me to modify the Team's Name. Since I am using the same Version as yours, the Issue likely lies with Permission (You may review the Access granted at Environment Level), Caching (You may clear the Browser Cache) etc. If both of the above checks aren't helpful, I suggest engaging Cloudera Support to ensure a Screen-Sharing Session can be done with your team to review further. Regards, Smarak [1] Name Before Change: [2] Name After Change: [3] Name Change Provisioning:
... View more
12-20-2022
07:54 AM
Hello @SagarCapG Thanks for using Cloudera Community. While I haven't tested the above Use-Case in Phoenix v5.0.0, I tested in Phoenix v5.1.0 (Used in CDP v7.1.7) & the same shows the Primary Key correctly [1]. As such, I believe the above ask is fixed in CDP v7.1.8 atleast. I reviewed Upstream JIRA & it appears the same is actually fixed in Phoenix v5.1.x as per comments in https://issues.apache.org/jira/browse/PHOENIX-6651. As such, Kindly plan for CDP v7.1.8 Upgrade for obtaining the concerned functionality of identifying PK via " !primarykeys". Regards, Smarak [1] 0: jdbc:phoenix:> CREATE TABLE Employee (EmpId Integer Not Null Primary Key, Ename Varchar(50));
0: jdbc:phoenix:> !primarykeys Employee;
+-----------+-------------+------------+-------------+---------+---------+-------------+-----------+-----------+-------------+---------+---------------+
| TABLE_CAT | TABLE_SCHEM | TABLE_NAME | COLUMN_NAME | KEY_SEQ | PK_NAME | ASC_OR_DESC | DATA_TYPE | TYPE_NAME | COLUMN_SIZE | TYPE_ID | VIEW_CONSTANT |
+-----------+-------------+------------+-------------+---------+---------+-------------+-----------+-----------+-------------+---------+---------------+
| | | EMPLOYEE | EMPID | 1 | | A | 4 | INTEGER | null | 4 | |
+-----------+-------------+------------+-------------+---------+---------+-------------+-----------+-----------+-------------+---------+---------------+
... View more
12-19-2022
12:34 AM
Hello @anks1 Thanks for using Cloudera Community. Kindly confirm the following details: CML Deployment (CDP Public Cloud Or Private Cloud) CML Version Used (Via CML Workspace Details In CML Page) Any timeframe of how many days before an App transition to Failed State from Running State Regards, Smarak
... View more
12-19-2022
12:32 AM
Hello @sekhar1 We wish to follow-up on the Post & confirm if your Team have resolved the issue. In Summary, such Exception are received if Traffic from your Machine isn't allowed to or from the Security Group linked with the VPC wherein the CML Workspace instances are deployed. Check if the Kubernetes Pods associated with the CML Workspace Kubernetes Cluster are Up/Running. If Yes, Such Exception should be reviewed from a Network Standpoint only. You may reach out to your Customer's AWS/Platform Team to review the Traffic between your Machine & the VPC within which the CML Workspace is deployed. Regards, Smarak
... View more
12-19-2022
12:15 AM
Hi @pacman A Merge happens for each Read Operation i.e. BlockCache & MemStore. As such, Incorrect Values aren't observed. Having said that, If you observe any such scenario of Read/Write Inconsistency, Kindly share a Use-Case & any replication attempt to allow us to review accordingly. Regards, Smarak
... View more
12-13-2022
07:39 AM
1 Kudo
Hello @pacman HFile is only updated after flush of Memstore. In the scenario shared, Read Merge would share the Updated values without updating the Hfile. We can perform the same by following for verification: 1. Create a Table with 1 CF. Insert 1 Row & flush Table to ensure 1 Hfile is created. 2. Read the Table, which would read from Hfile & place in BlockCache. The BlockCache Size is visible via HBase UI at the CF family level within the Table's Level Statistics, 3. Read the Table again, which would read from BlockCache. Verify via Hit Ratio from the BlockCache Stats in HBase UI. 4. Update the Row by using the same RowKey, yet using a different Value for the Column Qualifier within the Column Family. 5. Read the Table again. You should get the Updated Value from Step 4. 6. To make things interesting, Remove the concerned Table's 1 Region hosting RegionServer WAL file & Kill the RegionServer PID. This ensure the MemStore isn't flushed owing to Ungraceful exit & WAL can't be replayed. 7. Start the RegionServer, which shall create the WAL file. Read the Table again. The same would show Value from Step 1. This would help confirm the HFile isn't Updated when HBase Read Merge the Values from BlockCache & MemStore while reading the Table as per Step 5. Regards, Smarak
... View more
12-13-2022
12:48 AM
Hello @ashokaakula As per [1], the Master Password isn't recoverable in the event of the same being lost. As such, We shall request your Team to engage Cloudera Support, if your Team have lost the same without further delay. Regards, Smarak [1] https://docs.cloudera.com/documentation/enterprise/6/6.3/topics/navigator_encrypt_register.html#concept_opn_tnn_mr__section_g5n_tnn_mr
... View more
12-09-2022
07:31 AM
Hello @pacman Thanks for using Cloudera Community. HBase handles such scenario by Read Merge. A Read "Merges" Key Values from the Block Cache, MemStore, and HFiles in the following steps: First, the Scanner looks for the Row Cells in theBlock Cache (Read Cache). Next, the Scanner looks in the MemStore (Write Cache). If the Scanner does not find all of the Row Cells in the MemStore and Block Cache, then Hfiles are referred. Hope the above answers your query. If Yes, Kindly mark the Post as Resolved. Regards, Smarak
... View more
12-09-2022
07:14 AM
Hello @quangbilly79 Thanks for using Cloudera Community. The Link [1] applies to CDE Service, which is available on Private Cloud & Public Cloud. In CDP Public Cloud, Your Team need to connect to the CDP Management Console & access the CDE Service as documented in [1]. In CDP Private Cloud, You need to install Data Service [2] to access CDE Service, upon which your Team can Schedule Job. The Cloudera Manager of CDP Private Cloud is used to install Data Service [2] & CDE is available after successful installation on Data Service. Without Data Service, Oozie can be used by your Team as shared above by Steven. Kindly review & let us know if you have any queries. Regards, Smarak [1] Scheduling jobs in Cloudera Data Engineering [2] CDP Private Cloud Data Services: Documentation | CDP Private Cloud (cloudera.com)
... View more
12-02-2022
01:26 AM
Hello @GregDol Thanks for using Cloudera Community. This is an Old Post, yet I am sharing a review for other Community Users, in case they encounter the concerned issue. Reviewing the Post indicates the Issue is likely caused by missing " rt.jar" in the JDK used by the Solr Service. In other words, the concerned Class " java.time.zone.ZoneRulesProvider" is present in " /usr/java/jdk1.8.0_232-cloudera/jre/lib/rt.jar" for CDP Cluster (Path & Version may be different for Customer). The concerned JDK is used by the various Services running including Solr. In your scenario, the concerned Class appears to be missing. Assuming your team wish to confirm whether a JAR consists of the Class, you may use Command [ /usr/java/jdk1.8.0_232-cloudera/bin/jar -tvf /usr/java/jdk1.8.0_232-cloudera/jre/lib/rt.jar] to list the Classes, which should include "java.time.zone.ZoneRulesProvider" as well. If your Team is using any different JDK, Review the same & ensure the "rt.jar" is present for Class "java.time.zone.ZoneRulesProvider" to be loaded successfully. Regards, Smarak
... View more