Member since 03-16-2017
37 Posts
6 Kudos Received
2 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2484 | 09-30-2019 08:34 PM |
| | 5087 | 08-22-2019 01:18 PM |
05-26-2021 07:36 PM
Hi!

Those warning messages about RPC requests dropped due to backpressure are a sign that the particular tablet server is likely overloaded. Consider the following remedies:

- Upgrade to a recent version of Kudu (1.14 as of now). Since Kudu 1.9.0 there have been many fixes that might help reduce memory pressure for write-intensive workloads (e.g. see KUDU-2727, KUDU-2929) and read-only workloads (KUDU-2836), plus a bunch of other improvements. BTW, if you are using CDH, upgrading to CDH6.3.4 is a good first step in that direction: it contains back-ported fixes for KUDU-2727, KUDU-2929, and KUDU-2836.
- Make sure the tablet replica distribution is even across tablet servers: run the 'kudu cluster rebalance' CLI tool.
- If you suspect replica hot-spotting, consider re-creating the table in question to fan out the write stream across multiple tablets. I guess reading this guide might be useful: https://kudu.apache.org/docs/schema_design.html
- If nothing from the above helps, consider adding a few more tablet server nodes into your cluster. Once the new nodes are added, don't forget to run the 'kudu cluster rebalance' CLI tool.

Kind regards,
Alexey
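
To illustrate the hot-spotting point, here is a small self-contained simulation, not Kudu code. It compares how a burst of monotonically increasing keys (think timestamps) lands on tablets under range partitioning versus hash partitioning. The split points, the tablet count, and the MD5-based bucket function are all assumptions made for the illustration; Kudu uses its own hash function internally.

```python
import bisect
import hashlib
from collections import Counter
from typing import Sequence

NUM_TABLETS = 4  # hypothetical number of tablets for the table


def range_tablet(key: int, split_points: Sequence[int]) -> int:
    """Index of the range partition that contains `key`."""
    return bisect.bisect_right(split_points, key)


def hash_tablet(key: int, num_buckets: int) -> int:
    """Index of the hash bucket for `key` (MD5 stands in for Kudu's hash)."""
    digest = hashlib.md5(str(key).encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


# A burst of 1000 writes with monotonically increasing keys.
keys = range(1_000_000, 1_001_000)

# Three split points give four range partitions.
split_points = [250_000, 500_000, 750_000]

range_load = Counter(range_tablet(k, split_points) for k in keys)
hash_load = Counter(hash_tablet(k, NUM_TABLETS) for k in keys)

print("range partitioning:", dict(range_load))
print("hash partitioning: ", dict(hash_load))
```

With range partitioning, every key in the burst falls into the last partition, so a single tablet absorbs the whole write stream. With hash partitioning, the same keys are spread across all buckets, which is the fan-out effect described in the schema design guide.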
09-30-2019 08:34 PM
Hi, I think you will need Impala to make Superset work with Kudu. At http://superset.apache.org/#databases it's mentioned that a database engine needs '... proper DB-API driver and SQLAlchemy dialect ...' to be usable by Superset. I guess the '...proper DB-API driver ...' is based on JDBC, and there isn't a JDBC driver for Kudu as of now. As far as I know, there isn't a native Superset Kudu connector either. However, contributions are always welcome! Kind regards, Alexey
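
For what it's worth, here is a rough sketch of what the Impala route might look like from the Python side. It assumes the impyla package is installed (it registers an `impala://` SQLAlchemy dialect) and that an Impala daemon is listening on its default HiveServer2 port, 21050. The host name, database, and table name are made up for the example.

```python
def impala_sqlalchemy_uri(host: str, port: int = 21050, database: str = "default") -> str:
    """Build a SQLAlchemy URI for Impala; 21050 is Impala's default HiveServer2 port."""
    return f"impala://{host}:{port}/{database}"


def count_rows(uri: str, table: str) -> int:
    """Count rows in `table` through Impala.

    Needs sqlalchemy and impyla installed plus a reachable Impala daemon,
    so the imports are kept local to this function.
    """
    from sqlalchemy import create_engine, text

    engine = create_engine(uri)
    with engine.connect() as conn:
        return conn.execute(text(f"SELECT COUNT(*) FROM {table}")).scalar()


# Hypothetical host; the same URI string is what would go into Superset's
# database connection form.
uri = impala_sqlalchemy_uri("impalad.example.com")
print(uri)
```

Calling `count_rows(uri, "my_kudu_table")` would then query a Kudu-backed table that has been exposed through Impala, where `my_kudu_table` is a placeholder name.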
08-22-2019 01:20 PM
Whoops, the correct link to the WIP patch for PySpark integration work is http://gerrit.cloudera.org:8080/13088
08-22-2019 01:18 PM
Hi, I'm not sure there is full-fledged documentation on the Kudu PySpark API: the connector is still in an early development phase, if I'm not mistaken. However, the following in-flight patch has a few examples that might be helpful: https://gerrit.cloudera.org/#/c/13102/2/docs/developing.adoc But it doesn't answer your question about KuduContext: I'm not sure that functionality is implemented at this point. There was a WIP patch posted some time ago: https://gerrit.cloudera.org/#/c/13086/ However, I don't know what the status of that work is at this point, unfortunately.
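
In the meantime, a minimal sketch of reading a Kudu table from PySpark through the generic DataFrame reader might look like the following. It does not use KuduContext. It assumes the kudu-spark integration JAR is on the Spark classpath and, as I understand it, that the data source is registered under the name 'org.apache.kudu.spark.kudu' with the options 'kudu.master' and 'kudu.table'. The master address and table name in the usage comment are placeholders.

```python
KUDU_FORMAT = "org.apache.kudu.spark.kudu"  # data source name from kudu-spark


def read_kudu_table(spark, master: str, table: str):
    """Load a Kudu table as a Spark DataFrame.

    `spark` is expected to be a pyspark.sql.SparkSession started with the
    kudu-spark JAR available; `master` is the Kudu master address (host:port).
    """
    return (
        spark.read.format(KUDU_FORMAT)
        .option("kudu.master", master)
        .option("kudu.table", table)
        .load()
    )


# Usage (requires a running Spark session and a Kudu cluster):
#   df = read_kudu_table(spark, "kudu-master.example.com:7051", "my_table")
#   df.show()
```

Table management operations (creating or deleting tables) are what KuduContext covers on the Scala side, and this sketch does not attempt them.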