Member since 02-27-2020
Posts: 173
Kudos Received: 42
Solutions: 48
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1026 | 11-29-2023 01:16 PM |
| | 1127 | 10-27-2023 04:29 PM |
| | 1060 | 07-07-2023 10:20 AM |
| | 2440 | 03-21-2023 08:35 AM |
| | 871 | 01-25-2023 08:50 PM |
07-20-2020
08:48 PM
In the PutHBaseRecord processor, set the Row Identifier Field Name property to ${MYCOL}, using the NiFi Expression Language. Hope this helps!
07-20-2020
05:26 PM
An alternative way to upload a local file to HDFS is to use a command-line tool called distcp. The basic command is as follows:

hadoop distcp file:///<path to local file> /<path on hdfs>

This generates and submits a MapReduce job that uploads your data to HDFS piece by piece.
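As a minimal sketch of the command above, the helper below assembles the distcp invocation from a local path and an HDFS destination and prints it for review rather than running it. The function name `build_distcp_upload` and both example paths are hypothetical. Because the copy runs as map tasks, the local path generally has to be readable from the nodes that execute them.

```shell
#!/usr/bin/env bash
# Build a distcp command that copies a local file into HDFS.
# The function name and both example paths are hypothetical placeholders.
build_distcp_upload() {
  local src="$1"   # absolute local path, e.g. /data/export/events.csv
  local dest="$2"  # destination path on HDFS, e.g. /user/etl/landing
  # "file://" followed by an absolute path gives the file:///... form.
  printf 'hadoop distcp file://%s %s\n' "$src" "$dest"
}

# Print the command for review; run the printed line on a cluster node to execute it.
build_distcp_upload /data/export/events.csv /user/etl/landing
```

Printing the command first makes it easy to check the source URI before a MapReduce job is actually submitted.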
07-20-2020
09:28 AM
Are you using distcp for the migration? If your requirement is to reduce heavy load on the network and you are OK with the migration taking longer, the -bandwidth option in distcp can help. It lets you specify the maximum bandwidth each map operation can use. You'd of course first need to estimate the number of map operations that will be executed. Otherwise, I'm not aware of any rack-aware HDFS migration approach.
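To make the estimate concrete, here is a rough sketch that splits an overall bandwidth budget across map tasks. It assumes distcp's `-m` option (maximum number of simultaneous maps) together with `-bandwidth` (per-map limit in MB/s); the function name, the 200 MB/s budget, and the NameNode URIs are hypothetical.

```shell
#!/usr/bin/env bash
# Split a total bandwidth budget across distcp map tasks.
# -m caps the number of simultaneous maps; -bandwidth is the per-map limit in MB/s.
# The function name, budget, and cluster URIs are hypothetical.
build_throttled_distcp() {
  local total_mbps="$1"  # overall budget in MB/s
  local maps="$2"        # maximum number of map tasks (must be >= 1)
  local src="$3"
  local dest="$4"
  local per_map=$(( total_mbps / maps ))  # integer division, rounds down
  # Avoid passing 0 to -bandwidth; note that this floor can exceed the
  # budget when there are more maps than MB/s available.
  if [ "$per_map" -lt 1 ]; then per_map=1; fi
  printf 'hadoop distcp -m %d -bandwidth %d %s %s\n' "$maps" "$per_map" "$src" "$dest"
}

# 200 MB/s shared by at most 20 maps gives 10 MB/s per map.
build_throttled_distcp 200 20 hdfs://source-nn:8020/data hdfs://target-nn:8020/data
```

Capping the map count with `-m` keeps the worst-case aggregate rate close to maps × per-map bandwidth, which is the figure the network budget should be checked against.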
07-20-2020
08:59 AM
OK, I get your granularity point. Thanks for clarifying. Unfortunately, we don't have a Cloudera-supported tool that can do a simple backup of a Kafka cluster. I can only speculate on the reason, but this is likely a rare case where a backup (rather than replication) is required.
07-19-2020
09:47 PM
There's an open source tool, kafka-backup, that sounds like what you are looking for. I'm not sure I follow your granularity point, though.
07-19-2020
09:38 PM
NiFi is one option to accomplish what you need. You can find an example here for a generic SQL database moving data "in real-time" to Hive. If you could describe your use case in more detail, the community could assist you better.
07-17-2020
02:15 PM
MLOps will eventually be available as part of the Cloudera Data Science Workbench (CDSW) product. Keep an eye out for new releases coming soon.
07-17-2020
02:11 PM
CDW doesn't give you access to that safety valve setting in the Hue configuration, so you won't be able to disable the download button. You do have access to that configuration if you set up a Data Hub cluster and have your users access the data through the Hue interface there.
07-16-2020
05:13 PM
There currently isn't a way to do this in CDP. Are you trying to block users from viewing the data (Ranger might help) or just downloading the data? What is your use case exactly?
07-16-2020
04:51 PM
To get output like the Hive page you linked to, you just need this:

describe formatted <TABLE_NAME> <COLUMN_NAME>;

That works in Hue. Can you further clarify what output you are looking for in an ideal scenario?