Member since 06-26-2015
515 Posts
137 Kudos Received
114 Solutions
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 2025 | 09-20-2022 03:33 PM
 | 5610 | 09-19-2022 04:47 PM
 | 3042 | 09-11-2022 05:01 PM
 | 3354 | 09-06-2022 02:23 PM
 | 5301 | 09-06-2022 04:30 AM
02-22-2022
05:29 PM
Actually, an easier way to ignore the duplicated column names and still process the columns correctly is to use a schema to describe your data. For example, say you have the following CSV:
col_a,col_b,col_b
1,2,3
4,5,6
You can configure your CSVReader with an explicit schema for these columns, and the data will then be processed correctly. HTH, André
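To illustrate the idea outside NiFi, here is a minimal Python sketch of the same approach: ignore the header row and apply your own field names by position. This is not the CSVReader configuration itself; the function name `read_with_schema` and the replacement name `col_c` are hypothetical and only used for illustration.

```python
import csv
import io

# The sample CSV from the post, with the duplicated "col_b" header.
RAW_CSV = "col_a,col_b,col_b\n1,2,3\n4,5,6\n"

# Hypothetical schema: field names we choose ourselves, applied by position.
FIELD_NAMES = ["col_a", "col_b", "col_c"]


def read_with_schema(text, field_names):
    """Parse CSV text, ignoring its header and naming columns from field_names."""
    reader = csv.reader(io.StringIO(text))
    next(reader)  # skip the header row that contains the duplicated name
    return [dict(zip(field_names, row)) for row in reader]


rows = read_with_schema(RAW_CSV, FIELD_NAMES)
for row in rows:
    print(row)
```

Because the names come from the schema rather than from the header, the duplicated header value never has a chance to overwrite a column.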
02-22-2022
01:51 PM
1 Kudo
Thank you! Looks like what I wanted.
02-22-2022
02:19 AM
1 Kudo
Hi, @mehmetersoy , CDP does include Druid libraries, which are required by other components, but starting Druid as a standalone service is not possible and is not supported. Cheers, André
02-21-2022
03:56 PM
@spserd , You're right. The Kafka 0.10 and 0.11 connectors are very old and have been discontinued. Cheers, André
02-21-2022
06:33 AM
Hi @araujo , This is exactly what I was looking for. Thanks very much!
02-20-2022
09:19 PM
@gfsupp, has the reply helped resolve your issue? If so, please mark the appropriate reply as the solution, as it will make it easier for others to find the answer in the future.
02-20-2022
05:31 PM
I've added a "remove" operation to the chain to solve the problem I mentioned in the last post, removing the "null" in the first position of the "measures" array:
[
  {
    "operation": "shift",
    "spec": {
      "LevelID": "assetId",
      "PLCTime": "datapoints[0].timestamp",
      "Data*Type": "datapoints[0].measures[&(0,1)].name",
      "Data*": "datapoints[0].measures[&(0,1)].value"
    }
  },
  {
    "operation": "remove",
    "spec": {
      "datapoints": {
        "*": {
          "measures": {
            "0": ""
          }
        }
      }
    }
  }
]
This should give you the exact output you were looking for. HTH, André
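If it helps to see what the two operations do step by step, here is a rough Python emulation of the chain. It is only a sketch of the logic, not Jolt itself. It assumes the wildcard in "Data*" always matches a number (Data1, Data2, ...), and the sample record is hypothetical since the original input isn't shown here.

```python
import re


def transform(record):
    """Roughly emulate the shift + remove chain for one flat input record."""
    slots = {}
    for key, value in record.items():
        # "Data*Type" is the more specific pattern, so check it first.
        match = re.fullmatch(r"Data(\d+)Type", key)
        if match:
            slots.setdefault(int(match.group(1)), {})["name"] = value
            continue
        match = re.fullmatch(r"Data(\d+)", key)
        if match:
            slots.setdefault(int(match.group(1)), {})["value"] = value

    # Shift step: the matched number is used as the array index, so
    # indexes that were never written to (such as 0) end up as None.
    size = max(slots) + 1 if slots else 0
    measures = [slots.get(index) for index in range(size)]

    # Remove step: drop whatever sits at index 0 of "measures".
    if measures:
        del measures[0]

    return {
        "assetId": record.get("LevelID"),
        "datapoints": [
            {"timestamp": record.get("PLCTime"), "measures": measures}
        ],
    }


# Hypothetical input record, for illustration only.
sample = {
    "LevelID": "A1",
    "PLCTime": "2022-02-20T17:31:00Z",
    "Data1Type": "temperature",
    "Data1": 21.5,
    "Data2Type": "pressure",
    "Data2": 1.2,
}

result = transform(sample)
print(result)
```

The leading null appears because the numbering starts at 1 while array indexes start at 0, which is why removing index 0 is enough to clean up the output.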
02-20-2022
01:24 AM
@akylai , The Cloudera Quickstart docker image contains a very old version of CDH and is no longer supported. If you want to try the Cloudera Data Platform (CDP), I'd recommend you use the Cloudera Private Cloud trial: https://www.cloudera.com/downloads/cdp-private-cloud-trial.html Regards, André
02-20-2022
01:16 AM
@CloudNaive , It seems that the "Message Key Field" property doesn't work with a nested structure. This property must refer to a top-level field of the record that contains the key value. In my example, I used UpdateRecord to create a top-level field called "id" and set it to the value of "metadata.aggregateId". Then, in PublishKafkaRecord, I was able to use the "id" field as the Kafka message key. Cheers, André
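The same idea in plain Python, as a minimal sketch: copy the nested value to a top-level field and then use that field as the key. This is not NiFi or Kafka client code; the helper `promote_field` and the sample record are hypothetical and only show the shape of the data before and after.

```python
def promote_field(record, dotted_path, new_name):
    """Return a copy of record with the nested value copied to a top-level field."""
    value = record
    for part in dotted_path.split("."):
        value = value[part]  # walk down one level per path segment
    return {**record, new_name: value}


# Hypothetical record with the key buried in a nested structure.
event = {
    "metadata": {"aggregateId": "order-42"},
    "payload": {"status": "created"},
}

flat = promote_field(event, "metadata.aggregateId", "id")

# The top-level "id" field can now serve as the message key.
message_key = str(flat["id"]).encode("utf-8")
print(flat["id"], message_key)
```

The original nested field is left in place, so downstream consumers that read "metadata.aggregateId" are not affected.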