Member since
04-05-2018
4
Posts
2
Kudos Received
0
Solutions
06-27-2018
09:39 AM
Thank you @Dave Welden. Do you know to to add the Druid and Superset services to the HDF?
... View more
06-25-2018
04:14 PM
1 Kudo
Hello, I'm following the tutorials for HDF, but I'm struggling to do the "Real-Time Event Processing In NiFi, SAM, Schema Registry and SuperSet". I cannot find in the HDF virtualbox image the Druid and Superset that they use in the tutorial. When I try to add a new service in Ambari doesn't appear the options to install Druid or Superset. What I'm doing wrong? https://hortonworks.com/tutorial/real-time-event-processing-in-nifi-sam-schema-registry-and-superset/ Thank you, Rui
... View more
Labels:
- Labels:
-
Apache Ambari
-
Cloudera DataFlow (CDF)
04-06-2018
05:25 PM
@Shu thank you for this great explanation. For this to be done in the Hive table must have a timestamp for the moment that was created every single row right? other question is, the point 3.ExtractText (Extract the value got stored and keep as start_value) the value that I'm going to copy to start_value come from the current time?
... View more
04-05-2018
04:54 PM
1 Kudo
Hello Everyone, I'm want to copy all the content from a Hive table and tranform it to a JSON file, but must recurrently in order to copy new content that the Hive table could have. I managed to use the processor "SelectHiveQL" to extract the data. The problem is that I can't collect the data that was only created after the last collection of data. Everytime that I access the Hive is collecting all the data duplicating the information. I also tried using the "QueryDatabaseTable" and "GenerateTableFetch" processors but could not get it to work.
Does anyone have a hint how I can do this? Thank you.
... View more
Labels:
- Labels:
-
Apache NiFi