About TimothySpann

dfossouo · ‎11-27-2017

Hello, Great post need to correct this part : sudo wget http://download.opensuse.org/repositories/home:/oojah:/mqtt/CentOS_CentOS-6/home:oojah:mqtt.repo sudo cp *.repo /etc/yum.repos.d/ sudo yum -y update sudo yum -y install mosquitto step 1 and 2 are fused. Regards

egarelnabi · ‎08-15-2016

Hi @Timothy Spann It really all depends on your particular use case and requirements. First, I'm assuming you have a custom-built application that will be querying this data store. If so, how complex do the queries need to be? Do you need Relational (SQL) or Key-Value store? Also, how much latency can you afford? I would first explore if HBase (or HBase + Phoenix) would be sufficient. This will reduce the number of moving parts you have. If you're set on in-memory data grids/stores then some options would be Redis, Hazelcast, Teracotta Big Memory and GridGain (Apache Ignite). I believe the last two have connectors to Hadoop that allow writing results of MR jobs directly to the data grid (you'll need to confirm that functionality though) Like I said before though, I recommend you exhaust the HBase option before moving out-of-stack.

mburgess · ‎06-14-2017

I confirmed this to be a bug in ConvertJSONToSQL, I have written up NIFI-4071, please see the Jira for details.

jpercivall · ‎07-19-2016

When developing the MQTT processors I used Mosquitto to test. I found it to be a very easy to use and simple to configure broker that handled a decently high throughput even on my laptop. That said, the NiFi MQTT processors should be able to communicate with any broker that handles the vanilla MQTT Api.

TimothySpann · ‎01-24-2018

if you do a PutHDFS it generates an attribute hive.ddl that can be used to create a hive table. you can also generate hive.ddl with updateattribute with your code ${hive.ddl} LOCATION '${absolute.hdfs.path}'

TimothySpann · ‎07-18-2016

Thanks great catch, that was it. Typo.

Dominika · ‎11-30-2016

To get started with the HDCloud for AWS general availability version, visit http://docs.hortonworks.com/HDPDocuments/HDCloudAWS/HDCloudAWS-1.8.0/bk_hdcloud-aws/content/index.html

vjain · ‎07-25-2016

@Timothy Spann There is no officially supported processor to schedule VORA jobs using NiFi. However, A VORA agent communicates directly with the Spark Client when running in Yarn mode. You can write your program in Python or Scala which invokes the VORA classes and then call those scripts through spark-submit in NiFi using the ExecuteCommand processor.

TimothySpann · ‎07-15-2016

su hdfs hadoop fs -mkdir /udf hadoop fs -put urldetector-1.0-jar-with-dependencies.jar /udf/ hadoop fs -put libs/url-detector-0.1.15.jar /udf/ hadoop fs -chown -R hdfs /udf hadoop fs -chgrp -R hdfs /udf hadoop fs -chmod -R 775 /udf Create Hadoop Directories and upload the two necessary libraries. CREATE FUNCTION urldetector as 'com.dataflowdeveloper.detection.URLDetector' USING JAR 'hdfs:///udf/urldetector-1.0-jar-with-dependencies.jar', JAR 'hdfs:///udf/url-detector-0.1.15.jar'; Create Hive Function with those HDFS referenced JARs select http_user_agent,urldetector(remote_host)asurls,remote_host from AccessLogs limit 100; Test the UDF via Hive QL @Description(name="urldetector", value="_FUNC_(string) - detectsurls") public final class URLDetector extends UDF{} Java Header for the UDF set hive.cli.print.header=true; add jar urldetector-1.0-jar-with-dependencies.jar;CREATE TEMPORARY FUNCTION urldetector as 'com.dataflowdeveloper.detection.URLDetector';select urldetector(description) from sample_07 limit 100; You can test with a temporary function through Hive CLI before making the function permanent. mvn compile assembly:single Build the Jar File for Deployment The library from LinkedIn (https://github.com/linkedin/URL-Detector) must be compiled and the JAR used in your code and deployed to Hive. References See: https://github.com/tspannhw/URLDetector for full source code.

TimothySpann · ‎07-08-2016

So the issue was the library I was using was compiled with JDK 8 and everything else is JDK 7. There was no issue listed, JUnit and local Java applications ran fine. When I manually uploaded the JAR, it gave me the dreaded "Unsupported major.minor version 52.0" With a properly compiled library, we will be fine. So make sure you compile in JDK 7 if your Hadoop / Hive platform is JDK 7

Online	Offline
Last Visited	‎05-20-2024 05:42 PM

Member Since	‎01-07-2019 11:58 AM
Last Visited	‎05-20-2024 05:42 PM
Posts	1,973
Kudos received	1122

Cloudera Community

Re: Has anyone tried NiFi consuming (JMSConsume) f...

Re: NiFi Crash after runing chain of lookups

Re: Recommend approach for listening to RSS Feed i...

Re: NiFi ListenFTP Processor Default Data Port

Re: Nifi: Kafka Producer with Avro format in both ...

Re: IoT Example in Apache NiFi: Consuming and Pr...

Re: In-Memory Layer

Re: ConvertJSONtoSQL in Apache NiFi for Sending to...

Re: Best MQTT Broker to Use with HDF

Re: Using HiveQL Processors in Apache NiFi 1.2

Re: SelectHiveQL Fails on JDBC Error

Re: HDP-AWS (Hortonworks-AWS)

Re: SAP HANA / SAP HANA Vora Processor for Apache ...

Making a Hive UDF From A Useful Existing Library

Re: Exception when trying to run Hive UDF