Member since: 07-19-2018
Posts: 613
Kudos Received: 101
Solutions: 117
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
|  | 5094 | 01-11-2021 05:54 AM |
|  | 3421 | 01-11-2021 05:52 AM |
|  | 8789 | 01-08-2021 05:23 AM |
|  | 8385 | 01-04-2021 04:08 AM |
|  | 36687 | 12-18-2020 05:42 AM |
04-02-2020
11:45 AM
@Gubbi I think your ListFile processor is still executing on a 0 sec schedule. See our private message.
04-02-2020
06:32 AM
1 Kudo
@Gubbi The next solution here is to just add a route for each of today, yesterday, and the day before yesterday, then route all three to the next processor. Anything not matching won't be routed.
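A minimal sketch of what that routing could look like in RouteOnAttribute (the attribute name file_date and the property names are illustrative, not from the thread):

today        ${file_date:equals(${now():format('MM-dd-yyyy')})}
yesterday    ${file_date:equals(${now():minus(86400000):format('MM-dd-yyyy')})}
day_before   ${file_date:equals(${now():minus(172800000):format('MM-dd-yyyy')})}

Each dynamic property becomes a relationship; connect all three to the next processor, and anything unmatched is simply not routed onward.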
04-02-2020
06:29 AM
1 Kudo
@Gubbi The solution here is now() minus 24 hours (86400000 ms):

Yesterday: ${now():minus(86400000):format('MM-dd-yyyy hh:mm:ss')}
Day before yesterday: ${now():minus(86400000):minus(86400000):format('MM-dd-yyyy hh:mm:ss')}
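As a side note (a sketch, equivalent to the above), the two chained minus() calls can be collapsed into a single offset of 172800000 ms:

Day before yesterday: ${now():minus(172800000):format('MM-dd-yyyy hh:mm:ss')}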
04-01-2020
12:50 PM
@Saisreenath According to this article, you should be in the folder containing the script you want to execute when setting the execution permissions. NiFi needs permission to access the script, and it also needs permission to execute the command you gave it. Your error says it cannot run the python command, so you may need to adjust that command path. In summary: make sure you give correct permissions to everything you reference in the Properties tab of the NiFi processor, and use the correct path for python.
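For example, a minimal permissions check from the shell (the script path /opt/scripts/convert.py and the nifi service user are hypothetical; adjust to your environment):

# Hypothetical script path and service user -- substitute your own.
chmod 755 /opt/scripts/convert.py        # read + execute for the NiFi user
chown nifi:nifi /opt/scripts/convert.py  # owned by the user running NiFi
which python                             # confirm the full interpreter path, e.g. /usr/bin/python

Then reference the full interpreter path and the full script path in the processor's properties rather than relying on PATH.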
03-30-2020
07:51 PM
Thanks Steven
03-29-2020
10:38 AM
Yes, by adding the jar to the classpath I am able to import without using the -libjars option, as shown below.
export HADOOP_CLASSPATH=/app/hadoop_users/Mahfooz/sqoop/mysql-connector-java-5.1.48.jar
sqoop import \
--connect "jdbc:mysql://localhost:3306/move?verifyServerCertificate=false&zeroDateTimeBehavior=round" \
--username "something" \
--password "something" \
--delete-target-dir \
--table users \
--fields-terminated-by "," \
--hive-import \
--create-hive-table \
--hive-table test.users \
-- \
--schema "move"
But what is the use of this -libjars option? I am still confused.
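For reference, a hedged sketch of the -libjars form — as a generic Hadoop option it must come immediately after the tool name, before the tool-specific options:

sqoop import \
  -libjars /app/hadoop_users/Mahfooz/sqoop/mysql-connector-java-5.1.48.jar \
  --connect "jdbc:mysql://localhost:3306/move?verifyServerCertificate=false&zeroDateTimeBehavior=round" \
  --username "something" \
  --password "something" \
  --table users

The difference: -libjars ships the jar to the launched MapReduce tasks via the distributed cache, whereas HADOOP_CLASSPATH only affects the local client JVM's classpath.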
03-29-2020
05:58 AM
@saaga119 The solution here is to create an external table in the format you need to query the raw XML data, which it seems you have done already. Next, create a native Hive table (it could also be external) whose schema has the fields you want, with the required ORC clauses; you could also choose another text format such as CSV. Finally, run INSERT INTO new_table SELECT * FROM external_table. The main idea is to use external/staging tables, then INSERT INTO ... SELECT to fill the final table you want. The final table can then be optimized for performance (ORC). This approach also allows multiple external/raw/staging tables to be combined (SELECT with JOIN) into a single final optimized table (ORC, compression, partitions, buckets, etc.).
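A minimal HiveQL sketch of that staging pattern (table names, columns, and paths are illustrative):

-- Staging: external table over the raw data (schema illustrative)
CREATE EXTERNAL TABLE staging_users (id INT, name STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/data/staging/users';

-- Final: optimized ORC table with the desired schema
CREATE TABLE users_orc (id INT, name STRING)
STORED AS ORC;

-- Fill the final table from the staging table
INSERT INTO TABLE users_orc SELECT id, name FROM staging_users;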
03-28-2020
09:26 AM
1 Kudo
Great, and thanks, it was very useful for my work. Here I will have data in Excel; I need to extract the column names (1st row), create the table with primary key and foreign key, and load the data (2nd row) into the database (MySQL) from NiFi.
03-27-2020
06:01 AM
@Savi I think you only need one date format in your NiFi Expression Language. It seems you know what format the date is upstream (source) and what it needs to be downstream. Given that, you should run the flow up to the CSV reader processor, then List Queue on the FlowFiles and inspect the attributes; adjust the expression and retest until the attribute meets the required format. Inspecting the dates upstream won't show you what the date looks like after the expression and before the reader. Often I find myself testing a flow step by step, confirming at each step that the attributes are correct for the next one, and never moving forward until I am sure after visual inspection.
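For example, a single Expression Language conversion you might test in UpdateAttribute (the attribute name csv_date and both date formats are illustrative):

${csv_date:toDate('MM/dd/yyyy'):format('yyyy-MM-dd')}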
03-26-2020
07:18 AM
Hi @Shelton, We have disabled Ranger authorization in Ambari and allowed Hive to run as the end user instead of the hive user. Still, HiveServer2 is not coming up:

2020-03-26 05:36:05,838 - call['/usr/hdp/current/zookeeper-client/bin/zkCli.sh -server <master node>:2181,<datanode1>:2181,<data node3>:2181 ls /hiveserver2 | grep 'serverUri=''] {}
2020-03-26 05:36:06,497 - call returned (1, 'Node does not exist: /hiveserver2')
2020-03-26 05:36:06,498 - Will retry 1 time(s), caught exception: ZooKeeper node /hiveserver2 is not ready yet. Sleeping for 10 sec(s)

Any clue on this?