About vivek_b3ds

vivek_b3ds · ‎09-29-2018

Is there any way to do the same task using zookeeper ?

vivek_b3ds · ‎09-28-2018

We are working on application which contains Spark, hdfs, and kafka. We want to deploy this application on existing HDP cluster. So what would be the best approach/way to deploy this application on HDP in less time. What I want to do is to create some script that will co-ordinate with ambari and findout which component is already installed on existing HDP. For. Ex if some HDP cluster doesnot contain spark then it will automatically download (from hortonwork repo) and configure spark on that HDP cluster otherwise simply load all tasks/jobs. Can I use zookeeper to detect which service is installed and detect its state (running/stopped/maintainence) ?

vivek_b3ds · ‎06-19-2018

I'd seen that already but its just for reading not for changing. I need to read and then manipulate the variable's value

vivek_b3ds · ‎06-19-2018

I want to access the process group variable from execute script and then need to change its value. I am using python. I have read the article which describe how to access flowfile attributes but not variables. My requirement is when some job complete successfully then some value is to be stored in variable.

vivek_b3ds · ‎06-04-2018

I am trying to create some tables in Hive from Apache NIFI but I didn't find any exact Processor for that. However I found a processor name PutHiveQl which can be used for DDL/DML operation but I didn't find any property in which I can write the query. If it is the right processor for this purpose then how it can be used in my case.

vivek_b3ds · ‎04-06-2018

I am trying to split an array of record using SplitJson processor. But it fails to split the record. I am unable to find the correct expression for my json. Here is my json [ {"id":"30fb76fa-acbe-463b-830e-66f203bb0911","session_id":"804e8d5b-c266-92b7-4a1d-eed3650d3b4a","tag_name":"call"}, {"id":"23986d19-c91f-4d98-8cfd-08fb26c5ff85","session_id":"804e8d5b-c266-92b7-4a1d-eed3650d3b4a","tag_name":"direct-call"}, {"id":"7c374ae9-b96a-4383-85ce-6d45cbc5f8a4","session_id":"804e8d5b-c266-92b7-4a1d-eed3650d3b4a","tag_name":"homepage"}, {"id":"599bf3e0-2c76-4d38-8349-91cc04e34c33","session_id":"b8f17ef9-d7df-dec0-71e3-3ed28991d396","tag_name":"bounce"}, {"id":"55791f8a-3243-48b3-bb4a-70404a21148d","session_id":"b8f17ef9-d7df-dec0-71e3-3ed28991d396","tag_name":"homepage"} ] I want split each record as seprate flowfile, means there will be five flow files. What is the correct Json path expression ?

vivek_b3ds · ‎04-05-2018

I am building a job in which I have to validate phone numbers and we wants to use 'google-libphonenumber' npm package. I am using javascript in executescript processor. What is the correct way to include the npm package?

vivek_b3ds · ‎04-02-2018

I am using apache nifi to convert json to csv. I want to change the headers of the generated csv . Is there any specific processor for this. I know how to achieve this using ExecuteScript processor but is there any easy approach. Ex. "_id", "name","time" to "id", "browser_name","duration"

vivek_b3ds · ‎03-29-2018

Thanks setting max bin age property works.

vivek_b3ds · ‎03-29-2018

No I am not getting any error. It just hang and do nothing. I mean flow files are reaching to MergeProcessor after after converting record but after that nothing happens. You can see in the new image.