Member since
10-01-2015
3933
Posts
1150
Kudos Received
374
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 3557 | 05-03-2017 05:13 PM | |
| 2933 | 05-02-2017 08:38 AM | |
| 3183 | 05-02-2017 08:13 AM | |
| 3145 | 04-10-2017 10:51 PM | |
| 1621 | 03-28-2017 02:27 AM |
02-29-2016
11:02 AM
@nejm hadj Adding more information based on your comments https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.flume.ExecuteFlumeSink/additionalDetails.html You should stick with NiFi and use built in processor to ingest the data from various social media sources. Please do read docs. In NiFi, the contents of a FlowFile are accessed via a stream, but in Flume it is stored in a byte array. This means the full content will be loaded into memory when a FlowFile is processed by the ExecuteFlumeSink processor. You should consider the typical size of the FlowFiles you'll process and the batch size, if any, your sink is configured with when setting NiFi's heap size.
... View more
02-28-2016
05:23 AM
2 Kudos
Thank you!!! I am happy to join this network for the level of support being provided. It keeps me motivated!!!
... View more
07-11-2017
01:23 PM
Hello @Michael Dennis "MD" Uanang can you confirm that this worked on a 3-nodes Zookeeper install? I mean, I need to move my zk cluster from the original 3 hosts to other 3 hosts, will this work repeating 3 times the same procedure?
... View more
02-26-2016
04:28 AM
@ccasano I don't see any issues in having Isilon to store the workflow repositories. Isilon is scalable storage solution and based on my experience, Isilon can be a good solution based on
... View more
03-20-2016
02:25 PM
1 Kudo
Ah cool didn't see that!
... View more
07-26-2016
04:19 PM
on Sandbox 2.5, Datafu is indeed 1.3, validated the function albeit with different dataset DEFINE HCatLoader org.apache.hive.hcatalog.pig.HCatLoader();
DEFINE SampleByKey datafu.pig.sampling.SampleByKey('0.2');
ROWS = load 'sample_08' using HCatLoader();
SAMPLE_BY_total_emp = filter ROWS by SampleByKey(total_emp);
STORE SAMPLE_BY_total_emp into 'sample_total_emp';
[guest@sandbox ~]$ hdfs dfs -cat sample_total_emp/part-v000-o000-r-00000 | head -n 5
11-3011 Administrative services managers 246930 79500
11-9121 Natural sciences managers 43060 123140
13-1032 Insurance appraisers, auto damage 11280 53980
13-1051 Cost estimators 218400 60320
13-1072 Compensation, benefits, and job analysis specialists 116250 57060
... View more
03-03-2016
05:24 PM
2 Kudos
@Smart Solutions When you add Spark through Ambari, you will be asked to choose where to deploy master service (Spark History Service) And then to choose where to deploy clients services Finally you will be asked for several properties screen-shot-2016-03-03-at-61725-pm.png
... View more
12-02-2016
02:30 PM
3 Kudos
@Saurabh I have resolved this kind of error for multiple customers by following below steps: #Command 1: hadoop fs -put /usr/hdp/current/atlas-server/hook/hive/* hdfs://<NN>/user/oozie/share/lib/lib_<Timestamp>/hive/ #Command 2(Please run below command on Oozie server as 'oozie' user): oozie admin -oozie http://<oozie-server:11000/oozie -sharelibupdate Re-run your Oozie workflow, It should succeed without any issue. Hope this helps! Note - Update Oozie sharelib part is missing in the stackoverflow's answer.
... View more
07-20-2016
09:34 PM
@Artem Ervits @Mehrdad Niasari I believe we can lose this question. i have opened new one on default namespace here.
... View more