Member since
02-28-2017
11
Posts
0
Kudos Received
0
Solutions
06-17-2017
08:50 AM
${filename}.${now():toNumber()}
I am newbie when it comes to NIFI . My cluster is managed by Cloudera manager .
I am trying to do simple poc - for checking duplicates files .
I would like to get my file written in HDFS as filename_now_date_timestamps(HH:MM:SS)
I am trying the above expression unable to achive
could anyone help me out this .
... View more
Labels:
- Labels:
-
Apache NiFi
03-02-2017
08:07 AM
@mbigelow My English is not that good so I assume from ur answer that I can I set more than 8gb in yarn.scheduler.maximum-allocation-mb please correct me if I am wrong.
... View more
03-02-2017
01:00 AM
@mbigelow Thanks for the explanation with example. its clear. One last clarification The default - yarn.scheduler.maximum-allocation-mb = 8024 - Will i be able to increase more than 8GB if I have enough Ram in my system.
... View more
03-01-2017
09:57 PM
@mbigelow @mbigelow - Could you please clarify this - You could also increase the mapper memory as you increase the io.sort.mb. 1 . is it mandatory to increase the mapper memory as we increase io.sort.mb - does it have a dependencies . 2. Say if I increase the mapper memory then follow up I have to increase the yarn.scheduler.maximum-allocation-mb because of the yarn.nodemanager.vmem-pmem-ratio = 2.1 yarn.nodemanager.resource.memory.mb = 8192 mapreduce.map.java.opts = 2.5GB mapreduce.map.memory.mb = 3 gb mapreduce.task.io.sort.mb = 4gb - I can do this . 3. yarn.scheduler.maximum-allocation-mb = 8024 - Will i be able to increase the more than 8GB if I have enough Ram in my system. Thanks for the help
... View more
02-28-2017
11:05 PM
Thanks for the information. Does hadoop metrics are collected by default or should we have to enable it. ? Could you please tell me Also one more quick clarification if there is too much spill in mapreduce job does it mean we have to increase io-sort mb , if so whats an ideal number should be can i start with 1000. mapreduce.task.io.sort.mb
... View more
02-28-2017
07:57 PM
Hi Just started learning Hadoop, I have no idea about as to how to check if a mapreduce job is making spill or not . if so correct me if i am wrong we have to increase io-sort size , please help me out with this. 1 . Also what are all the other parameters that needs to be checked if there is too much spill in mapred-site.xml , hadoop-env.sh files.
... View more