<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hello all, what are the differences between services and processors: what should I create if I want to do a directory check every 15 min ..... in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202285#M68645</link>
    <description>&lt;P&gt;ok .... thank you&lt;/P&gt;</description>
    <pubDate>Thu, 28 Sep 2017 13:54:35 GMT</pubDate>
    <dc:creator>rtianamanantsoa</dc:creator>
    <dc:date>2017-09-28T13:54:35Z</dc:date>
    <item>
      <title>Hello all, what are the differences between services and processors: what should I create if I want to do a directory check every 15 min .....</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202283#M68643</link>
      <description />
      <pubDate>Wed, 27 Sep 2017 14:11:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202283#M68643</guid>
      <dc:creator>rtianamanantsoa</dc:creator>
      <dc:date>2017-09-27T14:11:41Z</dc:date>
    </item>
    <item>
      <title>Re: Hello all, what are the differences between services and processors: what should I create if I want to do a directory check every 15 min .....</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202284#M68644</link>
      <description>&lt;P&gt;Hi &lt;A href="https://community.hortonworks.com/questions/139391/hello-all-what-are-the-differences-between-service.html#" rel="nofollow noopener noreferrer" target="_blank"&gt;@Xtr yarhid&lt;/A&gt;, in Apache NiFi, Controller Services are shared services that Processors can use. For example, if you want to read data from or write data to HBase or Hive tables, the processors involved need HBase or Hive controller services. You first enable those services and then use them in the processors.&lt;/P&gt;&lt;P&gt;Coming back to your question: use the &lt;STRONG&gt;ListHDFS&lt;/STRONG&gt; processor. It &lt;STRONG&gt;stores its state&lt;/STRONG&gt;, and you can schedule it to run every 15 mins using the CRON driven or Timer driven scheduling strategy.&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;If &lt;STRONG&gt;no changes were made to the directory&lt;/STRONG&gt; or its files, it won't list any flowfiles.&lt;/LI&gt;&lt;LI&gt;If there is &lt;STRONG&gt;any change&lt;/STRONG&gt; in the &lt;STRONG&gt;directory or a file&lt;/STRONG&gt;, the &lt;STRONG&gt;processor gives&lt;/STRONG&gt; &lt;B&gt;only the new or changed files&lt;/B&gt; in the directory and updates its state with the &lt;STRONG&gt;timestamp of the new file&lt;/STRONG&gt;. Configure the Directory property to point at the directory you want to check.&lt;/LI&gt;&lt;LI&gt;In this way the &lt;STRONG&gt;ListHDFS&lt;/STRONG&gt; processor gives a flowfile with &lt;STRONG&gt;path and filename&lt;/STRONG&gt; attributes, which the &lt;STRONG&gt;FetchHDFS&lt;/STRONG&gt; processor uses to fetch the data from the HDFS directory.&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;This processor &lt;STRONG&gt;won't fetch any files&lt;/STRONG&gt;; it only &lt;STRONG&gt;lists the files&lt;/STRONG&gt; in the directory, and the &lt;STRONG&gt;FetchHDFS&lt;/STRONG&gt; processor does the &lt;STRONG&gt;actual fetching&lt;/STRONG&gt; of files.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;ListHDFS Configs:&lt;/U&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span 
class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="40560-listhdfs-config.png" style="width: 783px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/17365iAAE6F85B2857220F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="40560-listhdfs-config.png" alt="40560-listhdfs-config.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;In this processor I have set &lt;STRONG&gt;/user/yashu/del_test&lt;/STRONG&gt; as the Directory property, and the processor runs &lt;STRONG&gt;every 900 sec&lt;/STRONG&gt; on &lt;STRONG&gt;Timer driven&lt;/STRONG&gt; scheduling. On its first run, the processor lists all the files in the &lt;STRONG&gt;del_test&lt;/STRONG&gt; directory, and &lt;STRONG&gt;filename and path&lt;/STRONG&gt; are attached as attributes to each flowfile (if you have &lt;STRONG&gt;2 files&lt;/STRONG&gt; in the &lt;STRONG&gt;directory&lt;/STRONG&gt;, there will be &lt;STRONG&gt;2 flowfiles&lt;/STRONG&gt;, each with its own &lt;STRONG&gt;filename and path&lt;/STRONG&gt; attributes).&lt;/P&gt;&lt;P&gt;To see the &lt;STRONG&gt;state&lt;/STRONG&gt; of the &lt;STRONG&gt;ListHDFS&lt;/STRONG&gt; processor, &lt;STRONG&gt;right-click&lt;/STRONG&gt; the &lt;STRONG&gt;processor&lt;/STRONG&gt; and &lt;STRONG&gt;click&lt;/STRONG&gt; the &lt;STRONG&gt;View state button&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;U&gt;FetchHDFS:&lt;/U&gt;&lt;/STRONG&gt;&lt;BR /&gt;Then use the &lt;STRONG&gt;FetchHDFS&lt;/STRONG&gt; processor and &lt;STRONG&gt;leave it with its default configs&lt;/STRONG&gt;, as this processor gets the &lt;STRONG&gt;${path}/${filename}&lt;/STRONG&gt; attributes from the &lt;STRONG&gt;ListHDFS&lt;/STRONG&gt; processor.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="40559-fetch-hdfs.png" style="width: 784px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/17366iB2FF1DC9F14A8A74/image-size/medium?v=v2&amp;amp;px=400" role="button" 
title="40559-fetch-hdfs.png" alt="40559-fetch-hdfs.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;This processor fetches the &lt;STRONG&gt;actual data&lt;/STRONG&gt; from &lt;STRONG&gt;HDFS&lt;/STRONG&gt;, as the &lt;STRONG&gt;ListHDFS processor&lt;/STRONG&gt; only &lt;STRONG&gt;lists the files&lt;/STRONG&gt; that &lt;STRONG&gt;changed in the last 15 mins&lt;/STRONG&gt;.&lt;/P&gt;&lt;P&gt;In addition, after the ListHDFS processor you can use Site-to-Site (S2S): S2S distributes the work across the cluster, and FetchHDFS then does the actual fetching of the data.&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Hope this helps!&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 05:22:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202284#M68644</guid>
      <dc:creator>Shu_ashu</dc:creator>
      <dc:date>2019-08-18T05:22:18Z</dc:date>
    </item>
    <item>
      <title>Re: Hello all, what are the differences between services and processors: what should I create if I want to do a directory check every 15 min .....</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202285#M68645</link>
      <description>&lt;P&gt;ok .... thank you&lt;/P&gt;</description>
      <pubDate>Thu, 28 Sep 2017 13:54:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202285#M68645</guid>
      <dc:creator>rtianamanantsoa</dc:creator>
      <dc:date>2017-09-28T13:54:35Z</dc:date>
    </item>
    <item>
      <title>Re: Hello all, what are the differences between services and processors: what should I create if I want to do a directory check every 15 min .....</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202286#M68646</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/18929/yaswanthmuppireddy.html" nodeid="18929"&gt;@Shu&lt;/A&gt;, yes, your answer helped resolve my problem, thank you very much.&lt;/P&gt;</description>
      <pubDate>Mon, 02 Oct 2017 12:38:42 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hello-all-what-are-the-differences-between-services-and/m-p/202286#M68646</guid>
      <dc:creator>rtianamanantsoa</dc:creator>
      <dc:date>2017-10-02T12:38:42Z</dc:date>
    </item>
  </channel>
</rss>

