<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How Retrieve entire records in Hbase in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205291#M78847</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/81207/kolysall.html" nodeid="81207"&gt;@Koly SALL&lt;/A&gt;&lt;P&gt;&lt;STRONG&gt;Q: After starting GetHBase, must we stop this processor?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;A: The left-hand side of the flow stops the GetHBase processor for you: each time that flow runs, its first step is to stop GetHBase.&lt;/P&gt;&lt;P&gt;Schedule the left-hand side of the flow with the Cron or Timer driven scheduling strategy. Each run then performs three steps:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;First, trigger a shell script that stops the GetHBase processor.&lt;/LI&gt;&lt;LI&gt;Second, clear the state of the GetHBase processor.&lt;/LI&gt;&lt;LI&gt;Third, start the GetHBase processor.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Once the &lt;STRONG&gt;GetHBase&lt;/STRONG&gt; processor is started, it runs according to its scheduling strategy.&lt;/P&gt;&lt;P&gt;For example, if GetHBase is scheduled to run every 5 minutes, it runs every five minutes and checks whether any new records have been added to the HBase table.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;What happens when we schedule the processor to run every 10000 min, i.e. ~167 hours (about 7 days)?&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;The processor &lt;STRONG&gt;runs once as soon as we start it&lt;/STRONG&gt;, and the &lt;STRONG&gt;next run&lt;/STRONG&gt; is triggered after &lt;STRONG&gt;10000 min&lt;/STRONG&gt;.&lt;/LI&gt;&lt;LI&gt;So the processor runs only once in about 7 days; this schedule ensures that it does not run again and again after being started.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;Q: Is it imperative to schedule the processor at 10000 min?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;A: With this schedule the processor runs only once every 10000 min; any similarly long interval serves the same purpose.&lt;/P&gt;&lt;P&gt;In addition, if you want to make sure that GetHBase has finished pulling all the records before you stop it again, check the processor's ActiveThreadCount value: only when ActiveThreadCount is 0 should you stop the processor --&amp;gt; clear the state --&amp;gt; start the processor again.&lt;/P&gt;</description>
    <pubDate>Tue, 29 May 2018 20:21:55 GMT</pubDate>
    <dc:creator>Shu_ashu</dc:creator>
    <dc:date>2018-05-29T20:21:55Z</dc:date>
    <item>
      <title>How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205288#M78844</link>
      <description>&lt;P&gt;Hello everybody, I have a small problem with GetHBase: I can retrieve only the new records, whereas I need to retrieve all the records present in a table.&lt;/P&gt;&lt;P&gt;Thank you for your help.&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Koly&lt;/P&gt;</description>
      <pubDate>Tue, 29 May 2018 17:42:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205288#M78844</guid>
      <dc:creator>koly_sall</dc:creator>
      <dc:date>2018-05-29T17:42:09Z</dc:date>
    </item>
    <item>
      <title>Re: How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205289#M78845</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/81207/kolysall.html" nodeid="81207" target="_blank"&gt;@Koly SALL&lt;/A&gt;&lt;P&gt;You can use the ScanHBase processor, introduced in NiFi 1.6; this processor does not store state.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;(or)&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Using the REST API, you can clear the state stored by the GetHBase processor before fetching all the records from the HBase table.&lt;/P&gt;&lt;P&gt;With this approach, we first stop the GetHBase processor, then clear its state, then start the GetHBase processor again.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Flow:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="77409-flow.png" style="width: 1471px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/17113iCD6D2FAB8AD3976D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="77409-flow.png" alt="77409-flow.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;On the left-hand side of the flow in the screenshot, do the following:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Step1: &lt;/EM&gt;&lt;/STRONG&gt;Stop GetHBase Processor&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Refer to &lt;A href="https://community.hortonworks.com/questions/122097/can-i-stop-invokehttp-processor-by-another-process.html" target="_blank" rel="nofollow noopener noreferrer"&gt;this&lt;/A&gt; link for how to stop a processor using the REST API, and use Chrome developer tools to see which API calls the NiFi UI makes when you stop the processor.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;EM&gt;Step2:&lt;/EM&gt; &lt;/STRONG&gt;Clear the state of GetHBase processor&lt;/P&gt;&lt;PRE&gt;curl -X POST &lt;A href="http://localhost:8080/nifi-api/processors/&amp;lt;processor-id&amp;gt;/state/clear-requests" target="_blank" rel="nofollow noopener noreferrer"&gt;http://localhost:8080/nifi-api/processors/&amp;lt;processor-id&amp;gt;/state/clear-requests&lt;/A&gt;&lt;/PRE&gt;&lt;P style="margin-left: 20px;"&gt;&lt;STRONG&gt;(or)&lt;/STRONG&gt;&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Use an InvokeHTTP processor with the HTTP method set to&lt;STRONG&gt; POST&lt;/STRONG&gt; to send the clear-state request.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;STRONG&gt;&lt;EM&gt;Step3:&lt;/EM&gt; &lt;/STRONG&gt;Start GetHBase Processor&lt;/P&gt;&lt;P style="margin-left: 20px;"&gt;Refer to &lt;A href="https://community.hortonworks.com/questions/122097/can-i-stop-invokehttp-processor-by-another-process.html" rel="nofollow noopener noreferrer" target="_blank"&gt;this&lt;/A&gt; link for how to start a processor using the REST API, and use Chrome developer tools to see which API calls the NiFi UI makes when you start the processor.&lt;/P&gt;&lt;P&gt;Now, on the right-hand side of the flow, schedule the GetHBase processor to run once, i.e. use Timer driven as the scheduling strategy and set the run schedule to 10000 min (or a similarly long interval).&lt;/P&gt;&lt;P&gt;This way, only the left-hand side of the flow needs to be scheduled: step 3 starts the GetHBase processor, and the processor itself is scheduled to run only once.&lt;/P&gt;&lt;P&gt;Let us know if you face any issues.&lt;/P&gt;&lt;P&gt;-&lt;/P&gt;&lt;P&gt;If the answer addressed your question, &lt;STRONG&gt;click the Accept button below to accept the answer&lt;/STRONG&gt;; that helps community users find solutions quickly for these kinds of issues.&lt;/P&gt;</description>
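The stop, clear state, start sequence from the answer above could be scripted roughly as follows. This is a minimal sketch, not the author's flow: it assumes an unsecured NiFi 1.x instance at the address used in the curl command, and it assumes that a processor's run state can be changed with a PUT on the processor resource that echoes back the current revision. Only the clear-requests endpoint comes from the post; the function names are hypothetical.

```python
import json
import urllib.request
from typing import Optional

# Assumption: unsecured NiFi at the same address as the curl example in the post.
NIFI_API = "http://localhost:8080/nifi-api"


def processor_url(processor_id: str) -> str:
    """URL of the processor resource (assumed endpoint for reading and updating it)."""
    return f"{NIFI_API}/processors/{processor_id}"


def clear_state_url(processor_id: str) -> str:
    """URL of the clear-state endpoint, taken from the curl command in the post."""
    return f"{processor_url(processor_id)}/state/clear-requests"


def run_state_body(entity: dict, state: str) -> dict:
    """Build the PUT body that asks NiFi to move the processor to RUNNING or STOPPED.

    The revision from the fetched entity is sent back unchanged, because NiFi
    rejects updates that carry a stale revision.
    """
    return {
        "revision": entity["revision"],
        "component": {"id": entity["component"]["id"], "state": state},
    }


def call(method: str, url: str, body: Optional[dict] = None) -> dict:
    """Send one JSON request and return the decoded JSON response (or {} if empty)."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(url, data=data, method=method)
    if data is not None:
        request.add_header("Content-Type", "application/json")
    with urllib.request.urlopen(request) as response:
        text = response.read().decode("utf-8")
    return json.loads(text) if text else {}


def set_run_state(processor_id: str, state: str) -> dict:
    """Fetch the current entity (for its revision), then request the new run state."""
    entity = call("GET", processor_url(processor_id))
    return call("PUT", processor_url(processor_id), run_state_body(entity, state))


def full_reload(processor_id: str) -> None:
    """Step 1: stop GetHBase. Step 2: clear its state. Step 3: start it again."""
    set_run_state(processor_id, "STOPPED")
    call("POST", clear_state_url(processor_id))
    set_run_state(processor_id, "RUNNING")
```

Calling `full_reload` with the processor id copied from the NiFi UI would perform the three steps in order. A secured instance would additionally need authentication, which this sketch leaves out.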
      <pubDate>Sun, 18 Aug 2019 04:52:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205289#M78845</guid>
      <dc:creator>Shu_ashu</dc:creator>
      <dc:date>2019-08-18T04:52:11Z</dc:date>
    </item>
    <item>
      <title>Re: How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205290#M78846</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/18929/yaswanthmuppireddy.html" nodeid="18929"&gt;@Shu&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;Thanks a lot for the answer, I understood it.&lt;/P&gt;&lt;P&gt;Just one question, please: after starting GetHBase, must we stop this processor?&lt;/P&gt;&lt;P&gt;Is it imperative to schedule the processor at 10000 min?&lt;/P&gt;&lt;P&gt;Best Regards,&lt;/P&gt;&lt;P&gt;Koly&lt;/P&gt;</description>
      <pubDate>Tue, 29 May 2018 19:20:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205290#M78846</guid>
      <dc:creator>koly_sall</dc:creator>
      <dc:date>2018-05-29T19:20:10Z</dc:date>
    </item>
    <item>
      <title>Re: How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205291#M78847</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/81207/kolysall.html" nodeid="81207"&gt;@Koly SALL&lt;/A&gt;&lt;P&gt;&lt;STRONG&gt;Q: After starting GetHBase, must we stop this processor?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;A: The left-hand side of the flow stops the GetHBase processor for you: each time that flow runs, its first step is to stop GetHBase.&lt;/P&gt;&lt;P&gt;Schedule the left-hand side of the flow with the Cron or Timer driven scheduling strategy. Each run then performs three steps:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;First, trigger a shell script that stops the GetHBase processor.&lt;/LI&gt;&lt;LI&gt;Second, clear the state of the GetHBase processor.&lt;/LI&gt;&lt;LI&gt;Third, start the GetHBase processor.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Once the &lt;STRONG&gt;GetHBase&lt;/STRONG&gt; processor is started, it runs according to its scheduling strategy.&lt;/P&gt;&lt;P&gt;For example, if GetHBase is scheduled to run every 5 minutes, it runs every five minutes and checks whether any new records have been added to the HBase table.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;What happens when we schedule the processor to run every 10000 min, i.e. ~167 hours (about 7 days)?&lt;/STRONG&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;The processor &lt;STRONG&gt;runs once as soon as we start it&lt;/STRONG&gt;, and the &lt;STRONG&gt;next run&lt;/STRONG&gt; is triggered after &lt;STRONG&gt;10000 min&lt;/STRONG&gt;.&lt;/LI&gt;&lt;LI&gt;So the processor runs only once in about 7 days; this schedule ensures that it does not run again and again after being started.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;STRONG&gt;Q: Is it imperative to schedule the processor at 10000 min?&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;A: With this schedule the processor runs only once every 10000 min; any similarly long interval serves the same purpose.&lt;/P&gt;&lt;P&gt;In addition, if you want to make sure that GetHBase has finished pulling all the records before you stop it again, check the processor's ActiveThreadCount value: only when ActiveThreadCount is 0 should you stop the processor --&amp;gt; clear the state --&amp;gt; start the processor again.&lt;/P&gt;</description>
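The ActiveThreadCount check described above could look roughly like this. It is a sketch under assumptions: it assumes the processor entity returned by the NiFi REST API exposes the count at status.aggregateSnapshot.activeThreadCount, and the names `wait_until_idle` and `fetch_entity` are hypothetical. The function that fetches the entity is passed in, so the polling logic can be tried without a running NiFi.

```python
import time
from typing import Callable


def active_thread_count(entity: dict) -> int:
    """Read the active thread count from a processor entity.

    Assumption: the value sits at status.aggregateSnapshot.activeThreadCount.
    """
    return entity["status"]["aggregateSnapshot"]["activeThreadCount"]


def wait_until_idle(
    fetch_entity: Callable[[], dict],
    poll_seconds: float = 5.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until the processor reports zero active threads.

    Returns True as soon as a poll sees a count of 0, and False if the
    processor is still busy after max_polls polls.
    """
    for _ in range(max_polls):
        if active_thread_count(fetch_entity()) == 0:
            return True
        sleep(poll_seconds)
    return False
```

Only when `wait_until_idle` returns True would the flow go on to stop the processor, clear the state, and start the processor again; a False result means GetHBase may still be pulling records.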
      <pubDate>Tue, 29 May 2018 20:21:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205291#M78847</guid>
      <dc:creator>Shu_ashu</dc:creator>
      <dc:date>2018-05-29T20:21:55Z</dc:date>
    </item>
    <item>
      <title>Re: How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205292#M78848</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/18929/yaswanthmuppireddy.html" nodeid="18929"&gt;@Shu&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you very much, it's very good &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 May 2018 21:32:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/205292#M78848</guid>
      <dc:creator>koly_sall</dc:creator>
      <dc:date>2018-05-29T21:32:06Z</dc:date>
    </item>
    <item>
      <title>Re: How Retrieve entire records in Hbase</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/295464#M78849</link>
      <description>&lt;P&gt;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/55311"&gt;@Shu_ashu&lt;/a&gt; I don't understand how ScanHBase would work as an alternative when it &lt;A href="https://community.cloudera.com/t5/Support-Questions/NiFi-ScanHBase-Requires-an-Incoming-Relationship/m-p/295000" target="_self"&gt;has an input requirement&lt;/A&gt;, i.e. it can't be used as a root node in a graph for gathering records from HBase indiscriminately. Would you agree?&lt;/P&gt;&lt;P&gt;It seems the only viable solution, then, is to use the REST API as you've suggested.&lt;/P&gt;</description>
      <pubDate>Tue, 05 May 2020 18:45:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Retrieve-entire-records-in-Hbase/m-p/295464#M78849</guid>
      <dc:creator>Santiago</dc:creator>
      <dc:date>2020-05-05T18:45:05Z</dc:date>
    </item>
  </channel>
</rss>

