<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How Extract text from a multiline flow and create only one property with the all flow's content ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104711#M29929</link>
    <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/5078/pvillard.html" nodeid="5078"&gt;@Pierre Villard&lt;/A&gt; &lt;/P&gt;&lt;P&gt;My input is as shown below &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;i, John, $100 &lt;/LI&gt;&lt;LI&gt;ii, Kevin, $150 &lt;/LI&gt;&lt;LI&gt;iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I used ExtractText processor with Enable Multiline Mode=true, Enable DOTALL Mode=true and new property line=(.*).&lt;/P&gt;&lt;P&gt;After execution I see below in provenance event in attributes tab &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;line i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;LI&gt;line.0 i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;LI&gt;line.1 i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Expected output&lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;line i, John, $100&lt;/LI&gt;&lt;LI&gt;line.0 ii, Kevin, $150&lt;/LI&gt;&lt;LI&gt;line.1 iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Please suggest.&lt;/P&gt;</description>
    <pubDate>Wed, 26 Oct 2016 21:10:50 GMT</pubDate>
    <dc:creator>vinaychandra_t</dc:creator>
    <dc:date>2016-10-26T21:10:50Z</dc:date>
    <item>
      <title>How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104706#M29924</link>
      <description>&lt;P&gt;Hi everybody&lt;/P&gt;&lt;P&gt;****   In input I've got a flow multiline like this one :&lt;/P&gt;&lt;P&gt;27/05/2016 06:28:34,000 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)' fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement
fr.pe.empl.service.data.exception.SLDNotFoundException: Mini site inconnu
        at fr.pe.empl.service.services.impl.MiniSiteServiceImpl.lire(MiniSiteServiceImpl.java:89)
        at &lt;/P&gt;&lt;P&gt;**** Then I use "ExtractText" processor with multiline mode=true and with a new property grok=^(.*)$&lt;/P&gt;&lt;P&gt;And in ouput this property ${grok} has only the first line.&lt;/P&gt;&lt;P&gt;*** My question, how can I retrieve all the input lines in this property ?&lt;/P&gt;&lt;P&gt;Thanks for your answer.&lt;/P&gt;</description>
      <pubDate>Fri, 27 May 2016 13:45:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104706#M29924</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-27T13:45:08Z</dc:date>
    </item>
    <item>
      <title>Re: How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104707#M29925</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/10685/thierryvernhet.html" nodeid="10685"&gt;@Thierry Vernhet&lt;/A&gt;,&lt;/P&gt;&lt;P&gt;To achieve what you are looking for, I believe you must set the property "Enable DOTALL mode" to true.&lt;/P&gt;&lt;P&gt;Below is a template that produces the expected result with the example you gave.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.cloudera.com/legacyfs/online/attachments/4599-extracttextall.xml"&gt;extracttextall.xml&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;</description>
      <pubDate>Fri, 27 May 2016 15:22:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104707#M29925</guid>
      <dc:creator>pvillard</dc:creator>
      <dc:date>2016-05-27T15:22:23Z</dc:date>
    </item>
    <item>
      <title>Re: How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104708#M29926</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/5078/pvillard.html" nodeid="5078"&gt;@Pierre Villard&lt;/A&gt;   It's OK, Thanks a lot.&lt;/P&gt;</description>
      <pubDate>Fri, 27 May 2016 15:50:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104708#M29926</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-27T15:50:21Z</dc:date>
    </item>
    <item>
      <title>Re: How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104709#M29927</link>
      <description>&lt;P&gt;Keep in mind that FlowFile Attributes live in memory.   Loading a FlowFile Attribute with the entire content of the file is going to have an impact on heap usage in your flow. That being said, there are two things to consider when building dataflows like this:
1. Increasing the the size of the available heap for the NiFi application. Heap space thresholds for NiFi are configured in the bootstrap.conf file and by default are very small  (512 MB).
            &lt;/P&gt;&lt;P&gt;# JVM memory settings &lt;/P&gt;&lt;P&gt;java.arg.2=-Xms512m &lt;/P&gt;&lt;P&gt;java.arg.3=-Xmx512m&lt;/P&gt;&lt;P&gt;2. You must take in to consideration the data volumes you will be working with in the particular dataflow.  To help prevent out of memory error in NiFi, we have established a threshold on how much data can queue on a connection before FlowFile's attributes are swapped out of heap to disk. The default configuration in the nifi.properties file is 20,000.  ( nifi.queue.swap.threshold=20000 )  this is per connection not per flow.  So if the FlowFiles you extracted content in begin to queue on numerous connections, you run the risk of hitting the out of memory condition quicker.  You can decrease this value so swapping happens sooner, but that will in turn have an impact on performance. &lt;/P&gt;&lt;P&gt;     I would start with increasing the heap memory for your NiFi and the go from there.&lt;/P&gt;</description>
      <pubDate>Fri, 27 May 2016 19:53:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104709#M29927</guid>
      <dc:creator>MattWho</dc:creator>
      <dc:date>2016-05-27T19:53:37Z</dc:date>
    </item>
    <item>
      <title>Re: How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104710#M29928</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/525/mclark.html" nodeid="525"&gt;@mclark&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks. Before ExtractText we use Tailfile. So every flow contains only a few records of the entire file.&lt;/P&gt;</description>
      <pubDate>Mon, 30 May 2016 14:39:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104710#M29928</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-30T14:39:32Z</dc:date>
    </item>
    <item>
      <title>Re: How Extract text from a multiline flow and create only one property with the all flow's content ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104711#M29929</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/5078/pvillard.html" nodeid="5078"&gt;@Pierre Villard&lt;/A&gt; &lt;/P&gt;&lt;P&gt;My input is as shown below &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;i, John, $100 &lt;/LI&gt;&lt;LI&gt;ii, Kevin, $150 &lt;/LI&gt;&lt;LI&gt;iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I used ExtractText processor with Enable Multiline Mode=true, Enable DOTALL Mode=true and new property line=(.*).&lt;/P&gt;&lt;P&gt;After execution I see below in provenance event in attributes tab &lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;line i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;LI&gt;line.0 i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;LI&gt;line.1 i, John, $100 ii, Kevin, $150 iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Expected output&lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;line i, John, $100&lt;/LI&gt;&lt;LI&gt;line.0 ii, Kevin, $150&lt;/LI&gt;&lt;LI&gt;line.1 iii, Steve, $200&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Please suggest.&lt;/P&gt;</description>
      <pubDate>Wed, 26 Oct 2016 21:10:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-Extract-text-from-a-multiline-flow-and-create-only-one/m-p/104711#M29929</guid>
      <dc:creator>vinaychandra_t</dc:creator>
      <dc:date>2016-10-26T21:10:50Z</dc:date>
    </item>
  </channel>
</rss>

