<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question How ingest and group multiline logs files with nifi ? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165292#M29512</link>
    <description>&lt;P&gt;For example I've got this 9 lines in input&lt;/P&gt;&lt;P&gt;24/05/2016 13:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)'&lt;/P&gt;&lt;P&gt;fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement&lt;/P&gt;&lt;P&gt;fr.data.exception.SLDNotFoundException: Mini site inconnu&lt;/P&gt;&lt;P&gt;at fr.services.impl.MiniSiteServiceImpl.lire(MiniSiteServiceImpl.java:89)&lt;/P&gt;&lt;P&gt;at fr.services.impl.EnvoiMailSignalementDs3ServiceImpl.envoyerUnMail(EnvoiMailSignalementDs3ServiceImpl.java:60)&lt;/P&gt;&lt;P&gt;at fr.ressources.MailRessource.envoyerMailSignalementContenuInaproprie(MailRessource.java:41)&lt;/P&gt;&lt;P&gt;24/05/2016 15:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)'&lt;/P&gt;&lt;P&gt;fr.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement &lt;/P&gt;&lt;P&gt;fr.data.exception.SLDNotFoundException: Mini site inconnu&lt;/P&gt;&lt;P&gt;and I'd like with an "extract text" processor to have a property with the value beginning with "24/05/2016 13:40:18,739 ERROR..." and ending just before the next timestamp "24/05/2016 15:40:18,739... " so with the first 6 input lines.&lt;/P&gt;&lt;P&gt;and another property beginning at the second timestamp and ending at the end of the input lines so with the last three input lines.&lt;/P&gt;&lt;P&gt;Is it possible to do this with Nifi ?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Tue, 24 May 2016 21:42:45 GMT</pubDate>
    <dc:creator>thierry_vernhet</dc:creator>
    <dc:date>2016-05-24T21:42:45Z</dc:date>
    <item>
      <title>How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165292#M29512</link>
      <description>&lt;P&gt;For example I've got this 9 lines in input&lt;/P&gt;&lt;P&gt;24/05/2016 13:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)'&lt;/P&gt;&lt;P&gt;fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement&lt;/P&gt;&lt;P&gt;fr.data.exception.SLDNotFoundException: Mini site inconnu&lt;/P&gt;&lt;P&gt;at fr.services.impl.MiniSiteServiceImpl.lire(MiniSiteServiceImpl.java:89)&lt;/P&gt;&lt;P&gt;at fr.services.impl.EnvoiMailSignalementDs3ServiceImpl.envoyerUnMail(EnvoiMailSignalementDs3ServiceImpl.java:60)&lt;/P&gt;&lt;P&gt;at fr.ressources.MailRessource.envoyerMailSignalementContenuInaproprie(MailRessource.java:41)&lt;/P&gt;&lt;P&gt;24/05/2016 15:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)'&lt;/P&gt;&lt;P&gt;fr.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement &lt;/P&gt;&lt;P&gt;fr.data.exception.SLDNotFoundException: Mini site inconnu&lt;/P&gt;&lt;P&gt;and I'd like with an "extract text" processor to have a property with the value beginning with "24/05/2016 13:40:18,739 ERROR..." and ending just before the next timestamp "24/05/2016 15:40:18,739... " so with the first 6 input lines.&lt;/P&gt;&lt;P&gt;and another property beginning at the second timestamp and ending at the end of the input lines so with the last three input lines.&lt;/P&gt;&lt;P&gt;Is it possible to do this with Nifi ?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 24 May 2016 21:42:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165292#M29512</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-24T21:42:45Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165293#M29513</link>
      <description>&lt;P&gt;You can do this by using ReplaceText to replace ^(\d{2}\/\d{2}\/\d{4}) with some delimiter not in the set (e.g. ~$1), ie. prepend a magic character to the beginning on each Real line. &lt;/P&gt;&lt;P&gt;You can then use SplitContent by the byte you chose to prepend with. This gives you flow files for each log entry. &lt;/P&gt;&lt;P&gt;However, this can be a little heavy. Make sure you're running the latest version of NiFi, and if you're working with large log files, you may need to consider increasing file handle limits. &lt;/P&gt;&lt;P&gt;The flow (template here: &lt;A href="https://community.cloudera.com/legacyfs/online/attachments/4548-split-multi-line-example.xml" target="_blank"&gt;split-multi-line-example.xml&lt;/A&gt;) works for prepending and splitting. You can see here that 2 flowfiles have come out of the 5 line log file sample I put in.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="4547-log-split.png" style="width: 1970px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/20400i81F96C156042C4FF/image-size/medium?v=v2&amp;amp;px=400" role="button" title="4547-log-split.png" alt="4547-log-split.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 11:21:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165293#M29513</guid>
      <dc:creator>sball</dc:creator>
      <dc:date>2019-08-18T11:21:02Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165294#M29514</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks, I'm going to test your solution.&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 13:07:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165294#M29514</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-25T13:07:50Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165295#M29515</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt; &lt;/P&gt;&lt;P&gt;The test is KO. &lt;/P&gt;&lt;P&gt;Before  "replace text" &lt;/P&gt;&lt;PRE&gt;25/05/2016 08:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)' fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement
fr.pe.empl.service.da016.recruteur.minisite.data.exception.SLDNotFoundException: Mini site inconnu
        at fr.pe.empl.service.da016.recruteur.minisite.services.impl.MiniSiteServiceImpl.lire(MiniSiteServiceImpl.java:89)
        at fr.pe.empl.service.da016.recruteur.minisite.services.impl.EnvoiMailSignalementDs3ServiceImpl.envoyerUnMail(EnvoiMailSignalementDs3ServiceImpl.java:60)
        at fr.pe.empl.service.da016.recruteur.minisite.ressources.MailRessource$Proxy$_$_WeldSubclass.envoyerMailSignalementContenuInaproprie(MailRessource$Proxy$_$_WeldSubclass.java)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
25/05/2016 08:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)' fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement
fr.pe.empl.service.da016.recruteur.minisite.data.exception.SLDNotFoundException: Mini site inconnu&lt;/PRE&gt;&lt;P&gt;After "replace text" and the magic character "£|£|£|"&lt;/P&gt;&lt;PRE&gt;£|£|£| 25/05/2016 08:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)' fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement
fr.pe.empl.service.da016.recruteur.minisite.data.exception.SLDNotFoundException: Mini site inconnu
        at fr.pe.empl.service.da016.recruteur.minisite.services.impl.MiniSiteServiceImpl.lire(MiniSiteServiceImpl.java:89)
        at fr.pe.empl.service.da016.recruteur.minisite.services.impl.EnvoiMailSignalementDs3ServiceImpl.envoyerUnMail(EnvoiMailSignalementDs3ServiceImpl.java:60)
        at fr.pe.empl.service.da016.recruteur.minisite.ressources.MailRessource$Proxy$_$_WeldSubclass.envoyerMailSignalementContenuInaproprie(MailRessource$Proxy$_$_WeldSubclass.java)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
£|£|£| 25/05/2016 08:40:18,739 ERROR [ACTIVE] ExecuteThread: '6' for queue: 'weblogic.kernel.Default (self-tuning)' fr.pe.sldng.integration.rest.GestionnaireExceptionSollicitationRest Exception lors du traitement
fr.pe.empl.service.da016.recruteur.minisite.data.exception.SLDNotFoundException: Mini site inconnu&lt;/PRE&gt;&lt;P&gt;But after "split text" Output claim has no change. It does not split... Have you got an idea ? Below split prperties :&lt;/P&gt;&lt;DIV&gt;Byte Sequence Format&lt;IMG src="https://ip1.i.lithium.com/ce2436479c3a5b8f60c59d2f734b6e822ae9b95f/687474703a2f2f31302e3230302e33302e36363a32383038302f6e6966692f696d616765732f69636f6e496e666f2e706e67" alt="Info" style="float: right; margin-right: 6px; margin-top: 4px;" /&gt;
&lt;DIV&gt;&lt;PRE&gt;Text&lt;/PRE&gt;
&lt;/DIV&gt;Byte Sequence&lt;IMG src="https://ip1.i.lithium.com/ce2436479c3a5b8f60c59d2f734b6e822ae9b95f/687474703a2f2f31302e3230302e33302e36363a32383038302f6e6966692f696d616765732f69636f6e496e666f2e706e67" alt="Info" style="float: right; margin-right: 6px; margin-top: 4px;" /&gt;
&lt;DIV&gt;&lt;PRE&gt;£|£|£|&lt;/PRE&gt;
&lt;/DIV&gt;Keep Byte Sequence&lt;IMG src="https://ip1.i.lithium.com/ce2436479c3a5b8f60c59d2f734b6e822ae9b95f/687474703a2f2f31302e3230302e33302e36363a32383038302f6e6966692f696d616765732f69636f6e496e666f2e706e67" alt="Info" style="float: right; margin-right: 6px; margin-top: 4px;" /&gt;
&lt;DIV&gt;&lt;PRE&gt;false&lt;/PRE&gt;
&lt;/DIV&gt;Byte Sequence Location&lt;IMG src="https://ip1.i.lithium.com/ce2436479c3a5b8f60c59d2f734b6e822ae9b95f/687474703a2f2f31302e3230302e33302e36363a32383038302f6e6966692f696d616765732f69636f6e496e666f2e706e67" alt="Info" style="float: right; margin-right: 6px; margin-top: 4px;" /&gt;
&lt;DIV&gt;&lt;PRE&gt;Trailing&lt;/PRE&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 25 May 2016 13:46:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165295#M29515</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-25T13:46:36Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165296#M29516</link>
      <description>&lt;P style="margin-left: 40px;"&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/10685/thierryvernhet.html" nodeid="10685"&gt;@Thierry Vernhet&lt;/A&gt; I've added a template and screenshot of a worked example, which should make it clearer. I suspect the problem you're seeing is around the relation being used to output from the SplitContent processor. If you use the original, or worse, both outputs you will just get the original content back. &lt;/P&gt;&lt;P style="margin-left: 40px;"&gt;Note also that I've used the "Leading" location in my template, since the marker is inserted at the front of a line, and have also used Line-By-Line evaluation in the marker replace text for better memory usage.&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 17:51:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165296#M29516</guid>
      <dc:creator>sball</dc:creator>
      <dc:date>2016-05-26T17:51:01Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165297#M29517</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thanks a lot for your answer. I understand now. But I cannot ignore relation ship "original" because without this relation Nifi doesn't validate my processor. How can you use "splits" relationship without the "original" one ?&lt;/P&gt;&lt;P&gt;Hope It's my last question for this.&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 21:09:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165297#M29517</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-26T21:09:28Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165298#M29518</link>
      <description>&lt;P&gt;The way to deal with this is to mark the original relation as auto-terminated in the SplitContent settings tab. &lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 21:23:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165298#M29518</guid>
      <dc:creator>sball</dc:creator>
      <dc:date>2016-05-26T21:23:58Z</dc:date>
    </item>
    <item>
      <title>Re: How ingest and group multiline logs files with nifi ?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165299#M29519</link>
      <description>&lt;P&gt;Wonderful&lt;/P&gt;&lt;P&gt;Now it's ok Simon&lt;/P&gt;</description>
      <pubDate>Thu, 26 May 2016 21:34:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-ingest-and-group-multiline-logs-files-with-nifi/m-p/165299#M29519</guid>
      <dc:creator>thierry_vernhet</dc:creator>
      <dc:date>2016-05-26T21:34:03Z</dc:date>
    </item>
  </channel>
</rss>

