<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: nifi UnpackContent source file missing in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183635#M58728</link>
    <description>&lt;P&gt;Ok what are the exact attribute names that you see in the queue going into UnpackContent that are being lost?&lt;/P&gt;</description>
    <pubDate>Tue, 04 Apr 2017 22:05:14 GMT</pubDate>
    <dc:creator>bbende</dc:creator>
    <dc:date>2017-04-04T22:05:14Z</dc:date>
    <item>
      <title>nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183632#M58725</link>
      <description>&lt;P&gt;I am trying to utilize the UnpackContent processor being fed from a "FetchHDFS", but I am having an issue where the source path of the zip file, prior to being unpacked, is being dropped after being unpacked. Is there a way to either retain the path through the processor or another method I can ensure that path gets added as an attribute to the unpacked flow file?&lt;/P&gt;</description>
      <pubDate>Fri, 31 Mar 2017 23:44:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183632#M58725</guid>
      <dc:creator>camatulli</dc:creator>
      <dc:date>2017-03-31T23:44:36Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183633#M58726</link>
      <description>&lt;P&gt;Can you elaborate more on where you see the source path and where it is getting dropped?&lt;/P&gt;&lt;P&gt;Going into FetchHDFS there should be a flow file with the content being a path to fetch like /data/foo.zip, after FetchHDFS it wrote the content of foo.zip to the flow file content and the filename attribute of the flow file should be foo.zip, then it goes to UnpackContent which produced multiple child flow files that were unpacked and each one should have segment.original.filename with foo.zip.&lt;/P&gt;&lt;P&gt;Are you asking to retain the original HDFS path that went into FetchHDFS?&lt;/P&gt;</description>
      <pubDate>Tue, 04 Apr 2017 21:39:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183633#M58726</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2017-04-04T21:39:10Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183634#M58727</link>
      <description>&lt;P&gt;From the UnpackContent processor on, the original file and path seem to be lost.&lt;/P&gt;&lt;P&gt;I see the path in queue going into the UnpackContent Processor, but after, when the new flow files are generated by the unpackcontent processor, the source file is not in the attributes anymore.&lt;/P&gt;</description>
      <pubDate>Tue, 04 Apr 2017 21:49:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183634#M58727</guid>
      <dc:creator>camatulli</dc:creator>
      <dc:date>2017-04-04T21:49:12Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183635#M58728</link>
      <description>&lt;P&gt;Ok what are the exact attribute names that you see in the queue going into UnpackContent that are being lost?&lt;/P&gt;</description>
      <pubDate>Tue, 04 Apr 2017 22:05:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183635#M58728</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2017-04-04T22:05:14Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183636#M58729</link>
      <description>&lt;P&gt;segment.original.filename is gone and filename and path are with the new values after extract. i am looking for the file name and path that came from the fetch&lt;/P&gt;</description>
      <pubDate>Tue, 04 Apr 2017 22:29:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183636#M58729</guid>
      <dc:creator>camatulli</dc:creator>
      <dc:date>2017-04-04T22:29:20Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183637#M58730</link>
      <description>&lt;P&gt;Before unpack, i have the path of the file in HDFS, and the actual file name in HDFS&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="14449-beforeunpack.jpg" style="width: 643px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/19052iC7C0F6AE4D91E328/image-size/medium?v=v2&amp;amp;px=400" role="button" title="14449-beforeunpack.jpg" alt="14449-beforeunpack.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;After unpack, the "segment.original.filename" contains the filename without extension, and no reference to the source path anymore.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="14450-afterunpack.jpg" style="width: 643px;"&gt;&lt;img src="https://community.cloudera.com/t5/image/serverpage/image-id/19053iD68F3E6B432AFD12/image-size/medium?v=v2&amp;amp;px=400" role="button" title="14450-afterunpack.jpg" alt="14450-afterunpack.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;My main issue is i need that path when feeding to spark to create the relationships.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Aug 2019 08:41:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183637#M58730</guid>
      <dc:creator>camatulli</dc:creator>
      <dc:date>2019-08-18T08:41:34Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183638#M58731</link>
      <description>&lt;P&gt;Thanks for uploading the screenshots. I can see in the code that segment.original.filename is specifically removing the extension and this appears to have been like this since the initial code for NiFi was open-sourced, so I'm not sure if this is considered a bug or really a preference. The path attribute is being updated to reflect the path within the archive, although I believe there could be a bug here, but I believe it makes sense since the path of the children is not necessarily the path of the original flow file.&lt;/P&gt;&lt;P&gt;In the short-term, I think the easiest thing to do is stick an UpdateAttribute processor right before UnpackContent and add two properties that copy the filename and path to new attributes like this:&lt;/P&gt;&lt;P&gt;archive.filename = ${filename}&lt;/P&gt;&lt;P&gt;archive.path = ${path}&lt;/P&gt;&lt;P&gt;The flow files for the unpacked files should retain these attributes.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Apr 2017 23:34:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183638#M58731</guid>
      <dc:creator>bbende</dc:creator>
      <dc:date>2017-04-06T23:34:10Z</dc:date>
    </item>
    <item>
      <title>Re: nifi UnpackContent source file missing</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183639#M58732</link>
      <description>&lt;P&gt;Thanks. This ended up being a good workaround to the attributes dropping, and gave me a couple ideas on how to extend the information i need.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2017 00:27:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/nifi-UnpackContent-source-file-missing/m-p/183639#M58732</guid>
      <dc:creator>camatulli</dc:creator>
      <dc:date>2017-04-07T00:27:20Z</dc:date>
    </item>
  </channel>
</rss>

