<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question What is the best way to decompress/extract different types of incoming files in Apache Nifi? in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/What-is-the-best-way-to-decompress-extract-different-types/m-p/191690#M153771</link>
    <description>&lt;P&gt;Hi all, &lt;/P&gt;&lt;P&gt;thanks in advance!&lt;/P&gt;&lt;P&gt;My issue is regarding &lt;STRONG&gt;Apache Nifi&lt;/STRONG&gt;: &lt;/P&gt;&lt;P&gt;Whats the best way to decompress/extract different types of incoming files?&lt;BR /&gt;&lt;BR /&gt;In my use case I am getting a lot of files which are differently compressed (e.g. &lt;STRONG&gt;tar.gz&lt;/STRONG&gt;, &lt;STRONG&gt;.zip&lt;/STRONG&gt;, &lt;STRONG&gt;.rar&lt;/STRONG&gt;, &lt;STRONG&gt;.tar &lt;/STRONG&gt;or non-compressed&lt;STRONG&gt; .txt/.json&lt;/STRONG&gt;), but I need all of them decompressed:&lt;BR /&gt;&lt;BR /&gt;What I tried is to have every file running through every possible Compress/UnpackContent processor, but it is actually not working and probably not the best way performance-wise:&lt;BR /&gt;&lt;BR /&gt;GetFile -&amp;gt; (...) -&amp;gt; &lt;STRONG&gt;CompressContent &lt;/STRONG&gt;(uncompressing gzip) -&amp;gt; &lt;STRONG&gt;UnpackContent &lt;/STRONG&gt;(extracting .tar) -&amp;gt; &lt;STRONG&gt;UnpackContent &lt;/STRONG&gt;(extracting .zip) -&amp;gt; (...) -&amp;gt; PutFile&lt;BR /&gt;&lt;BR /&gt;for example: a "*.json"-file should run through those processors and nothing should happen, a "tar.gz"-file should get uncompressed (changes his name to ".tar") and after that getting extracted in an UnpackProcessor, so I get an uncompressed file after all.&lt;BR /&gt;&lt;BR /&gt;I hope there is a good solution, thanks once again.&lt;BR /&gt;&lt;BR /&gt;best regards&lt;/P&gt;</description>
    <pubDate>Wed, 08 Nov 2017 06:11:23 GMT</pubDate>
    <dc:creator>onestoneh</dc:creator>
    <dc:date>2017-11-08T06:11:23Z</dc:date>
  </channel>
</rss>

