<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: merge file in hdfs in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/merge-file-in-hdfs/m-p/216281#M178190</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/45384/evaloschin.html" nodeid="45384"&gt;@eric valoschin&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;Can you try the following command &lt;/P&gt;&lt;PRE&gt;hadoop jar /usr/hdp/2.5.3.0-37/hadoop-mapreduce/hadoop-streaming-2.7.3.2.5.3.0-37.jar \
                   -Dmapred.reduce.tasks=1 \
                   -input "&amp;lt;path-to-input-directory&amp;gt;" \
                   -output "&amp;lt;path-to-output-directory&amp;gt;" \
                   -mapper cat \
                   -reducer cat&lt;/PRE&gt;&lt;P&gt;make sure which version of &lt;STRONG&gt;hadoop streaming jar &lt;/STRONG&gt;you are using by going to &lt;/P&gt;&lt;PRE&gt;/usr/hdp&lt;/PRE&gt;&lt;P&gt;then give the &lt;STRONG&gt;input path&lt;/STRONG&gt; and make sure the &lt;STRONG&gt;output directory&lt;/STRONG&gt; is &lt;STRONG&gt;not existed&lt;/STRONG&gt; as this job will &lt;STRONG&gt;merge the files&lt;/STRONG&gt; and creates the &lt;STRONG&gt;output directory&lt;/STRONG&gt; for you.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Here what i tried:-&lt;/STRONG&gt;&lt;/P&gt;&lt;PRE&gt;#hdfs dfs -ls /user/yashu/folder2/
Found 2 items 
-rw-r--r--   3 hdfs hdfs        150 2017-09-26 17:55 /user/yashu/folder2/part1.txt 
-rw-r--r--   3 hdfs hdfs         20 2017-09-27 09:07 /user/yashu/folder2/part1_sed.txt&lt;/PRE&gt;&lt;PRE&gt;#hadoop jar /usr/hdp/2.5.3.0-37/hadoop-mapreduce/hadoop-streaming-2.7.3.2.5.3.0-37.jar \
&amp;gt;                    -Dmapred.reduce.tasks=1 \
&amp;gt;                    -input "/user/yashu/folder2/" \
&amp;gt;                    -output "/user/yashu/folder1/" \
&amp;gt;                    -mapper cat \
&amp;gt;                    -reducer cat&lt;/PRE&gt;&lt;P&gt;Folder2 having &lt;STRONG&gt;2 files&lt;/STRONG&gt; after running the above command, i am storing the merged files to &lt;STRONG&gt;folder1&lt;/STRONG&gt; directory and the&lt;STRONG&gt; 2&lt;/STRONG&gt; files got merged into &lt;STRONG&gt;1 file&lt;/STRONG&gt; as you can see below. &lt;/P&gt;&lt;PRE&gt;#hdfs dfs -ls /user/yashu/folder1/
Found 2 items 
-rw-r--r--   3 hdfs hdfs          0 2017-10-09 16:00 /user/yashu/folder1/_SUCCESS 
-rw-r--r--   3 hdfs hdfs        174 2017-10-09 16:00 /user/yashu/folder1/part-00000&lt;/PRE&gt;</description>
    <pubDate>Tue, 10 Oct 2017 03:07:17 GMT</pubDate>
    <dc:creator>Shu_ashu</dc:creator>
    <dc:date>2017-10-10T03:07:17Z</dc:date>
  </channel>
</rss>

