<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Distcp copying between major versions in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412246#M253342</link>
    <description>&lt;P&gt;Thank you very much for the detailed answer!&lt;/P&gt;</description>
    <pubDate>Thu, 04 Sep 2025 11:38:32 GMT</pubDate>
    <dc:creator>vit</dc:creator>
    <dc:date>2025-09-04T11:38:32Z</dc:date>
    <item>
      <title>Distcp copying between major versions</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/411123#M252995</link>
      <description>&lt;P&gt;Hello All.&lt;/P&gt;&lt;P&gt;What problems can there be when copying data between two clusters with different major versions if you use &lt;FONT face="courier new, courier"&gt;hdfs://...&lt;/FONT&gt; instead of &lt;FONT face="courier new, courier"&gt;webhdfs://...&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new, courier"&gt;hadoop distcp hdfs://&amp;lt;namenode&amp;gt;:&amp;lt;port&amp;gt; hdfs://&amp;lt;namenode&amp;gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Examle from documetntation -&lt;/P&gt;&lt;H2&gt;&lt;FONT size="5"&gt;C&lt;FONT&gt;opying between major versions&lt;/FONT&gt;&lt;/FONT&gt;&lt;/H2&gt;&lt;P&gt;Run the &lt;FONT face="courier new, courier"&gt;distcp&lt;/FONT&gt; command on the cluster that runs the higher version of Cloudera, which should be the destination cluster. Use the following syntax:&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new, courier"&gt;hadoop distcp webhdfs://&amp;lt;namenode&amp;gt;:&amp;lt;port&amp;gt; hdfs://&amp;lt;namenode&amp;gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Note the &lt;FONT face="courier new, courier"&gt;webhdfs &lt;/FONT&gt;prefix for the remote cluster, which should be your source cluster. You must use &lt;FONT face="courier new, courier"&gt;webhdfs&lt;/FONT&gt; when the clusters run different major versions. When clusters run the same version, you can use the &lt;FONT face="courier new, courier"&gt;hdfs&lt;/FONT&gt; protocol for better performance.&lt;/P&gt;&lt;P&gt;For example, the following command copies data from a Cloudera source cluster named &lt;FONT face="courier new, courier"&gt;example-source&lt;/FONT&gt; to another Cloudera version destination cluster named &lt;FONT face="courier new, courier"&gt;example-dest:&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new, courier"&gt;hadoop distcp webhdfs://example-source.cloudera.com:8020 hdfs://example-dest.cloudera.com&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 06:18:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/411123#M252995</guid>
      <dc:creator>vit</dc:creator>
      <dc:date>2026-04-21T06:18:24Z</dc:date>
    </item>
    <item>
      <title>Re: Distcp copying between major versions</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412109#M253249</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/72872"&gt;@vit&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for reaching out Cloudera Community.&lt;/P&gt;&lt;P&gt;The hdfs:// protocol allows DataNodes and NameNodes to communicate directly using Hadoop's internal Remote Procedure Call (RPC) mechanism. This protocol is highly optimized for performance within a single cluster version.&amp;nbsp;&amp;nbsp;However, this internal RPC protocol is &lt;STRONG&gt;not guaranteed to be compatible between major versions.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;However&amp;nbsp;webhdfs:// protocol avoids these problems because it is not based on the internal, version-specific RPC system. Instead, it uses a &lt;STRONG&gt;standardized REST API&lt;/STRONG&gt; that communicates over HTTP/S&lt;/P&gt;&lt;P&gt;This is why Cloudera's &lt;A href="https://docs.cloudera.com/runtime/7.3.1/scaling-namespaces/topics/hdfs-distcp-syntax-examples.html#pnavId2" target="_self"&gt;documentation&lt;/A&gt; (and general Hadoop best practice) insists on using webhdfs:// when running distcp between clusters of different major versions.&lt;/P&gt;&lt;P&gt;Hope this helps.&amp;nbsp; While performing distcp, if you face any issues/challenges please don't hesitate to reach out Cloudera Support by raising ticket through MyCloudera portal.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Aug 2025 14:24:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412109#M253249</guid>
      <dc:creator>abdulpasithali</dc:creator>
      <dc:date>2025-08-13T14:24:45Z</dc:date>
    </item>
    <item>
      <title>Re: Distcp copying between major versions</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412246#M253342</link>
      <description>&lt;P&gt;Thank you very much for the detailed answer!&lt;/P&gt;</description>
      <pubDate>Thu, 04 Sep 2025 11:38:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412246#M253342</guid>
      <dc:creator>vit</dc:creator>
      <dc:date>2025-09-04T11:38:32Z</dc:date>
    </item>
    <item>
      <title>Re: Distcp copying between major versions</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412249#M253345</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/72872"&gt;@vit&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm glad that you got the answer which you are looking for.&amp;nbsp; Could you please "Accept as Solution" as well ?&lt;/P&gt;</description>
      <pubDate>Thu, 04 Sep 2025 13:32:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Distcp-copying-between-major-versions/m-p/412249#M253345</guid>
      <dc:creator>abdulpasithali</dc:creator>
      <dc:date>2025-09-04T13:32:26Z</dc:date>
    </item>
  </channel>
</rss>

