<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Distcp for classified/health-care data in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147156#M44383</link>
    <description>&lt;P&gt;Hi Folks,&lt;/P&gt;&lt;P&gt;I have a nightly job that copies data from Cluster-1 to Cluster-2 using DistCp. The issue is with secured, classified data, which is stored on the source Cluster-1 using TDE and various other techniques. According to the DistCp documentation, it looks like the data is first written to /tmp. Where does DistCp create this /tmp directory?&lt;/P&gt;&lt;P&gt;On the source cluster, at HDFS &amp;lt;root&amp;gt;/tmp, or at&lt;/P&gt;&lt;P&gt;&amp;lt;HDFS_ROOT&amp;gt;/&amp;lt;Very_secured_Data_Dir&amp;gt;/tmp?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
    <pubDate>Tue, 25 Oct 2016 03:04:25 GMT</pubDate>
    <dc:creator>smartninja723</dc:creator>
    <dc:date>2016-10-25T03:04:25Z</dc:date>
    <item>
      <title>Distcp for classified/health-care data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147156#M44383</link>
      <description>&lt;P&gt;Hi Folks,&lt;/P&gt;&lt;P&gt;I have a nightly job that copies data from Cluster-1 to Cluster-2 using DistCp. The issue is with secured, classified data, which is stored on the source Cluster-1 using TDE and various other techniques. According to the DistCp documentation, it looks like the data is first written to /tmp. Where does DistCp create this /tmp directory?&lt;/P&gt;&lt;P&gt;On the source cluster, at HDFS &amp;lt;root&amp;gt;/tmp, or at&lt;/P&gt;&lt;P&gt;&amp;lt;HDFS_ROOT&amp;gt;/&amp;lt;Very_secured_Data_Dir&amp;gt;/tmp?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;SS&lt;/P&gt;</description>
      <pubDate>Tue, 25 Oct 2016 03:04:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147156#M44383</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2016-10-25T03:04:25Z</dc:date>
    </item>
    <item>
      <title>Re: Distcp for classified/health-care data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147157#M44384</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/104/sball.html" nodeid="104"&gt;@Simon Elliston Ball&lt;/A&gt;, &lt;A rel="user" href="https://community.cloudera.com/users/113/jstraub.html" nodeid="113"&gt;@Jonas Straub&lt;/A&gt; &lt;/P&gt;</description>
      <pubDate>Tue, 25 Oct 2016 03:05:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147157#M44384</guid>
      <dc:creator>smartninja723</dc:creator>
      <dc:date>2016-10-25T03:05:36Z</dc:date>
    </item>
    <item>
      <title>Re: Distcp for classified/health-care data</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147158#M44385</link>
      <description>&lt;P&gt;@Smart Solutions&lt;/P&gt;&lt;P&gt;Not sure of the answer to that, but if you're concerned about tmp data being unencrypted or intercepted, you may want to consider copying it over in its encrypted form, without decrypting it first. This also avoids the decryption/re-encryption overhead. The link below discusses the different options for doing this.&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.hortonworks.com/articles/51909/how-to-copy-encrypted-data-between-two-hdp-cluster.html" target="_blank"&gt;https://community.hortonworks.com/articles/51909/how-to-copy-encrypted-data-between-two-hdp-cluster.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Dec 2016 15:39:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Distcp-for-classified-health-care-data/m-p/147158#M44385</guid>
      <dc:creator>egarelnabi</dc:creator>
      <dc:date>2016-12-16T15:39:32Z</dc:date>
    </item>
  </channel>
</rss>

