<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: copy files within hdfs based on the modified time or access time in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128983#M91676</link>
    <description>&lt;P&gt;1) For moving files from 2nd april to another folder in hdfs.
&lt;/P&gt;&lt;P&gt;for
 i in `hdfs dfs -ls /old_data/dataset/|grep "2016-04-02"|awk '{print 
$8}'`;do echo ${i}; hdfs dfs -mv ${i} /old_data/dataset/TEST/;done &lt;/P&gt;&lt;P&gt;2) Once the above is done you can just setup a crontab.
&lt;/P&gt;&lt;P&gt;Please try this scenario out on a test folder in non prod.&lt;/P&gt;</description>
    <pubDate>Fri, 15 Jul 2016 01:51:35 GMT</pubDate>
    <dc:creator>ssingla</dc:creator>
    <dc:date>2016-07-15T01:51:35Z</dc:date>
    <item>
      <title>copy files within hdfs based on the modified time or access time</title>
      <link>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128982#M91675</link>
      <description>&lt;P&gt;I have to write a script to move files(csv) from one location in hdfs to another staging location in hdfs.(based on date)
As of now I have to move files from April 2nd 2016.
Later I have to schedule it so that files will be picked up for every 1 hr and moved to staging location.
Hive tables are created on top of this staging location.&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 17:28:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128982#M91675</guid>
      <dc:creator>klksrinivas</dc:creator>
      <dc:date>2016-07-14T17:28:21Z</dc:date>
    </item>
    <item>
      <title>Re: copy files within hdfs based on the modified time or access time</title>
      <link>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128983#M91676</link>
      <description>&lt;P&gt;1) For moving files from 2nd april to another folder in hdfs.
&lt;/P&gt;&lt;P&gt;for
 i in `hdfs dfs -ls /old_data/dataset/|grep "2016-04-02"|awk '{print 
$8}'`;do echo ${i}; hdfs dfs -mv ${i} /old_data/dataset/TEST/;done &lt;/P&gt;&lt;P&gt;2) Once the above is done you can just setup a crontab.
&lt;/P&gt;&lt;P&gt;Please try this scenario out on a test folder in non prod.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Jul 2016 01:51:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128983#M91676</guid>
      <dc:creator>ssingla</dc:creator>
      <dc:date>2016-07-15T01:51:35Z</dc:date>
    </item>
    <item>
      <title>Re: copy files within hdfs based on the modified time or access time</title>
      <link>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128984#M91677</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/2508/klksrinivas.html" nodeid="2508"&gt;@Krishna Srinivas&lt;/A&gt; &lt;/P&gt;&lt;P&gt;Have you tried the Falcon mirroring feature ? Instead of cluster to cluster replication, you can try replicating to different directories in the same cluster.&lt;/P&gt;&lt;P&gt;&lt;A href="http://hortonworks.com/hadoop-tutorial/mirroring-datasets-between-hadoop-clusters-with-apache-falcon/" target="_blank"&gt;http://hortonworks.com/hadoop-tutorial/mirroring-datasets-between-hadoop-clusters-with-apache-falcon/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_data_governance/content/section_mirroring_data_falcon.html" target="_blank"&gt;https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_data_governance/content/section_mirroring_data_falcon.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://falcon.apache.org/HDFSDR.html" target="_blank"&gt;https://falcon.apache.org/HDFSDR.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 15 Jul 2016 02:28:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/copy-files-within-hdfs-based-on-the-modified-time-or-access/m-p/128984#M91677</guid>
      <dc:creator>hrongali</dc:creator>
      <dc:date>2016-07-15T02:28:11Z</dc:date>
    </item>
  </channel>
</rss>

