<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Best way to monitor/move hadoop files through command line or java api or other alternative in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Best-way-to-monitor-move-hadoop-files-through-command-line/m-p/178378#M140625</link>
    <description>&lt;P&gt;I have used both the java api filesystem to get a file and put a file from hdfs and as well using the command line.&lt;/P&gt;&lt;P&gt;I read this article &lt;A href="https://stackoverflow.com/questions/24647992/wildcard-in-hadoops-filesystem-listing-api-calls"&gt;wildcardsHadoopFileSystems&lt;/A&gt;  and was wondering if anyone had suggestions for more complex operations on files and what tools to use&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;For example &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;1. moving many files&lt;/P&gt;&lt;P&gt;2. Monitoring small files or corrupted blocks&lt;/P&gt;&lt;P&gt;3. Doing clean up of old data.&lt;/P&gt;&lt;P&gt;Using the java api the command line or something else.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Mon, 01 Jan 2018 01:32:10 GMT</pubDate>
    <dc:creator>Former Member</dc:creator>
    <dc:date>2018-01-01T01:32:10Z</dc:date>
    <item>
      <title>Best way to monitor/move hadoop files through command line or java api or other alternative</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Best-way-to-monitor-move-hadoop-files-through-command-line/m-p/178378#M140625</link>
      <description>&lt;P&gt;I have used both the java api filesystem to get a file and put a file from hdfs and as well using the command line.&lt;/P&gt;&lt;P&gt;I read this article &lt;A href="https://stackoverflow.com/questions/24647992/wildcard-in-hadoops-filesystem-listing-api-calls"&gt;wildcardsHadoopFileSystems&lt;/A&gt;  and was wondering if anyone had suggestions for more complex operations on files and what tools to use&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;For example &lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;1. moving many files&lt;/P&gt;&lt;P&gt;2. Monitoring small files or corrupted blocks&lt;/P&gt;&lt;P&gt;3. Doing clean up of old data.&lt;/P&gt;&lt;P&gt;Using the java api the command line or something else.&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 01 Jan 2018 01:32:10 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Best-way-to-monitor-move-hadoop-files-through-command-line/m-p/178378#M140625</guid>
      <dc:creator>Former Member</dc:creator>
      <dc:date>2018-01-01T01:32:10Z</dc:date>
    </item>
    <item>
      <title>Re: Best way to monitor/move hadoop files through command line or java api or other alternative</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Best-way-to-monitor-move-hadoop-files-through-command-line/m-p/178379#M140626</link>
      <description>&lt;P&gt;There are multiple ways you can perform various operations on HDFS. You can choose any of the below approach as per your need.&lt;/P&gt;&lt;P&gt;1) Command Line&lt;/P&gt;&lt;P&gt;Most of users use command line to interact with HDFS. HDFS CLI is easy to use. Its easy to automate with scripts. However, HDFS CLI need hdfs client installed on the host. &lt;/P&gt;&lt;P&gt;2) Java Api&lt;/P&gt;&lt;P&gt;If you are familiar with Java and Apache Apis, You can choose to use Java api to communicate with HDFS Cluster. &lt;/P&gt;&lt;P&gt;3) Webhdfs&lt;/P&gt;&lt;P&gt;This is rest api way of accessing HDFS. This approach does not require hdfs client to be installed on host. You can use this api to connect to remote HDFS cluster too. &lt;/P&gt;</description>
      <pubDate>Wed, 03 Jan 2018 05:02:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Best-way-to-monitor-move-hadoop-files-through-command-line/m-p/178379#M140626</guid>
      <dc:creator>yvora</dc:creator>
      <dc:date>2018-01-03T05:02:09Z</dc:date>
    </item>
  </channel>
</rss>

