<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: parquet-tools :: No FileSystem for scheme hdfs in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86995#M35849</link>
    <description>Great to hear the issue got resolved! Thanks for the feedback!</description>
    <pubDate>Wed, 27 Feb 2019 19:47:45 GMT</pubDate>
    <dc:creator>lwang</dc:creator>
    <dc:date>2019-02-27T19:47:45Z</dc:date>
    <item>
      <title>parquet-tools :: No FileSystem for scheme hdfs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86795#M35846</link>
      <description>&lt;P&gt;Hello Friends:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;On a relatively new installation of &lt;STRONG&gt;CDH6.1&lt;/STRONG&gt;&amp;nbsp;(parcels) with one node for CDH manager and&amp;nbsp;a second node for Master and Slave services (combined), I'm getting this error:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;org.apache.hadoop.fs.UnsupportedFileSystemException:
    No FileSystem for scheme "hdfs"&lt;/PRE&gt;
&lt;P&gt;after running this:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;user$ /opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/bin/parquet-tools \
          cat hdfs://tmp/1.parquet&lt;/PRE&gt;
&lt;P&gt;Here is the output of &lt;STRONG&gt;hadoop classpath&lt;/STRONG&gt;:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;/etc/hadoop/conf:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop/lib/*:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop/.//*:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop-hdfs/./:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop-hdfs/.//*:/opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/.//*:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop-yarn/lib/*:/opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/lib/hadoop/libexec/../../hadoop-yarn/.//*&lt;/PRE&gt;
&lt;P&gt;Some pertinent environment variables:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;user$ env | egrep -i 'hadoop|classpath'
HADOOP_HOME=/opt/cloudera/parcels/CDH/lib/hadoop&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Finally, there are two Java distributions installed: one &lt;STRONG&gt;OpenJDK&lt;/STRONG&gt; and the other installed by the&amp;nbsp;&lt;STRONG&gt;CDH6.x&lt;/STRONG&gt; installation wizard. I tried running the above &lt;STRONG&gt;parquet-tools&lt;/STRONG&gt; command with each Java distribution exported, but both yield the same error. Here are the Java distributions:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;user$ ls -al /usr/java /usr/lib/jvm
/usr/java:
total 12
drwxr-xr-x  3 root root 4096 Feb  1 01:52 .
drwxr-xr-x 14 root root 4096 Jan 21 21:01 ..
lrwxrwxrwx  1 root root   21 Feb  1 01:52 current.d -&amp;gt; jdk1.8.0_141-cloudera
drwxrwxr-x  8 root root 4096 Jan 21 21:01 jdk1.8.0_141-cloudera

/usr/lib/jvm:
total 24
drwxr-xr-x  4 root root  4096 Jan 21 20:44 .
dr-xr-xr-x 44 root root 12288 Feb  6 19:02 ..
lrwxrwxrwx  1 root root    26 Jan 21 20:44 java -&amp;gt; /etc/alternatives/java_sdk
lrwxrwxrwx  1 root root    32 Jan 21 20:44 java-1.8.0 -&amp;gt; /etc/alternatives/java_sdk_1.8.0
lrwxrwxrwx  1 root root    40 Jan 21 20:44 java-1.8.0-openjdk -&amp;gt; /etc/alternatives/java_sdk_1.8.0_openjdk
drwxr-xr-x  7 root root  4096 Jan 21 20:44 java-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.i386
drwxr-xr-x  7 root root  4096 Jan 21 20:44 java-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.x86_64
lrwxrwxrwx  1 root root    34 Jan 21 20:44 java-openjdk -&amp;gt; /etc/alternatives/java_sdk_openjdk
lrwxrwxrwx  1 root root    21 Jan 21 20:44 jre -&amp;gt; /etc/alternatives/jre
lrwxrwxrwx  1 root root    27 Jan 21 20:44 jre-1.8.0 -&amp;gt; /etc/alternatives/jre_1.8.0
lrwxrwxrwx  1 root root    35 Jan 21 20:44 jre-1.8.0-openjdk -&amp;gt; /etc/alternatives/jre_1.8.0_openjdk
lrwxrwxrwx  1 root root    49 Jan 21 20:44 jre-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.i386 -&amp;gt; java-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.i386/jre
lrwxrwxrwx  1 root root    51 Jan 21 20:44 jre-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.x86_64 -&amp;gt; java-1.8.0-openjdk-1.8.0.191.b12-1.el7_6.x86_64/jre
lrwxrwxrwx  1 root root    29 Jan 21 20:44 jre-openjdk -&amp;gt; /etc/alternatives/jre_openjdk&lt;/PRE&gt;
&lt;P&gt;Note that the cluster is configured to prefer CDH's Java.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Any ideas?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;P.S. Apart from this, the entire cluster is (and has been) running perfectly.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 14:10:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86795#M35846</guid>
      <dc:creator>prismalytics</dc:creator>
      <dc:date>2022-09-16T14:10:56Z</dc:date>
    </item>
    <item>
      <title>Re: parquet-tools :: No FileSystem for scheme hdfs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86941#M35847</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/9326"&gt;@prismalytics&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As&amp;nbsp;documented on the &lt;A href="https://github.com/apache/parquet-mr/tree/master/parquet-tools" target="_self"&gt;Apache GitHub&lt;/A&gt; page for parquet-tools, the tool has to be run through the hadoop jar command to read a file on the HDFS filesystem:&lt;/P&gt;&lt;P&gt;---&lt;BR /&gt;#Run from hadoop&lt;/P&gt;&lt;P&gt;See Commands Usage for command to use&lt;/P&gt;&lt;P&gt;hadoop jar ./parquet-tools-&amp;lt;VERSION&amp;gt;.jar &amp;lt;command&amp;gt; my_parquet_file.lzo.parquet&lt;BR /&gt;---&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Could you please run the hadoop jar command as follows?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hadoop jar /opt/cloudera/parcels/&amp;lt;CDH-VERSION&amp;gt;/jars/parquet-tools-&amp;lt;VERSION&amp;gt;.jar &amp;lt;command&amp;gt; &amp;lt;hdfs path to parquet file&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For example:&lt;/P&gt;&lt;PRE&gt;hadoop jar /opt/cloudera/parcels/CDH-6.1.0-1.cdh6.1.0.p0.770702/jars/parquet-tools-1.9.0-cdh6.1.0.jar cat hdfs://tmp/1.parquet&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks and hope this helps,&lt;/P&gt;&lt;P&gt;Li&lt;/P&gt;</description>
      <pubDate>Tue, 26 Feb 2019 23:29:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86941#M35847</guid>
      <dc:creator>lwang</dc:creator>
      <dc:date>2019-02-26T23:29:17Z</dc:date>
    </item>
    <item>
      <title>Re: parquet-tools :: No FileSystem for scheme hdfs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86991#M35848</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.cloudera.com/t5/user/viewprofilepage/user-id/5016"&gt;@lwang&lt;/a&gt;:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, your solution worked, with one minor tweak:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The path needs &lt;STRONG&gt;hdfs:///&lt;/STRONG&gt; instead of &lt;STRONG&gt;hdfs://&lt;/STRONG&gt;, because with only two slashes &lt;STRONG&gt;tmp&lt;/STRONG&gt; is parsed as the host name rather than as part of the path:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;user$ hadoop jar /opt/cloudera/parcels/CDH/jars/parquet-tools-1.9.0-cdh6.1.0.jar cat hdfs:///tmp/1.parquet&lt;/PRE&gt;&lt;P&gt;or, if fully qualifying the HDFS host, the following (where &lt;STRONG&gt;hdfs://&lt;/STRONG&gt; will do):&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;user$ hadoop jar /opt/cloudera/parcels/CDH/jars/parquet-tools-1.9.0-cdh6.1.0.jar cat hdfs://vps00:8020/tmp/1.parquet&lt;/PRE&gt;&lt;P&gt;Thank you so very much! =:)&lt;/P&gt;</description>
      <pubDate>Wed, 27 Feb 2019 19:27:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86991#M35848</guid>
      <dc:creator>prismalytics</dc:creator>
      <dc:date>2019-02-27T19:27:27Z</dc:date>
    </item>
    <item>
      <title>Re: parquet-tools :: No FileSystem for scheme hdfs</title>
      <link>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86995#M35849</link>
      <description>Great to hear the issue got resolved! Thanks for the feedback!</description>
      <pubDate>Wed, 27 Feb 2019 19:47:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/parquet-tools-No-FileSystem-for-scheme-hdfs/m-p/86995#M35849</guid>
      <dc:creator>lwang</dc:creator>
      <dc:date>2019-02-27T19:47:45Z</dc:date>
    </item>
  </channel>
</rss>

