<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: lastest HDP 2.6.5.0-292 DataFrame show() throws an error in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180076#M80390</link>
    <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/10779/dalinqin.html" nodeid="10779"&gt;@dalin qin&lt;/A&gt; this type of errors are due multiple versions of same jar in classpath. Could you run &lt;/P&gt;&lt;PRE&gt;lsof -P -p &amp;lt;pid&amp;gt; | grep lz4&lt;/PRE&gt;&lt;P&gt;this will hopefully show the places from where the lz4 jar is being used and probably the incorrect version is being picked. Note: pid is the spark shell pid&lt;/P&gt;&lt;P&gt;HTH&lt;/P&gt;</description>
    <pubDate>Sat, 07 Jul 2018 20:39:50 GMT</pubDate>
    <dc:creator>falbani</dc:creator>
    <dc:date>2018-07-07T20:39:50Z</dc:date>
    <item>
      <title>lastest HDP 2.6.5.0-292 DataFrame show() throws an error</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180075#M80389</link>
      <description>&lt;P&gt;
	Hi ,&lt;/P&gt;&lt;P&gt;
	I'm using latest HDP ,version is 2.6.5.0-292. spark version is 2.3.0&lt;/P&gt;&lt;P&gt;
	when I'm trying to run show() from any DataFrame ,it always throw error :&lt;/P&gt;&lt;P&gt;
	&lt;STRONG&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
	scala&amp;gt; spark.read.csv("/user/a.txt").show() &lt;/P&gt;&lt;P&gt;java.lang.NoSuchMethodError: net.jpountz.lz4.LZ4BlockInputStream.&amp;lt;init&amp;gt;(Ljava/io/InputStream;Z)V
  at org.apache.spark.io.LZ4CompressionCodec.compressedInputStream(CompressionCodec.scala:122)
  at org.apache.spark.sql.execution.SparkPlan.org$apache$spark$sql$execution$SparkPlan$decodeUnsafeRows(SparkPlan.scala:274)
  at org.apache.spark.sql.execution.SparkPlan$anonfun$executeTake$1.apply(SparkPlan.scala:366)
  at org.apache.spark.sql.execution.SparkPlan$anonfun$executeTake$1.apply(SparkPlan.scala:366)
  at scala.collection.TraversableLike$anonfun$flatMap$1.apply(TraversableLike.scala:241)
  at scala.collection.TraversableLike$anonfun$flatMap$1.apply(TraversableLike.scala:241)
  at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
  at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:186)
  at scala.collection.TraversableLike$class.flatMap(TraversableLike.scala:241)
  at scala.collection.mutable.ArrayOps$ofRef.flatMap(ArrayOps.scala:186)
  at org.apache.spark.sql.execution.SparkPlan.executeTake(SparkPlan.scala:366)
  at org.apache.spark.sql.execution.CollectLimitExec.executeCollect(limit.scala:38)
  at org.apache.spark.sql.Dataset.org$apache$spark$sql$Dataset$collectFromPlan(Dataset.scala:3272)
  at org.apache.spark.sql.Dataset$anonfun$head$1.apply(Dataset.scala:2484)
  at org.apache.spark.sql.Dataset$anonfun$head$1.apply(Dataset.scala:2484)
  at org.apache.spark.sql.Dataset$anonfun$52.apply(Dataset.scala:3253)
  at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:77)
  at org.apache.spark.sql.Dataset.withAction(Dataset.scala:3252)
  at org.apache.spark.sql.Dataset.head(Dataset.scala:2484)
  at org.apache.spark.sql.Dataset.take(Dataset.scala:2698)
  at org.apache.spark.sql.execution.datasources.csv.TextInputCSVDataSource$.infer(CSVDataSource.scala:148)
  at org.apache.spark.sql.execution.datasources.csv.CSVDataSource.inferSchema(CSVDataSource.scala:63)
  at org.apache.spark.sql.execution.datasources.csv.CSVFileFormat.inferSchema(CSVFileFormat.scala:57)
  at org.apache.spark.sql.execution.datasources.DataSource$anonfun$8.apply(DataSource.scala:202)
  at org.apache.spark.sql.execution.datasources.DataSource$anonfun$8.apply(DataSource.scala:202)
  at scala.Option.orElse(Option.scala:289)
  at org.apache.spark.sql.execution.datasources.DataSource.getOrInferFileFormatSchema(DataSource.scala:201)
  at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:392)
  at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:239)
  at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:227)
  at org.apache.spark.sql.DataFrameReader.csv(DataFrameReader.scala:596)
  at org.apache.spark.sql.DataFrameReader.csv(DataFrameReader.scala:473)&lt;/P&gt;&lt;P&gt;
	&lt;STRONG&gt;&lt;BR /&gt;
	&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;
	I've tried both pyspark and spark-shell on 3 sets of newly installed hdp  2.6.5.0-292.&lt;/P&gt;&lt;P&gt;the DataFrame writing function works well ,only show() throws the error.&lt;/P&gt;&lt;P&gt;are there anyone encountered same issue as I had? how to fix this problem?&lt;/P&gt;</description>
      <pubDate>Sat, 07 Jul 2018 11:27:32 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180075#M80389</guid>
      <dc:creator>dblive</dc:creator>
      <dc:date>2018-07-07T11:27:32Z</dc:date>
    </item>
    <item>
      <title>Re: lastest HDP 2.6.5.0-292 DataFrame show() throws an error</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180076#M80390</link>
      <description>&lt;P&gt; &lt;A rel="user" href="https://community.cloudera.com/users/10779/dalinqin.html" nodeid="10779"&gt;@dalin qin&lt;/A&gt; this type of errors are due multiple versions of same jar in classpath. Could you run &lt;/P&gt;&lt;PRE&gt;lsof -P -p &amp;lt;pid&amp;gt; | grep lz4&lt;/PRE&gt;&lt;P&gt;this will hopefully show the places from where the lz4 jar is being used and probably the incorrect version is being picked. Note: pid is the spark shell pid&lt;/P&gt;&lt;P&gt;HTH&lt;/P&gt;</description>
      <pubDate>Sat, 07 Jul 2018 20:39:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180076#M80390</guid>
      <dc:creator>falbani</dc:creator>
      <dc:date>2018-07-07T20:39:50Z</dc:date>
    </item>
    <item>
      <title>Re: lastest HDP 2.6.5.0-292 DataFrame show() throws an error</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180077#M80391</link>
      <description>&lt;P&gt;thank you very much ,that' my bad ,I had added some other jars in my class path leading to this error.&lt;/P&gt;</description>
      <pubDate>Sat, 07 Jul 2018 21:16:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/lastest-HDP-2-6-5-0-292-DataFrame-show-throws-an-error/m-p/180077#M80391</guid>
      <dc:creator>dblive</dc:creator>
      <dc:date>2018-07-07T21:16:36Z</dc:date>
    </item>
  </channel>
</rss>

