<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: I get an IllegalArgumentException error when trying to read a file with Spark 1.4.1 in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96725#M10270</link>
    <description>&lt;P&gt;After adding org.xerial.snappy.tempdir to a newly created directory with rwx permissions, spark works fine now. &lt;/P&gt;</description>
    <pubDate>Sun, 03 Jan 2016 10:53:44 GMT</pubDate>
    <dc:creator>WELI</dc:creator>
    <dc:date>2016-01-03T10:53:44Z</dc:date>
    <item>
      <title>I get an IllegalArgumentException error when trying to read a file with Spark 1.4.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96722#M10267</link>
      <description>&lt;P&gt;Running a Spark command to read a file and get an illegalArgumentException. This is HDP 2.3.1 and Spark 1.4.1. Same error occurs with PySpark. The error appears to come from the SnappyCompressionCodec.&lt;/P&gt;&lt;P&gt;scala&amp;gt; &lt;STRONG&gt;var file = sc.textFile("hdfs://HdpTest:8020/user/weli/README.md")&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;java.lang.IllegalArgumentException&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;  at org.apache.spark.io.SnappyCompressionCodec.&amp;lt;init&amp;gt;(CompressionCodec.scala:152)&lt;/P&gt;&lt;P&gt;  at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)&lt;/P&gt;</description>
      <pubDate>Mon, 09 Nov 2015 09:21:56 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96722#M10267</guid>
      <dc:creator>SQLShaw</dc:creator>
      <dc:date>2015-11-09T09:21:56Z</dc:date>
    </item>
    <item>
      <title>Re: I get an IllegalArgumentException error when trying to read a file with Spark 1.4.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96723#M10268</link>
      <description>&lt;P&gt;scala&amp;gt; var file = sc.textFile("hdfs://nsfed01.cloud.hortonworks.com:8020/tmp/expense.csv")&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO MemoryStore: ensureFreeSpace(200320) called with curMem=0, maxMem=278019440&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO MemoryStore: Block broadcast_0 stored as values in memory (estimated size 195.6 KB, free 264.9 MB)&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO MemoryStore: ensureFreeSpace(18855) called with curMem=200320, maxMem=278019440&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO MemoryStore: Block broadcast_0_piece0 stored as bytes in memory (estimated size 18.4 KB, free 264.9 MB)&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO BlockManagerInfo: Added broadcast_0_piece0 in memory on localhost:40023 (size: 18.4 KB, free: 265.1 MB)&lt;/P&gt;&lt;P&gt;15/11/08 17:34:06 INFO SparkContext: Created broadcast 0 from textFile at &amp;lt;console&amp;gt;:15&lt;/P&gt;&lt;P&gt;file: org.apache.spark.rdd.RDD[String] = MapPartitionsRDD[1] at textFile at &amp;lt;console&amp;gt;:15&lt;/P&gt;&lt;P&gt;scala&amp;gt;&lt;/P&gt;&lt;P&gt;[root@nsfed01 ~]# rpm -qa | grep -i snappy&lt;/P&gt;&lt;P&gt;snappy-1.1.0-1.el6.x86_64&lt;/P&gt;&lt;P&gt;snappy-devel-1.1.0-1.el6.x86_64&lt;/P&gt;&lt;P&gt;The above is from HDP 2.3.2&lt;/P&gt;&lt;P&gt;Not sure if its related &lt;A target="_blank" href="https://issues.apache.org/jira/browse/SPARK-8946"&gt;https://issues.apache.org/jira/browse/SPARK-8946&lt;/A&gt;&lt;/P&gt;&lt;P&gt;@&lt;A href="http://community.hortonworks.com/users/186/sshaw.html"&gt;Scott Shaw&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 09 Nov 2015 09:39:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96723#M10268</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2015-11-09T09:39:23Z</dc:date>
    </item>
    <item>
      <title>Re: I get an IllegalArgumentException error when trying to read a file with Spark 1.4.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96724#M10269</link>
      <description>&lt;P&gt;HDP-2.3.2.0-2950&lt;/P&gt;&lt;P&gt;Spark: 1.4.1.2.3&lt;/P&gt;&lt;P&gt;$ rpm -qa |grep -i snappy
&lt;/P&gt;&lt;P&gt;     snappy-devel-1.1.0-3.el7.x86_64
snappy-1.1.0-3.el7.x86_64&lt;/P&gt;&lt;P&gt;Hortonwork support suggests us to apply spark 1.5.1 TP repo. But our Unix admin needs a tar.gz file to set up a local repo. Anyone knows the link to tar file?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Dec 2015 06:26:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96724#M10269</guid>
      <dc:creator>WELI</dc:creator>
      <dc:date>2015-12-15T06:26:45Z</dc:date>
    </item>
    <item>
      <title>Re: I get an IllegalArgumentException error when trying to read a file with Spark 1.4.1</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96725#M10270</link>
      <description>&lt;P&gt;After adding org.xerial.snappy.tempdir to a newly created directory with rwx permissions, spark works fine now. &lt;/P&gt;</description>
      <pubDate>Sun, 03 Jan 2016 10:53:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-get-an-IllegalArgumentException-error-when-trying-to-read/m-p/96725#M10270</guid>
      <dc:creator>WELI</dc:creator>
      <dc:date>2016-01-03T10:53:44Z</dc:date>
    </item>
  </channel>
</rss>

