Hi Friends -
Having an issue with the tutorial on the line:
Here is the error message:
Any thoughts as to what is going on here? This seems like a pretty simple command - but apparently something is amiss.
Many thanks for your thoughts,
The first command...
wget –no-check-certificate ‘https://docs.google.com/uc?export=download&id=0BzhlOywnOpq8OWFzQjJObUtlck0’ -O /tmp/littlelog.csv
failed for me, both in Zeppelin and in my SSH session.
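One possible cause, assuming the command was pasted exactly as the tutorial page displays it: the page shows an en dash (–) where wget expects two ASCII hyphens (--), and curly quotes (‘ ’) where the shell expects straight ones. wget would then read "–no-check-certificate" as a URL rather than an option. A minimal sketch that rebuilds the command with plain ASCII characters (the variable names `pasted` and `fixed` are only for illustration):

```shell
# The command as it appears on the tutorial page, typographic characters intact.
pasted='wget –no-check-certificate ‘https://docs.google.com/uc?export=download&id=0BzhlOywnOpq8OWFzQjJObUtlck0’ -O /tmp/littlelog.csv'

# Swap the en dash for "--" and each curly quote for a straight single quote.
fixed=$(printf '%s\n' "$pasted" | sed -e 's/–/--/g' -e "s/‘/'/g" -e "s/’/'/g")

# Print the repaired command so it can be checked before running it.
printf '%s\n' "$fixed"
```

If the printed command looks right, it can be pasted into the SSH session. This only addresses the punctuation; whether the URL still serves the file is a separate question.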
Were you able to get this loaded into /tmp in HDFS?
Hello Lester -
I was able to get this file - but not by using the line from the tutorial. I ended up copying and pasting the URL from that line,
"https://docs.google.com/uc?export=download&id=0BzhlOywnOpq8OWFzQjJObUtlck0", into my browser, downloading the file to my Downloads folder, and then uploading it to HDFS via Ambari on the Sandbox.
I was able to successfully process and get the expected results through the tutorial steps up to that point...
So the file is there - I can perform operations on it - but I cannot figure out what went wrong.
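For anyone who prefers the command line to the Ambari upload, here is a rough sketch of the same step. It assumes the downloaded CSV has already been copied onto the sandbox's local filesystem as /tmp/littlelog.csv and that the `hdfs` client is on the PATH; the `src`, `dest`, and `status` variables are only for illustration.

```shell
# Assumed locations: local copy on the sandbox, and the target path in HDFS.
src=/tmp/littlelog.csv
dest=/tmp/littlelog.csv

if command -v hdfs >/dev/null 2>&1 && [ -f "$src" ]; then
  hdfs dfs -put -f "$src" "$dest"   # -f overwrites an existing copy in HDFS
  hdfs dfs -ls "$dest"              # list the file to confirm it arrived
  status=uploaded
else
  # Nothing to do if the client or the local file is missing.
  status=skipped
  echo "hdfs client or $src not found; nothing uploaded" >&2
fi
```

The guard simply avoids a confusing error when the snippet is run somewhere other than the sandbox.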
And thank you so much for your assistance.
It is actually working fine for me (sans the CLI wget issue), as shown here.
keys: org.apache.spark.rdd.RDD[String] = MapPartitionsRDD at map at <console>:35
fl
id
ny
ny
ca
ca
Issue a "reboot now" command in an SSH window while logged in as root, then check whether you still see this problem once everything comes back up. If so, try running it via the spark-shell CLI instead. You are right that this is a very basic command and SHOULD be working for you.
@Rafael Coss, can someone look at the wget command at the top of http://hortonworks.com/hadoop-tutorial/interacting-with-data-on-hdp-using-scala-and-apache-spark/ as @Mike Vogt and I both had trouble getting it to work as written (we just pulled the file down with our browsers). Also, maybe somebody can adjust the font so it is fixed-width and shows the line breaks. Thx!!