Member since
03-03-2017
2
Posts
1
Kudos Received
0
Solutions
03-24-2017
10:08 AM
1 Kudo
Hi, I am trying to get through the HANDS-ON TOUR OF APACHE SPARK IN 5 MINUTES tutorial with the python interpreter, but when I try to run multiple columns like this part: %pyspark myLines=sc.textFile('hdfs://sandbox.hortonworks.com/tmp/Hortonworks') myLinesFiltered=myLines.filter(lambdax:len(x)>0) count=myLinesFiltered.count() print count I got syntax error at the end of the second line. If I run it line by line it works fine, but if I try to run two lines, I always get syntax error, no matter what I run. Thanks, Zsoka
... View more
Labels:
- Labels:
-
Apache Zeppelin
03-03-2017
08:27 AM
Hi, I am started to use the Sandbox and I am quite confused how to register my Java UDF in Pig. I tried to put the jar file into the HDFS and in the docker image too, but I always get the same error:
ERROR 101: file 'myudf.jar' does not exist. Should I put the file in the cent os file system? And if the answer is yes, than how can I do it? Thanks, Zsoka
... View more
Labels:
- Labels:
-
Apache Pig