Member since
07-15-2016
1
Post
0
Kudos Received
0
Solutions
07-15-2016
12:22 AM
Hi, I have a question regarding the exam. If the question
does not specify the output format (json, parquet, etc..), does it mean I
can use any of the available
options in spark? For example, would the output (which I will export via my Spark code) in hdfs
"part0000-.....gz.parquet" be valid (assuming the data inside complies
with the question conditions/criteria). Also, may I used
DataFrames & Spark SQL to process the datasets, instead of plain RDD
if the question does not specify that as well? Thanks
... View more
Labels: