Member since
08-31-2016
20
Posts
3
Kudos Received
0
Solutions
11-06-2016
10:08 PM
@Qi Wang Thank you. Apologies for repeating the same question again. Which library you have used to write to csv file. I am planning to take the exam in python. Please advise.
... View more
11-06-2016
03:49 AM
@Qi Wang Is it safe to assume that the Databricks package will be available during the test to read and write to csv files ?
pyspark --packages com.databricks:spark-csv_2.10:1.4.0 df.write.format("com.databricks.spark.csv").option("header","true").save("file.csv")
... View more
11-03-2016
04:52 PM
@Qi Wang In Spark 1.6.* version, RDD/dataframe have functions to write only to below formats rdd.saveAsTextFile / saveAsSequenceFile df.write.orc / json / parquet / text / saveAsTable Query: I am sure we can not download other csv packages (i.e. databricks..etc) during the test. Is there any way to write the output file in csv format. Please advise. Thanks in advance.
... View more
10-31-2016
07:46 PM
@Qi Wang 3. Is there anyway, to activate intellisense/auto completion work in HDP environment for spark in python. Either using vi/gedit or by using pyspark shell. Thanks in advance.
... View more
10-31-2016
07:32 PM
@Qi Wang Thank you for your prompt response. Could you please help with below queries. Current sandbox version is HDP 2.5 and supported Spark version is 1.6.2. 1. In the sandbox which I have downloaded, only vi is available, there is no gedit. Do we need to install gedit ? 2. I have learn that Apache Spark documentation and Hortonworks Spark documentation is available during exam. Apache Spark Documentation: https://spark.apache.org/docs . Is this the right link ? Hortonworks Spark documentation: What is hortonworks spark documentation link ? Thanks in advance.
... View more
10-31-2016
06:12 PM
2 Kudos
1. Which version of HDP sandbox is being used ? 2. Which version of Spark being used ? 3. What kind of IDE options are available during the exam for Python? Apart from pyspark-shell, is there any IDE available like IPython or Zeppelin ? Is there any IDE option available which have auto suggestion option and where we can submit jobs to cluster. Please advise. 4. I have read few posts in hortonworks community, that we may use Spark RDDs or Spark Dataframes for accomplishing the tasks? Please confirm. 5. What is the pass percentage on average ?
... View more
Labels:
- Labels:
-
Apache Spark
10-13-2016
03:20 PM
How to pass the user input from angular UI as the dynamic input for the hive SQL query? In the below example, I would like to pass pass the startDate and endDate from angular input --> as the dynamic input to WHERE condition in the hive SQL query?
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Zeppelin
10-13-2016
02:49 PM
In Zeppelin livy interpreter, what should be value for parameter livy.spark.master ? Please advise. livy.spark.master yarn-cluster or livy.spark.master yarn-client
... View more
Labels:
- Labels:
-
Apache Zeppelin