HDPCD Spark Certification

Contributor

1. Which version of the HDP sandbox is being used?

2. Which version of Spark is being used?

3. What IDE options are available for Python during the exam? Apart from the pyspark shell, is there any IDE available, such as IPython or Zeppelin? Is there any IDE with auto-suggestion from which we can submit jobs to the cluster? Please advise.

4. I have read in a few posts in the Hortonworks community that we may use either Spark RDDs or Spark DataFrames to accomplish the tasks. Please confirm.

5. What is the pass percentage on average?

1 ACCEPTED SOLUTION

Master Collaborator

The test environment runs on an AWS virtual machine. When I took the test it was HDP 2.3; I am not sure which version is used now. You could use the current sandbox to practice. Spark is something later than 1.4, probably 1.5, but the material covered is all basic RDD and DataFrame usage that is not closely tied to newer versions. The test environment has no IDE. You use either gedit or vi, based on your preference, and debug with spark-shell or pyspark.

A couple of notes on the exam:

1. Know the RDD and DataFrame APIs well. Go through all the docs linked from the test web page.

2. Know how to import and export RDDs/DataFrames from/to CSV files.

3. There is no restriction on how you complete a task, so choose the technique you are most familiar with, either the API or Spark SQL.

4. The test environment is quite slow to respond, so be patient with it and leave enough time for the tasks.

Good luck taking the exam.


14 REPLIES

Master Collaborator

I am not sure about that. I did not use that library.

Contributor

@Qi Wang

Thank you. Apologies for repeating the same question.

Which library did you use to write to a CSV file? I am planning to take the exam in Python. Please advise.

Master Collaborator

I wouldn't assume the package is available. Better to find a way to do that in plain Python.
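As a rough illustration of doing this without an external CSV package, one option is to format each record as a CSV line with Python's standard `csv` module and apply that per record. This is only a sketch under assumptions: the helper names `to_csv_line` and `parse_csv_line` and the sample rows are hypothetical, and the code is written for Python 3 (on Python 2 the `csv` module works with byte strings, so the buffer handling would differ).

```python
import csv
import io


def to_csv_line(row):
    """Format one record (tuple or list) as a single CSV line, no trailing newline."""
    buf = io.StringIO()
    csv.writer(buf).writerow(row)  # quotes fields that contain commas or quotes
    return buf.getvalue().rstrip("\r\n")


def parse_csv_line(line):
    """Parse one CSV line back into a list of string fields."""
    return next(csv.reader([line]))


# Hypothetical sample records; in Spark these would be the elements of an RDD.
rows = [(1, "alice", "NY"), (2, "bob, jr.", "LA")]
lines = [to_csv_line(r) for r in rows]
print(lines)

# With a SparkContext `sc` available (e.g. inside the pyspark shell), the same
# helpers could be applied per record (paths here are placeholders):
#   records = sc.textFile("input.csv").map(parse_csv_line)
#   records.map(to_csv_line).saveAsTextFile("output_dir")
```

Going through the `csv` module rather than `",".join(...)` keeps fields that contain commas intact, at the cost of a little per-record overhead.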

New Contributor

Hi Wang,

Thanks for all the tips you shared. They are really helpful.

Could you please tell me how many tasks are provided and how many we have to complete to pass the test?

Also, it would be a great help if you could share the questions asked, if you remember them.

As there is no practice test available for HDPCD-Spark, I don't have any clue about the pattern of the tasks.

Thank you

Himansu

New Contributor

@Qi Wang

Hello Sir,

Thanks for your input.

Just one question about the pattern. Please let us know how many questions are asked in the exam and what the passing criteria are.

Thank you.