Archives of Support Questions (Read Only)

This is an archived board for historical reference. Information and links may no longer be available or relevant
Announcements
This board is archived and read-only for historical reference. To ask a new question, please post a new topic on the appropriate active board.

CCP: Data Engineering Exam

avatar
Explorer

Hello,

 

I'm trying to study for the exam, but I am trying to find out what are the main tools/subject to study for this exam.

 

Should we focus on Sqoop (Import&Export), Flume, Kafka, Hive, Spark and Oozie?

 

Am I missing anything? Or studying more than I should once my background for now is only Sqoop and Hive?

 

Best regards,

David

 

 

1 ACCEPTED SOLUTION

avatar
Community Manager

I think the below wording from the CCP Data Engineer page should cover that question. 

 

What should you expect?

You are given five to eight customer problems each with a unique, large data set, a CDH cluster, and four hours. For each problem, you must implement a technical solution with a high degree of precision that meets all the requirements. You may use any tool or combination of tools on the cluster (see list below) -- you get to pick the tool(s) that are right for the job. You must possess enough industry knowledge to analyze the problem and arrive at an optimal approach given the time allowed. You need to know what you should do and then do it on a live cluster under rigorous conditions, including a time limit and while being watched by a proctor.


Keep the questions coming,

Cy Jervis | Senior Manager, Knowledge Programs

if (helpful) { mark_as_solution(); } | if (appreciated) { give_kudos(); }

View solution in original post

10 REPLIES 10

avatar
Super Collaborator

https://www.cloudera.com/about/training/certification/faq.html

 

What am I responsible for during the exam? 
These are practical exams. During the exam you will be asked to evaluate a scenario and implement a solution. You are responsible for everything necessary to generate that solution, such as writing code, configuring tools, and debugging any issues. You may use any approach or tools on the cluster that will produce your solution. Only the results will be graded.