Support Questions

Find answers, ask questions, and share your expertise

CCP: Data Engineering Exam

avatar
Explorer

Hello,

 

I'm trying to study for the exam, but I am trying to find out what are the main tools/subject to study for this exam.

 

Should we focus on Sqoop (Import&Export), Flume, Kafka, Hive, Spark and Oozie?

 

Am I missing anything? Or studying more than I should once my background for now is only Sqoop and Hive?

 

Best regards,

David

 

 

1 ACCEPTED SOLUTION

avatar
Community Manager

I think the below wording from the CCP Data Engineer page should cover that question. 

 

What should you expect?

You are given five to eight customer problems each with a unique, large data set, a CDH cluster, and four hours. For each problem, you must implement a technical solution with a high degree of precision that meets all the requirements. You may use any tool or combination of tools on the cluster (see list below) -- you get to pick the tool(s) that are right for the job. You must possess enough industry knowledge to analyze the problem and arrive at an optimal approach given the time allowed. You need to know what you should do and then do it on a live cluster under rigorous conditions, including a time limit and while being watched by a proctor.


Cy Jervis, Manager, Community Program
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

View solution in original post

10 REPLIES 10

avatar
Community Manager

The CCP Data Engineer page explains the skills required and the test environment which you will be using to complete tasks. It should have the information you are looking for. 


Cy Jervis, Manager, Community Program
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
Explorer

Ok, first of all thanks for you reply @cjervis.

 

I've checked that page, but it's not specific, it have all the Cloudera tools which is quite impossible to use them all right?

 

It is important to us, to have guidelines like the page of the Spark certification exam, what does not happens with Data Engineer page.

 

Thanks again,

David

avatar
Community Manager

Ah, I see what you mean now. The certification team has bee updating the pages to add a sample question and additional information but don't seem to have changed this one yet. I'll reach out and see what I can find out. 


Cy Jervis, Manager, Community Program
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
Explorer

That would be great @cjervis.

 

Hope to hear from you as soon as possible 🙂

 

Thanks,

David

avatar
Explorer

Hi @cjervis, do you have some news regarding this topic?

avatar
Community Manager

Sorry about the delay. I heard back from the certification team the other day and didn't get around to posting a reply.

 

Basically, they advised theat the CCP: Data Engineering certification is to show your "mastery" of the subjects covered. As a result they do not provide further details on the exam or sample questions. 


Cy Jervis, Manager, Community Program
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
Explorer

Thanks @cjervis.

 

So we should be able to know the best solution for each topic within the given time, no matter the tools we use?

avatar
Community Manager

I think the below wording from the CCP Data Engineer page should cover that question. 

 

What should you expect?

You are given five to eight customer problems each with a unique, large data set, a CDH cluster, and four hours. For each problem, you must implement a technical solution with a high degree of precision that meets all the requirements. You may use any tool or combination of tools on the cluster (see list below) -- you get to pick the tool(s) that are right for the job. You must possess enough industry knowledge to analyze the problem and arrive at an optimal approach given the time allowed. You need to know what you should do and then do it on a live cluster under rigorous conditions, including a time limit and while being watched by a proctor.


Cy Jervis, Manager, Community Program
Was your question answered? Make sure to mark the answer as the accepted solution.
If you find a reply useful, say thanks by clicking on the thumbs up button.

avatar
New Contributor

I guess that I am not clear still. For the data engineering examination are we expected to code in Scala or Python? It seems to me like it is more about knowing about Cloudera and lots of data usage and transformation. Can someone please answer the question clearly

 

Thanks,

Courtney