Reply
Highlighted
Explorer
Posts: 10
Registered: ‎08-12-2015
Accepted Solution

Workflow Portion of CCP: Data Engineer Exam

[ Edited ]

The Workflow portion of the exam has the following expectations:

 

The ability to create and execute various jobs and actions that move data towards greater value and use in a system.

 

This includes the following skills:

 

  • Create and execute a linear workflow with actions that include Hadoop jobs, Hive jobs, Pig jobs, custom actions, etc.
  • Create and execute a branching workflow with actions that include Hadoop jobs, Hive jobs, Pig jobs, custom action, etc.
  • Orchestrate a workflow to execute regularly at predefined times, including workflows that have data dependencies.

Would it be acceptable if we use a combination of bash scripts and cronjobs for this portion?

Cloudera Employee
Posts: 39
Registered: ‎11-19-2014

Re: Workflow Portion of CCP: Data Engineer Exam

If a question does not specify how to perform the task (which most don't), then any solution that achieves the desired result is acceptable.  In some cases, however, problems may require you to work with specific technologies, such as placing data into a table in the Hive metastore or building an Oozie workflow.  You would be best advised to be familiar with both approaches.

 

Devon