Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

hdpcd question

avatar
Explorer

Hello Guys, I am preparing for the hdpcd certification exam and have few questions.

The exam will be on HDP2.4 whereas the version available to download from the hortonworks website is HDP2.6, is there much difference in the 2 version?

Same goes for other tools HIVE,PIG,SQOOP, they all older versions on the exam. Can I be learning on the new versions in HP2.6 and still be able to do the exam?

Will I have access to Ambari HIVE and Ambari PIG in the exam to run my scripts on? or do I have to rely on terminal?

I have done all the tutorials on the hortonworks website for PIG and HIVE, is there any additional work recommended?

thanks

1 ACCEPTED SOLUTION

avatar

@Harjinder Brar

1. You will have access to Ambari only to monitor your cluster and say start or stop a service.

2. You will not have access to any of the Ambari Views like Pig View, Hive View, HDFS Files View etc.

3. You will have to execute all your tasks using the terminal.

4. It is recommended that you practice on the same version of HDP that you are going to get in the exam. Small version changes can lead to lot of problems.

For example - in a lower version of Pig, you have to explicitly cast values when using FOREACH generate statement, but in higher versions of Pig it is not needed.

It is better to get used to the exam version and avoid any time/effort wastage during the exam.

5. The practice exam is your best resource to give you a feel of the environment.

https://hortonworks.com/wp-content/uploads/2015/02/HDPCD-PracticeExamGuide1.pdf

6. If you have finished all the related tutorials available in Hortonworks Tutorial site then you should be good.

Just practice by altering scenarios in those tutorials, create your own questions on those data sets to play around and practice.

Ensure you have covered all objectives for HDPCD:

https://hortonworks.com/services/training/certification/exam-objectives/#hdpcd

Here are a few tips that will help you in the exam -

1. Type your commands using a system editor program like gedit and then copy one line at a time into your terminal.

This way you will stop where the error occurs and also you will be able to see any syntactical errors when you first type your commands in gedit instead of directly on the terminal.

2. When doing Pig or Hive tasks, if the question does not asks you to use a specific execution engine, then always prefer to use tez so that your jobs complete faster than when using MapReduce

In hive session, set exection engine as tez:

hive> set hive.execution.engine = tez;

To open pig session with execution engine as tez, type following in your terminal:

pig -x tez

3. Read the questions properly, don't rush to start writing the solution.

The questions are very easy but their will be fine details that one may miss and ending up not scoring on that task.

For example, in a hurry you may miss reading a certain part which could indicate that you need to add multiple conditions to your where clause or filter statements in hive/pig.

So even though you may have executed task with no error, it will give incorrect answer and you will loose a point.

Wish you all the best!

View solution in original post

7 REPLIES 7

avatar
Contributor

Through terminal you need to execute the script . Go through the topics provided by hortonwork no additional work required .

avatar
Explorer

thank you @Abhishek Kumar. are you saying there will be no ambari for pig and hive available at all in the exam?

that would mean getting used to the terminal a lot.

avatar
Contributor

Ambari will be available but I don't think you will get time to use that .Practice on terminal and go for Amazon hdpcd practice set you will understand the environment for actual exam.

avatar
Contributor

avatar
Explorer

@Abhishek Kumar will do. Regarding the joins and the SQL, should I expect it to be super hard or just average difficulty? cheers

avatar
Contributor

it will be easy dw .

avatar

@Harjinder Brar

1. You will have access to Ambari only to monitor your cluster and say start or stop a service.

2. You will not have access to any of the Ambari Views like Pig View, Hive View, HDFS Files View etc.

3. You will have to execute all your tasks using the terminal.

4. It is recommended that you practice on the same version of HDP that you are going to get in the exam. Small version changes can lead to lot of problems.

For example - in a lower version of Pig, you have to explicitly cast values when using FOREACH generate statement, but in higher versions of Pig it is not needed.

It is better to get used to the exam version and avoid any time/effort wastage during the exam.

5. The practice exam is your best resource to give you a feel of the environment.

https://hortonworks.com/wp-content/uploads/2015/02/HDPCD-PracticeExamGuide1.pdf

6. If you have finished all the related tutorials available in Hortonworks Tutorial site then you should be good.

Just practice by altering scenarios in those tutorials, create your own questions on those data sets to play around and practice.

Ensure you have covered all objectives for HDPCD:

https://hortonworks.com/services/training/certification/exam-objectives/#hdpcd

Here are a few tips that will help you in the exam -

1. Type your commands using a system editor program like gedit and then copy one line at a time into your terminal.

This way you will stop where the error occurs and also you will be able to see any syntactical errors when you first type your commands in gedit instead of directly on the terminal.

2. When doing Pig or Hive tasks, if the question does not asks you to use a specific execution engine, then always prefer to use tez so that your jobs complete faster than when using MapReduce

In hive session, set exection engine as tez:

hive> set hive.execution.engine = tez;

To open pig session with execution engine as tez, type following in your terminal:

pig -x tez

3. Read the questions properly, don't rush to start writing the solution.

The questions are very easy but their will be fine details that one may miss and ending up not scoring on that task.

For example, in a hurry you may miss reading a certain part which could indicate that you need to add multiple conditions to your where clause or filter statements in hive/pig.

So even though you may have executed task with no error, it will give incorrect answer and you will loose a point.

Wish you all the best!