Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

HDPCD: Java exam objectives,HDPCD: Java Exam Objectives

avatar
Explorer

Currently, the objectives for the HDPCD: Java exam state that candidates should be able to "perform a join on two or more datasets" and "perform a map-side join of two datasets". I found this language a little odd as the first objective is very general, but the second states a specific type of join. Does this mean that in addition to being able to complete a task that performs map-side join, we should be familiar with other join methods like a Bloom filter? Or, will the exam really just be testing for the ability to perform a map-side join?

1 ACCEPTED SOLUTION

avatar

"perform a join on two or more datasets" - implies that there are more than 2 data sets involved and thus you may ave to write a solution which can comprise only a Map Join or only a Reduce Join or a combination of both.

In essence, if the data sets are too large and could result in memory issues, then bloom filter is the route to take.

So from a conceptual perspective, it is good to know Bloom Filter even if it is not specifically mentioned in Exam Objectives.

View solution in original post

3 REPLIES 3

avatar

"perform a join on two or more datasets" - implies that there are more than 2 data sets involved and thus you may ave to write a solution which can comprise only a Map Join or only a Reduce Join or a combination of both.

In essence, if the data sets are too large and could result in memory issues, then bloom filter is the route to take.

So from a conceptual perspective, it is good to know Bloom Filter even if it is not specifically mentioned in Exam Objectives.

avatar
New Contributor

@Dinesh I agree with you. The most important question you can find related to java is this Difference between String , Stringbuilder and stringbuffer . Best of luck for the exams.

avatar

"perform a join on two or more datasets" - implies that there are more than 2 data sets involved and thus you may ave to write a solution which can comprise only a Map Join or only a Reduce Join or a combination of both.

In essence, if the data sets are too large and could result in memory issues, then bloom filter is the route to take.

So from a conceptual perspective, it is good to know Bloom Filter even if it is not specifically mentioned in Exam Objectives.