Member since
07-30-2013
165
Posts
55
Kudos Received
40
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
67871 | 01-02-2016 06:01 AM | |
4175 | 12-14-2015 07:03 AM | |
3192 | 12-01-2015 06:33 PM | |
989 | 10-19-2015 04:06 AM | |
2959 | 09-15-2015 11:53 AM |
02-09-2016
06:12 PM
That’s not correct. Please see the release notes http://www.cloudera.com/documentation/enterprise/5-3-x/topics/cdh_rn_spark_ki.html SparkSQL just exited alpha and is far from stable. As such, SparkSQL is currently considered a “preview” in CDH. We love it and we’re dedicating a lot of engineering resources to bring it to our standards but as I’m sure you’re aware, it’s mainly Scala (pyspark lags), it’s very buggy, it causes all kinds of havoc (esp. with Hive)….the list goes on. Once we get it running at scale, we’ll support it fully in our distribution and we’ll test it. But today, it’s just not ready.
... View more
01-16-2016
09:24 PM
You can also find this information on our website: http://www.cloudera.com/content/www/en-us/training/certification/2016_developer_changes.html
... View more
01-02-2016
06:01 AM
16 Kudos
Our Cloudera Developer Training for Spark and Hadoop teaches you everything you need to know for the exam and gives you days worth of hands-on practice. We both train and test the same objectives (although we train on more of course) If you do not wish to take our training or cannot afford it, there are hundreds of free resources on the internet. For example, if I point my browser to the List of objectives from our website. I then copy the first objective and search on it, I get dozens of free docs and training on the skill. • Import data from a MySQL database into HDFS using Sqoop • s qoop.apache.orgdocs on importing data using Sqoop • Importing Data into HDFS using Sqoop • Video demonstration of user importing data from MySQL to HDFS using Sqoop And there are dozens more. As the exam page also tells you that you have access to some of these during the exam, I suggest becoming intimate with the those. If you take some time to search each objective and learn the skill, the exam will be easy. If you don’t want to take the time to learn on your own, and you’re not learning on your job, then training is your answer. This is always true, of all technical skills, not just this one exam. And part of the learning is figuring out which free resources are the best if that’s the route you choose to go, or which paid ones are. As for the style of question, it’s quite straightforward: for the above objective, you’ll be given the location of a MySQL database on one of the nodes and you’ll be required to use Sqoop to import a portion or all of that data from the MySQL instance we give to the instance of HDFS we give you. It's really that straightforward if you take the time to learn the skills.
... View more
12-15-2015
06:29 AM
2 Kudos
We list the exam objectives. I’d recommend you do a comparison on our website. I would say that exam helps with CCA (though you need to pick up Spark which you should anyway) but it’d be harder to tell with CCP where you could use MapReduce or Spark (you’re choosing). For example, it says that you learned ingest with Sqoop and Flume but to what level? Do you know how to limit the ingest, do incremental ingests, do you know how to evaluate the data, and transform it and land it on HDFS in a particular format? Do you understand various file formats and compression such that if we give you a scenario you could look at an evaluate whether the data should land in a row-based or columnar format, what compression and partitioning it should get, etc. For CCP, you should know all that -- and know it well (and much more of course)-- and you have to be able to do all those things, actually write the code or use the tools on the command-line to process the data. In CCA, you have to be able to do it but we’re easier on you. We say “ingest this with Sqoop and land it this way,” or “write the data to Avro” or “evolve this schema” and so on.
... View more
12-15-2015
06:16 AM
Apologies for the error. You have one full year from the date of purchase on all our exams save the ones retiring. This is correct: Please note that you will only be able to access this item until Wednesday, 12 October 2016 PDT. This is wrong: You may schedule and take your exam from your home or office location. Cloudera exams are no longer offered at testing centers. Your registration expires on 31 December, 2015 at 11:59PM US Pacific Time. You must schedule and take your exam before the expiration date as exam registrations cannot be extended. The error arose when we set the expiration of other exams, the LMS triggered that warning in all exams until someone pointed it out like you are and we were able to correct it.
... View more
12-14-2015
07:03 AM
1 Kudo
It's hard to say without knowing the quality of the training and the quality of your practice and your aptitude. We've put a fairly representative question on http://www.cloudera.com/content/www/en-us/training/certification/ccp-data-engineer.html . If that sample is immediately obvious and if you look at is and could solve it without too much trouble, then perhaps. If not, then CCA Spark and Hadoop Developer is more appropriate. The distinction is that with CCA, we're validating discrete skills, in CCP, we're evaluating both your skills and higher level mastery skills like analysis, application, evaluation, and synthesis (to put it into traditional Bloom's taxonomy terms).
... View more
12-01-2015
06:33 PM
You will always have a license number and you will always have a public transcript that shows you passed CCDH on CDH version X on date X. After all, a certification is a measurement at a point in time of your achievement at that time. That said, as the number of scenarios for which MapReduce is the answer, or even the preferred programming framework, for a workload decreases, the value of that certification diminishes over time. We hear from our customers and from hiring managers that a stale certification is worse than none. I recognize their point though I perhaps wouldn't go that far. However, we need to balance the market's desire to delineate that there is an old way and there's what is increasingly being used as newer frameworks come into the platform. This is why we're re-branding, going to hands-on exams, and requiring recertification -- so that the program can keep pace without needing to completely change paradigms. We'll need to be honest to our customers and state that the version of CDH (say version 3) is retired and mark the certification program as retired since it is, and promote and give benefits to those who are current. As the version of CDH retires (the EOM dates are http://www.cloudera.com/content/www/en-us/legal/support-lifecycle-policy.html), we'll move those certifications to retired status in the transcript portion on our verification portal (current, retired, and exam transcript are those sections)
... View more
09-15-2015
11:53 AM
More information to understand the context read Mike Olson on why Spark should replace MapReduce back in 2013: http://vision.cloudera.com/mapreduce-spark/ read our more recent announcement : http://vision.cloudera.com/one-platform/ join a webinar to hear Doug Cutting talk about it: http://go.cloudera.com/OnePlatformInitiative or read the press release: http://www.cloudera.com/content/cloudera/en/about/press-center/press-releases/2015/09/09/cloudera-launches-effort-to-unite-apache-spark-and-apache-hadoop.html
... View more
09-15-2015
11:34 AM
It means Spark replaces MapReduce, Spark replaces Pig, and tools and abstractions like Hive and Impala are more central. The exam has the same objectives as Data Engineer only much easier and the questions require less experience and less skill.
... View more
09-15-2015
06:42 AM
Currently, none of our exams are for beginners. Moreover, CCD-410 is aimed specifically at Java MapReduce which is no longer the direction of the company and community. So I wouldn’t recommend it to anyone who is getting into big data. We are beta-testing a beginner’s exam this month and, if all goes well, will launch it soon. The beginner exam is part of a learning path that provides an integrated and comprehensive hands-on training and performance-based certification program to teach and validate the skills needed to develop reliable, scalable data pipelines that take raw data from ingest to insight.
... View more
07-06-2015
08:13 AM
Subject to the conditions of purchasing, which are listed before purchasing -- all that is on the website, in the FAQ, in the terms and conditions. But yes, in general, you purchase and then choose when you want to schedule between now and Dec 31, 2015 or the EOL of the exam. All that info. is put forth on the website.
... View more
07-06-2015
07:37 AM
Cloudera will no longer approve or support test centers as we could no longer abide the issues so prevalent in test centers. I do not know of a single program in business today that isn't looking to get out of test centers. Test centers are a last-century, archaic, relic that completely ignores technological advances of the past decade. I've been working in and managing certification programs since 1991 and I can pretty safely say: In the next five years, I believe you will see the complete demise of test centers and multiple-choice exams in any highly technical arena. And I think you'll see that happen in our industry/market much much sooner. I meet montly with the directors of some of the most important and high-profile IT certification programs and not a single one of those companies is moving forward with multiple-choice exams and test centers. And in our space, all the programs are already there or moving that direction. Even the test center vendors are acquiring remote proctoring companies and technologies and moving away from brick-and-mortar delivery. Just like more and more data and storage and activity is moving to the cloud, so are exams and so are proctors.
... View more
07-02-2015
08:51 AM
No, you cannot use notes. It's a high-stakes certification exam. You can't drink during it, you have to scan the room, your biometrics are recorded, your eye movement, you cannot talk to yourself, etc. it's a secured environment. Nothing has changed with the exam. We added the note because we get daily inquiries if the exam is on Spark and it's not. The exam covers the objectives on the website.
... View more
07-01-2015
03:31 PM
well, I can't make forward-looking guarantees but no plans at this time to offer them. There's some concern with accrediting agencies that offering practice tests hurts the integrity of the exam so we have to work with them and consider the options
... View more
07-01-2015
11:29 AM
All purchased content is in your account. If you don't know how to login into your account, please email certification@cloudera.com
... View more
06-12-2015
01:00 PM
email certification@cloudera.com for help. They’ll need your account number and your order number.
... View more
06-10-2015
05:31 AM
1 Kudo
No to both. On 1, it's not that specific. On 2, we don't give discounts on any our offerings. One month is usually not near enough time to learn the concepts presented in the practice test. Rememeber, it's not a mock test that you can take 10X in one week and memorize the questions and do well through memorization. You take it once, and then spend the next few months learning the areas you don't know.
... View more
05-05-2015
07:30 AM
1 Kudo
We ended our relationship with Pearson VUE on April 30, 2015 and delivered exams through them until that day. Your friend can verify her certification at http://certification.cloudera.com/verify/ if she has a concern.
... View more
05-04-2015
06:00 AM
1 Kudo
Nothing about the exam has changed except for the delivery partner. You can find all the exam information http://cloudera.com/content/cloudera/en/training/certification/ccdh.html, the study guide http://cloudera.com/content/cloudera/en/training/certification/ccdh/prep.html and the FAQ here http://cloudera.com/content/cloudera/en/training/certification/faq.html
... View more
03-31-2015
08:13 AM
eh...I can't get the html red font color to stick. hopefully my additions are clear
... View more
03-31-2015
08:11 AM
1 Kudo
Let me try and expand a bit on the main piece of the announcement that I believe you're asking for. - DS-200 candidates have their achievement listed as part of their public transcript and license and they can use that with employers currently. That hasn't been taken away from them. I see it listed in many places on LinkedIn and it currently has value by itself in the market. That remains. - DS-200 holders will be given early access to the new exams and provided a discount I do not, however, want DS-200 holders to be able to "skip" one of the "challenge" exams because that would violate all international testing standards, but also because I know what I see in the data for those that pass DS-200 and those that pass the challenge and I feel it's imperative to keep the hands-on requirements in place to protect the integrity of the process and the exam. And everyone who is CCP today passed 4 segments (DS-200 + 3 challenge sections). Here is the section of the announcement. I'll will try and add commentary in Red We have several goals behind making these changes:
Reduce the amount of work a candidate must complete in areas they have already demonstrated mastery
Many very qualified candidates turned in perfect or near-perfect challenge scores on parts one and two and then got called off on a project and received a 0 for part three because of circumstance and thus failed. On the next challenge, they had to do the complete challenge the next time -- all three parts since all were somewhat integrated and doing only one twisted the validity. As such, we designed the last challenge as three discrete parts and now we're taking it further to say that you only have to test on those sections you failed. This allows the candidate more flexibility and allows those who are qualified to get through the program when it works for them, on their time rather than forcing them (a) to start over and do the complete challenge and (b) engage in a time window that works for them. Provide year-round access to the certification, giving candidates greater flexibility. We recognize the time-bound nature of the challenge often prevented well-qualified candidates from participating.
I think my answer directly above addresses this. Streamline the program cluster infrastructure and provide all candidates with a level exam scenario
- Streamline the grading and scoring process to provide candidates quick and relevant feedback. We saw all kinds of submissions in the past 18 months. Some people working on laptop VMs where it took 4 weeks to run one pass on the data; some working on an Oracle BDA where it took no time. We made an assumption early on that if you can do this work (data science on very large datasets) that you have access to a cluster environment. We didn't have the ability to provide each candidate one for three months; we tried with multi-tenant providers etc. but it was less than ideal and we needed to protect security and integrity of the exam. Early on, this worked but as demand increases, more and more people want to test -- simply put, more and more people want in. Many are students who no longer have access to their university infrastructure or they work in sensitive fields and must do the test off-site to their cluster. We set out to build a performance-testing cluster infrastructure that we could scale globally for all kinds of hands-on exams. And that infrastructure is now complete (or near-complete, we're still performance-tuning) which means that anyone in the world, anywhere, with a laptop and a browser (even be a cheap chromebook) can test on a live, multi-node, very large compute instances, cloud-based, big data infrastructure. As such, we wanted to migrate the exams to a level-playing field. There are many many many more reasons but let me briefly address two: 1. Predictive validity per unit of time 2. Cheating/theft 1. We're always evaluating for predictive validity per unit of time which looks at "how much of the objective set do we need to present a candidate to evaluate and make a predictive judgment or outcome." All exams are predictive -- a certification is Cloudera offering a predictive assessment of someone's ability in a job role. We noticed somewhat early on that the challenge format, while having lots going for it, had poor predictive abilities in the sense that people could rathole forever and we couldn't stop them from doing it. It scores almost perfectly on all kinds of psychometrics, but simply put, we could pass/fail people A LOT faster. So many people looked at the challenge problem and while the answer was "to the north" (let's say), they turned directly south and then spent 100+ hours trying to make "south" work and it was never going to. And when they got a 0 as their score (or less than 30%) they were angry and confused. As more people wanted in on the field, this issue increased substantially. The open-ended nature of the challenge was great for the ones who could pass easily -- some scored near-perfect in a few hours but others, those borderline candidates, spent inordinate amounts of time circling the wrong approach. And there are a number of reasons for that I won't go into but I know what I see in the data and the submissions. So we set out to shorten the feedback loop and tighten the predictive validity per unit of time. And we're still fine-tuning what we can cut and what we can include. 2. As DS-200 continued to be live, people were taking it many many times and the predictive validity decreased. It was also shared all over online. This is true of all MC tests and we were faced with enormous investment to keep it fresh and foil the cheaters. All that would be fine in isolation but combined with the next, we had to reshape a number of components. Further, the challenge as we did it required a certain honor code. We gave you the challenge, we didn't proctor, we used DS-200 to gate some of that. However, we noticed that a particular company was having someone pass them the challenge and they were solving with a team of people and then posting it publicly to their blog -- basically how to solve it. It might have been coincidence and we're not going to point fingers, but it got us looking at other security issues and, sad to say, we found enough evidence to know that there were a number of entities out there predisposed to trying to bring this program down. This is unfortunate and really sad in some ways, but we realized we needed to adjust and tighten the security as much as we could. So given all the above, plus thousdands of other pieces of data, of discussions, of feedback, and knowing that the cost will increase because we have to secure it, proctor it, etc. we opted for the stated changes. Data science is an evolving field and the technologies available to us today have evolved. As such, we felt that now was a good time to adapt the program to meet the changing needs of the community and to encourage ongoing professional growth and development in this exciting field.
... View more
03-31-2015
07:07 AM
1 Kudo
I'll reply in two posts, the first will just be the announcement since the full announcement isn't referenced. The second will be an opinion on the rationale: Here's the full announcement: Cloudera is making several changes to its CCP: Data Scientist certification program, effective immediately. We are lifting the requirement that a candidate pass the DS-200: Data Science Essentials written test before being allowed to move on to the challenge. We will no longer offer either DS-200 or the DS-200 practice test. We are making the following changes to the Data Science Challenge: - Cloudera will now provide candidates a fully configured CDH-based data science cluster rather than requiring or allowing candidates to provide their own. - The Challenge is composed of three sections; each is now offered as a separate exam, and students must take and pass all three in order to achieve the qualification of CCP: Data Scientist. Unlike the current Challenge, though, students need only retake any specific exams they fail, rather than having to complete a full new challenge. The exams may be taken in any order. The exams are: - DS700 - Descriptive and Inferential Statistics - DS701 - Unsupervised Machine Learning - DS702 - Supervised Machine Learning - The exams will remain fully hands-on on a live cluster with large datasets but will now include remote monitoring by a proctor via webcam, video, audio, and in a secured cluster environment. Candidates may take the exam at their home or office location; attendance at a testing center is not required. - The three exams can be taken in any order and at anytime as long as all three are passed within 365 days of each other. Candidates who fail an exam will need to pay to retake that exam; no free retakes will be offered. - The new exams will be offered beginning July 2015. At that time, we will publish the cluster specifications, what tools candidates may install. We have several goals behind making these changes: - Reduce the amount of work a candidate must complete in areas they have already demonstrated mastery - Provide year-round access to the certification, giving candidates greater flexibility. We recognize the time-bound nature of the challenge often prevented well-qualified candidates from participating. - Streamline the program cluster infrastructure and provide all candidates with a level exam scenario - Streamline the grading and scoring process to provide candidates quick and relevant feedback. Data science is an evolving field and the technologies available to us today have evolved. As such, we felt that now was a good time to adapt the program to meet the changing needs of the community and to encourage ongoing professional growth and development in this exciting field.
... View more
03-25-2015
05:09 PM
Not sure if I’m answering your question: - do certifications retire? Sure. We haven’t yet retired one but a certification program can retire if the market changes and we need to end or change a program. I’m using the term certification to mean a certification not an exam. - do exams retire? Yes, but CCD-410 is current to C5. There is not, nor will there be a CCD-5x exam.
... View more
02-12-2015
06:23 AM
Hi, http://cloudera.com/content/cloudera/en/training/certification/faq.html#fail What happens if I fail a test? As with every Cloudera test, you are given a score report that informs you of your overall score and how you scored on each section of the test. From this score report, you will know which areas of the test you need to study before scheduling to take the test again. Candidates who fail a test must wait a period of seven (7) calendar days, beginning the day after the failed attempt, before they may retest for the same test. There is no limit to the number of retakes a candidate may attempt. There are no discounts for retake exams. On May 1, we’re going to a 30-day cooling off period before retake.
... View more
01-24-2015
12:35 PM
The DS-200 practice test is not available at this time, nor do we plans to make it available in the future.
... View more