2208
Posts
230
Kudos Received
82
Solutions
About
My expertise is not in hadoop but rather online communities, support and social media. Interests include: photography, travel, movies and watching sports.
My Accepted Solutions
Title | Views | Posted |
---|---|---|
450 | 05-07-2025 11:41 AM | |
926 | 02-27-2025 12:49 PM | |
2802 | 06-29-2023 05:42 AM | |
2372 | 05-22-2023 07:03 AM | |
1748 | 05-22-2023 05:42 AM |
02-12-2018
01:46 PM
1 Kudo
Congratulations on solving your issue @SGeorge. Also, rest assured, your posts on the community do not enter a black hole from which there is no return. The community is peer to peer based forum so there are cases where questions go unanswered or take some time to receive a reply.
... View more
01-30-2018
12:59 PM
The certification team advised me that they have reviewed each of the exams they were contacted about and didn't find any performance issues.
... View more
01-30-2018
05:34 AM
Hi @axie,
If you look at the CCA175 certification page you will see the following wording on templates
Exam Question Format
Each CCA question requires you to solve a particular scenario. In some cases, a tool such as Impala or Hive may be used. In other cases, coding is required. In order to speed up development time of Spark questions, a template is often provided that contains a skeleton of the solution, asking the candidate to fill in the missing lines with functional code. This template is written in either Scala or Python.
You are not required to use the template and may solve the scenario using a language you prefer. Be aware, however, that coding every problem from scratch may take more time than is allocated for the exam.
As for Flume and availability of documentations you can see at the bottom of that page (I put Flume items in red):
Exam delivery and cluster information
CCA175 is a remote-proctored exam available anywhere, anytime. See the FAQ for more information and system requirements.
CCA175 is a hands-on, practical exam using Cloudera technologies. Each user is given their own CDH5 (currently 5.10.0) cluster pre-loaded with Spark 1.6, Impala, Crunch, Hive, Pig, Sqoop, Kafka, Flume, Kite, Hue, Oozie, DataFu, and many others (See a full list). In addition the cluster also comes with Python (2.6, 2.7, and 3.4), Perl 5.10, Elephant Bird, Cascading 2.6, Brickhouse, Hive Swarm, Scala 2.11, Scalding, IDEA, Sublime, Eclipse, and NetBeans.
Documentation Available online during the exam
Cloudera Product Documentation Apache Hadoop Apache Hive Apache Impala (Incubating) Apache Sqoop Spark Apache Crunch Apache Pig Kite SDK Apache Avro Apache Parquet Cloudera HUE Apache Oozie Apache Flume DataFu JDK 7 API Docs Python 2.7 Documentation Python 3.4 Documentation Scala Documentation
Only the documentation, links, and resources listed above are accessible during the exam. All other websites, including Google/search functionality is disabled. You may not use notes or other exam aids.
... View more
01-17-2018
05:43 AM
1 Kudo
Summary: Due to a kernel security exploit on system CPUs (Spectre and Meltdown), OS updates are required for all systems. The OS updates have performance implications. Cloudera is in the process of testing the impact across various Cloudera software services and workloads. Initial test results, based on the limited testing (a subset of services and workloads) we have completed as of January 12, indicate that the performance impact on Cloudera software is minor.
Users affected: While we are not aware of customers who have been impacted by these CPU vulnerabilities, we understand and expect that all Cloudera customers will apply Spectre and Meltdown patches as they become available from their OS suppliers. As such, Cloudera is performing basic functional testing to ensure that CDH works with the patches as well as determining performance impact on typical workloads.
Impact: Cloudera’s initial testing has focused on a subset of commonly used CDH services. Based on the limited testing (a subset of services and workloads) we have completed as of January 12, these CDH services continue to function on patched systems and the performance impact on Cloudera software is minor. For example:
MapReduce jobs run with 3 - 9% slowdown
Impala queries run with 5 - 10% slowdown
Hive on Spark queries run with up to 5% slowdown
Spark jobs run with 0 - 12% slowdown
HBase queries run with up to 5% slowdown
Cloudera understands that the performance of your data clusters can affect important applications and may have business impact. As such, we are taking this situation seriously and treating this with urgency. Cloudera will provide additional performance results as soon as they are available.
Action required: While we are not aware of customers who have been impacted by these CPU vulnerabilities, we understand and expect that all Cloudera customers will apply Spectre and Meltdown patches as they become available from their OS suppliers.
... View more
01-12-2018
11:27 AM
Sorry about that @saranvisa. It seems that the community's spam filter grabbed your previous replies to this thread for some reason. If I happens again, I'll dig deeper into it.
... View more
01-07-2018
06:13 AM
I am sorry to see your post about running into an issue taking the certification. While I cannot speak to your specific experience, I can advise that emailing Cloudera is the correct route to take. Did you email certification@cloudera.com or a different address?
... View more
01-01-2018
06:40 AM
Great to hear you got it working. 🙂
I can usually work through the installation issues but beyond that I'm not much help. If the misconfigurations do become an issue, I would suggest starting a new thread so it can catch the eye of some of the more knowledgeable community members.
... View more
12-31-2017
05:47 AM
Thanks for the detailed reply, it is very helpful. The processor and RAM seem to be fine for the default QuickStart VM (not using Cloudera Manager). If you are trying to use Cloudera Manager, the RAM would need to be increased depending on which version you are trying to use. With RAM out of the way, I'm leaning towards a possible bad download. While we recommend 7-zip, I'm thinking the possibility of a bad extraction is low. Since the file is large and can easily get corrupted due to connectivity issues which is why we recommend using a download manager. The only other thing I can think of at the moment to suggest is changing how you create the VM. You may have already done this and if so you can disregard and focus on the download. Here is the instruction from the community article: Installing the VM: Using the specific documentation and instructions provided by your hypervisor application, open the extracted file into that hypervisor application. For example, if you elected to use VirtualBox, you would have downloaded and extracted a *.ovf file from Cloudera. Use the “File -> Import Appliance” menu inside VirtualBox to open your downloaded *.ovf file, or simply double-click on the file itself and VirtualBox should handle it from there.
... View more