2213
Posts
231
Kudos Received
82
Solutions
About
My expertise is not in hadoop but rather online communities, support and social media. Interests include: photography, travel, movies and watching sports.
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| 781 | 05-07-2025 11:41 AM | |
| 1643 | 02-27-2025 12:49 PM | |
| 3455 | 06-29-2023 05:42 AM | |
| 3009 | 05-22-2023 07:03 AM | |
| 2119 | 05-22-2023 05:42 AM |
02-13-2018
09:28 PM
Just quick info you can run pig in local mode as well as in mapreduce mode , By default, load looks for your data on HDFS in a tab-delimited file using the default load function PigStorage. also if you start you pig -x which local mode it will look for local fs . Nice that you found the fix. @SGeorge ,
... View more
02-12-2018
07:22 PM
I am doing some practices, for some questions, there could be various solutions, for example, I can use RDD operations to do some filtering, sorting, and grouping; with DataFrame and SparkSQL, it is even easier to me to get the same result. My question is will there be a requirement in the exam that some questions must be resolved using RDD, not DataFrame+SparkSQL. or vice versa? Thank you.
... View more
01-17-2018
05:43 AM
1 Kudo
Summary: Due to a kernel security exploit on system CPUs (Spectre and Meltdown), OS updates are required for all systems. The OS updates have performance implications. Cloudera is in the process of testing the impact across various Cloudera software services and workloads. Initial test results, based on the limited testing (a subset of services and workloads) we have completed as of January 12, indicate that the performance impact on Cloudera software is minor.
Users affected: While we are not aware of customers who have been impacted by these CPU vulnerabilities, we understand and expect that all Cloudera customers will apply Spectre and Meltdown patches as they become available from their OS suppliers. As such, Cloudera is performing basic functional testing to ensure that CDH works with the patches as well as determining performance impact on typical workloads.
Impact: Cloudera’s initial testing has focused on a subset of commonly used CDH services. Based on the limited testing (a subset of services and workloads) we have completed as of January 12, these CDH services continue to function on patched systems and the performance impact on Cloudera software is minor. For example:
MapReduce jobs run with 3 - 9% slowdown
Impala queries run with 5 - 10% slowdown
Hive on Spark queries run with up to 5% slowdown
Spark jobs run with 0 - 12% slowdown
HBase queries run with up to 5% slowdown
Cloudera understands that the performance of your data clusters can affect important applications and may have business impact. As such, we are taking this situation seriously and treating this with urgency. Cloudera will provide additional performance results as soon as they are available.
Action required: While we are not aware of customers who have been impacted by these CPU vulnerabilities, we understand and expect that all Cloudera customers will apply Spectre and Meltdown patches as they become available from their OS suppliers.
... View more
01-08-2018
09:43 AM
it is not working for me...
... View more
01-01-2018
06:40 AM
Great to hear you got it working. 🙂
I can usually work through the installation issues but beyond that I'm not much help. If the misconfigurations do become an issue, I would suggest starting a new thread so it can catch the eye of some of the more knowledgeable community members.
... View more
12-27-2017
12:57 PM
Registering for Cloudera.com is free and provides access to things like posting here on the community. If you are looking for access to the support portal or other areas your free account doesn't allow, I would refer you to the contact us page which will get your information to sales.
... View more
12-26-2017
06:34 AM
Hi @RamkSrid. This is thread is over 2 years old at this point. Can you explain your specific needs in a new thread perhaps?
... View more