Member since
07-29-2019
640
Posts
114
Kudos Received
48
Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 10973 | 12-01-2022 05:40 PM |
| | 2747 | 11-24-2022 08:44 AM |
| | 3969 | 11-12-2022 12:38 PM |
| | 1446 | 10-10-2022 06:58 AM |
| | 2086 | 09-11-2022 05:43 PM |
10-19-2021
08:09 PM
Hi @Sipping1n0s I think I can help.

First, the current Enterprise Data Platform product offered by Cloudera, as of Oct 2021, is Cloudera Data Platform (CDP); Cloudera is the name of the company that markets CDP.

Second, in its on-premises "form factor", Cloudera Data Platform Private Cloud, you can download and install a "free trial", which expires after 60 days, in a non-production environment for demonstration and proof-of-concept use cases without obtaining a license. You can read over the operating system requirements for installing the CDP Private Cloud Base trial (which is the easiest way to install CDP Private Cloud) here: Operating System Requirements

I am not aware of a Docker image available for download from Cloudera that would enable you to create a CDP cluster on your desktop and also test the Docker containers on multiple clouds, although I am certain that someone with the requisite knowledge of Docker and sufficient skill with the various required development tools could create one. Indeed, some member of the Cloudera Community may have already done so and be willing to share their method in response to your question.

Prior to its merger with Cloudera in 2018, Hortonworks, Inc. distributed a Docker image of its distribution called the HDP Sandbox, which is still available for download here: Deploying Hortonworks Sandbox on Docker (among other places), along with a tutorial that provides detailed steps to install that combination on Linux, Mac OS X and MS Windows. It could in no way be called up-to-date or equivalent to what Cloudera markets as an Enterprise Data Platform product today (the Sandbox is based on a version of the base distribution which is nearing end-of-support status). The Sandbox is intended as a pre-configured learning environment for developers who are just getting started.
Getting installations of the HDP Sandbox running on multiple clouds would be challenging, but possible, again assuming a developer knowledgeable about the various required development tools.
10-13-2021
05:23 AM
Hi, can I use this machine for my master's thesis work? I need a licence and I don't know whether this source is legal.
10-11-2021
02:19 PM
Hi @Yogesh771 It would help members of the community offer possible answers to your question if you posted a link to the tutorial you are presumably following, which includes the source code for this Python program and describes how to run it and what the expected output is.
10-04-2021
08:42 PM
Hi @migration Which .jar files are these classes packaged in for CDH 5.16? Can you share the relevant file names?
10-04-2021
07:03 PM
Yes. You can allocate more of the host machine's RAM to the virtual machine. Take your current setting for that configuration, double it, then perform the same query against the same data set and compare the result. I also strongly recommend a close reading of Deploying Hortonworks Sandbox on VMWare
10-04-2021
07:44 AM
I am following up here on one part of the original question, regarding the Apache Sqoop project being retired, just for the record (which is to say, for the benefit of people who might arrive at this thread via a search engine at some point in the near-to-medium-term future). I still stand behind what I previously wrote about the relative strengths and weaknesses of using these two tools to extract data from SQL Server and ingest it into HDFS, but I do want to clarify that while Sqoop was moved to the Apache Attic in June 2021, the software will continue to be supported by Cloudera and shipped as part of CDP Public Cloud and CDP Private Cloud. See Cloudera's statement of support on this matter here: Apache Sqoop Support on Cloudera Data Platform
10-04-2021
07:31 AM
I am following up here on one small point, just for the record (which is to say, for the benefit of people who might arrive at this thread via a search engine at some point in the near-to-medium-term future). The original question from @vbar included this:

"However, here's where it gets tricky. Sqoop was official [sic] retired this June and after that I don't really know if I am supposed to get prepared for a sqoop question or a "JDBC" format question."

I can't speak at all to the question of whether or not the exam will have questions on Sqoop, but I do want to clarify that while Sqoop was moved to the Apache Attic in June 2021, the software will continue to be supported by Cloudera and shipped as part of CDP Public Cloud and CDP Private Cloud. See Cloudera's statement of support on this matter here: Apache Sqoop Support on Cloudera Data Platform
10-03-2021
04:15 PM
Hello @xyz123 It may be true that the HDP 2.6.5 sandbox requires less than the 10 GB required by 3.0.1, but that does not mean that you'll be able to get everything working on a Mac with 8 GB RAM using VirtualBox. IIRC, you'll still need to allocate 8 GB RAM to the virtual machine, and if that is not possible on a Big Sur-running Mac with 8 GB RAM total, then you are going to have to do without some services (at a minimum; it still may not be possible with the RAM available). Some of the services you describe when you say you "end up with a dashboard full of red flags" simply don't start by default when running in a memory-deficient environment. If you're looking for pointers on troubleshooting the HDP Sandbox, I strongly recommend closely reading the tutorial Learning the Ropes of the HDP Sandbox
09-29-2021
01:56 PM
1 Kudo
Hi @Jaspal The more detail you provide, the better community members can assist with your question. I think you'll need to provide a tad more detail about what you mean when you say you "… want to mask these data sources" or "mask Hive tables". What does the result look like when you have the "mask" you desire in place?
09-28-2021
10:24 PM
In June 2021, Apache Sqoop was retired and moved to the Apache Attic. While Sqoop will no longer be maintained at Apache, Cloudera CDP Public and Private Cloud customers can still expect full support, including patches, hotfixes and prompt consideration of enhancement requests. For more information, please see the following resources:
- Migrating Data Using Sqoop in CDP Public Cloud
- Apache Sqoop changes after upgrading from CDH to CDP Private Cloud Base
- Apache Sqoop's apache.org page
- Apache Software Foundation's Board Resolution Terminating the Apache Sqoop Project 16 June 2021
- Apache Sqoop in the Apache Attic