Member since: 07-29-2019
Posts: 640
Kudos Received: 114
Solutions: 48
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 14426 | 12-01-2022 05:40 PM |
| | 3294 | 11-24-2022 08:44 AM |
| | 4950 | 11-12-2022 12:38 PM |
| | 1791 | 10-10-2022 06:58 AM |
| | 2579 | 09-11-2022 05:43 PM |
10-19-2021
08:09 PM
Hi @Sipping1n0s I think I can help.
First, as of October 2021, the Enterprise Data Platform product offered by Cloudera is Cloudera Data Platform (CDP); Cloudera is the name of the company that markets CDP.
Second, in its on-premises "form factor", Cloudera Data Platform Private Cloud, you can download and install a "free trial" without obtaining a license. The trial expires after 60 days and is meant for non-production environments, for demonstration and proof-of-concept use cases. You can read over the operating system requirements for installing the CDP Private Cloud Base trial (which is the easiest way to install CDP Private Cloud) here: Operating System Requirements
I am not aware of a Docker image available for download from Cloudera that would enable you to create a CDP cluster on your desktop and also test the Docker containers on multiple clouds, although I am certain that someone with the requisite knowledge of Docker and sufficient skill with the required development tools could create one. Indeed, some member of the Cloudera Community may have already done so and be willing to share their method in response to your question.
Prior to its merger with Cloudera, which was announced in 2018, Hortonworks, Inc. distributed a Docker image of its distribution called the HDP Sandbox. It is still available for download here: Deploying Hortonworks Sandbox on Docker (among other places), along with a tutorial that provides detailed steps to install that combination on Linux, Mac OS X and MS Windows. However, the Sandbox in no way could be called up-to-date or equivalent to what Cloudera markets as an Enterprise Data Platform product today (it is based on a version of the base distribution that is nearing end-of-support status). The Sandbox is intended as a pre-configured learning environment for developers who are just getting started.
Getting installations of the HDP Sandbox running on multiple clouds would be challenging but possible, again assuming a developer knowledgeable about the required development tools.
10-16-2021
11:21 AM
Hi @PrernaU It would help members of the community offer possible answers to your question if you posted which distribution you are using. If this is indeed on Cloudera Data Platform, are you using Public Cloud (on AWS or Azure, for example) or Private Cloud (an on-premises installation)?
10-11-2021
04:52 PM
@Sam7 No, you don't have to start a new question. Just go ahead and post a screenshot of the query in Hue and the resulting error message, and I'll revise the label for you if necessary. Also, let us know what distribution you're using and what version of Hue you're running; that would help us help you a lot.
10-11-2021
02:19 PM
Hi @Yogesh771 It would help members of the community offer possible answers to your question if you posted a link to the tutorial you are presumably following, which includes the source code for this Python program and describes how to run it and what output to expect.
10-11-2021
08:49 AM
@Sam7 DAS is Data Analytics Studio, which was the label you originally applied to your post. See the following partial page shot: So that is why @balajip was asking if the query was working fine before you made the latest change to capitalize the initial letter. DAS is a completely different tool from Hue. It would help members of the community offer possible answers to your question if you could post a screenshot of the query you're submitting in the actual tool you are using, as well as the error message that tool returns when it attempts to execute the query.
10-04-2021
07:44 AM
I am following up here on one part of the original question, regarding the Apache Sqoop project being retired, just for the record (that is, for the benefit of people who might arrive at this thread via a search engine in the near-to-medium-term future). I still stand behind what I previously wrote about the relative strengths and weaknesses of using these two tools to extract data from SQL Server and ingest it into HDFS, but I do want to clarify that while Sqoop was moved to the Apache Attic in June 2021, the software will continue to be supported by Cloudera and shipped as part of CDP Public Cloud and CDP Private Cloud. See Cloudera's statement of support on this matter here: Apache Sqoop Support on Cloudera Data Platform
10-03-2021
04:15 PM
Hello @xyz123 It may be true that the HDP 2.6.5 sandbox requires less than the 10 GB required by 3.0.1, but that does not mean that you'll be able to get everything working on a Mac with 8 GB RAM using VirtualBox. IIRC, you'll still need to allocate 8 GB RAM to the virtual machine, and if that is not possible on a Mac running Big Sur with 8 GB RAM total, then you are going to have to do without some services (at a minimum; it still may not be possible with the RAM available). Some of the services behind what you describe as "a dashboard full of red flags" simply don't start by default in a memory-deficient environment. If you're looking for pointers on troubleshooting the HDP Sandbox, I strongly recommend closely reading the tutorial Learning the Ropes of the HDP Sandbox
09-29-2021
01:56 PM
1 Kudo
Hi @Jaspal The more detail you provide, the better community members can assist with your question. I think you'll need to say a bit more about what you mean when you say you "… want to mask these data sources" or "mask Hive tables". What does the result look like when you have the "mask" you desire in place?
09-28-2021
10:24 PM
In June 2021, Apache Sqoop was retired and moved to the Apache Attic. While Sqoop will no longer be maintained at Apache, Cloudera CDP Public and Private Cloud customers can still expect full support including patches, hotfixes and prompt consideration of enhancement requests. For more information, please see the following resources:
- Migrating Data Using Sqoop in CDP Public Cloud
- Apache Sqoop changes after upgrading from CDH to CDP Private Cloud Base
- Apache Sqoop's apache.org page
- Apache Software Foundation's Board Resolution Terminating the Apache Sqoop Project 16 June 2021
- Apache Sqoop in the Apache Attic
09-27-2021
03:46 PM
Hi @vijaysahu You didn't provide the version of Impala you're targeting, but assuming that what you are using is fairly recent, I would answer: Yes, it is possible, but probably not in any automated fashion.
The closest Impala equivalent to SQL Server stored procedures is the user-defined function (UDF). Starting in Impala 1.2, you can code your UDFs in C++ and/or Java instead of Transact-SQL (T-SQL), the proprietary programming language commonly used in MS SQL Server. You can start reading up on Impala user-defined functions here: User-Defined Functions (UDFs)
I am not aware of any translator or "converter" that will take T-SQL code and transform it into the equivalent syntax in Java (for example) that can be used in Impala; perhaps another member of the Cloudera Community is aware of one and will weigh in here with a pointer.
Even if one is available, I think you'd be better off sitting down, doing some analysis to understand what the original procedure with more than 500 lines of code does, and then writing a new UDF or set of UDFs that does the equivalent thing in Impala (there are important limitations to the functionality of UDFs that don't limit corresponding T-SQL procedures). Any competent software developer should be able to accomplish that task in a reasonable amount of time, given reasonable complexity of the original code, and probably in fewer calendar days than it would take to acquire and learn the quirks of an automated translator. That person might decide that there are better ways of satisfying the requirements than using UDFs and that the original stored procedures never should have been written at all.
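To give a feel for the rewrite-by-hand approach, here is a minimal sketch in Java of the sort of small scalar routine that might be carved out of a longer T-SQL procedure. Everything in it is an assumption for illustration: the class name `DigitsOnlySketch` and the digit-stripping logic are hypothetical and are not taken from your procedure. It is written as a plain class so that it compiles without any dependencies; as far as I know, a real Java UDF for Impala follows the Hive UDF convention, meaning the class would extend `org.apache.hadoop.hive.ql.exec.UDF` (from the hive-exec library) and expose its logic through an `evaluate()` method.

```java
// Hypothetical sketch only: the class name and logic are illustrative assumptions.
// A real Hive-style Java UDF would declare "extends UDF"
// (org.apache.hadoop.hive.ql.exec.UDF); that is omitted here so the
// example compiles on its own.
final class DigitsOnlySketch {

    // Returns only the ASCII digit characters of the input.
    // Returns null for null input, mirroring SQL NULL-in, NULL-out behavior.
    String evaluate(String input) {
        if (input == null) {
            return null;
        }
        StringBuilder digits = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (c >= '0' && c <= '9') {
                digits.append(c);
            }
        }
        return digits.toString();
    }

    // Small driver so the logic can be checked outside of Impala.
    public static void main(String[] args) {
        System.out.println(new DigitsOnlySketch().evaluate("(555) 010-1234"));
    }
}
```

Testing the logic this way, as ordinary Java, before packaging it as a JAR is one reason a hand-written port can be quicker than it sounds. The steps for registering the finished class in Impala are covered in the UDF documentation linked above.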