Support Questions

Find answers, ask questions, and share your expertise

Running Cloudera HDFS on Windows without a VM for Metadata Analysis

avatar

We have requirements in a project to Analyze whole workload on Cloudera HDFS data and metadata. We have created a tool which have integration with AWS Redshift, GCP BigQuery and Snowflake. This would be our first on-premise integration in to the tool.

Any way I can replicate Cloudera Hadoop on Windows for initial prototype testing without VM which makes system very slow ?

Regards
Rahul

1 REPLY 1

avatar
Community Manager

Hi Rahul,

Welcome to the community. While I’m not a technical expert, I’d like to help get you closer to an answer here. Could you clarify a few of the "big picture" details?

  • Which version are you targeting? Do you know which specific version of Cloudera your project needs to replicate? I'm assuming the latest Cloudera has but want to be sure. 

  • What is the scale of the data? Since you are looking to avoid a VM due to speed, roughly how much data or metadata are you planning to run through this initial prototype?

  • Windows constraints: Are there specific reasons the prototype must stay on Windows (like company security policies), or is it just for the convenience of your current hardware?

Having these details will make it much easier for others here to make suggestions.


Keep the questions coming,

Cy Jervis | Senior Manager, Knowledge Programs

if (helpful) { mark_as_solution(); } | if (appreciated) { give_kudos(); }