Hello
A lot details are provided here to your questions
https://docs.cloudera.com/cdp-private-cloud-base/7.1.6/concepts/topics/cm-sdx-vpc.html
Just to highlight a few points
1. not all components are available for VPC, limited to the following (no Flink)
Only the following services can be installed on Compute clusters:
- Hive Execution Service (This service supplies the HiveServer2 role only.)
- Hue
- Kafka
- Spark 2
- Oozie (only when Hue is available, and is a requirement for Hue)
- YARN
- HDFS (required)
2. data will be transferred to VPC during runtime, so check the network, performance requirements etc
3. only certain versions of the platform supports VPC, limited to the following:
- CDH 5.15
- CDH 5.16
- CDH 6.0
- CDH 6.1
- CDH 6.2
- CDH 6.3
- Cloudera Runtime 7.0.3
- Cloudera Runtime 7.1.1 and higher
4. it is good you need to add temp compute resources to your cluster for some ad-hoc workloads, but it has the trade-off of the performance (refer to point 2)
5. VPC can be fully managed using Cloudera Manager, so it is a fairly easy process to implement