Created on 07-14-2019 03:01 PM - edited 09-16-2022 07:30 AM
Do table statistics get replicated as part of hive BDR replication ?
Created 07-16-2019 11:27 AM
Hi @Daggers ,
Based on the public doc here:
It says:
Table and partition-level column statistics stored in the Hive metastore and used by Impala are now replicated during Hive Replication. This is supported between a replication source with Cloudera Manager running version 5.10 or higher and a replication target running Cloudera Manager 5.10 or higher. Because this change replicates more information, the same schedule may take more time to complete if column statistics are present.
Thanks and hope this helps.
Li
Li Wang, Technical Solution Manager
Created 07-16-2019 11:27 AM
Hi @Daggers ,
Based on the public doc here:
It says:
Table and partition-level column statistics stored in the Hive metastore and used by Impala are now replicated during Hive Replication. This is supported between a replication source with Cloudera Manager running version 5.10 or higher and a replication target running Cloudera Manager 5.10 or higher. Because this change replicates more information, the same schedule may take more time to complete if column statistics are present.
Thanks and hope this helps.
Li
Li Wang, Technical Solution Manager
Created 11-04-2022 04:54 AM
Hi!
Do you know the steps how to disable the replication of table table column statistics since it's effect the performance while we run the BDR?
Created 11-04-2022 09:06 AM
@DNADatangineer As this is an older post, you would have a better chance of receiving a resolution by starting a new thread. This will also be an opportunity to provide details specific to your environment that could aid others in assisting you with a more accurate answer to your question. You can link this thread as a reference in your new post.
Regards,
Diana Torres,