Created on 09-28-2020 11:47 PM - edited 09-29-2020 12:59 AM
I'm trying to deploy apache griffin (data quality) in ambari cluster. However, It unsuccessfully deployed.
Is there any data quality tool or any recommendation data quality tool that can successfully implement with ambari cluster?
Created 09-30-2020 04:54 AM
@Elf Can you sure the method and failure details for install with ambari? There is always a way to work around those issues even if its some manual adjustments in ambari or the agent during install.
Created 09-30-2020 08:04 AM
I use apache griffin deployment guide to install griffin in ambari cluster https://github.com/apache/griffin/blob/master/griffin-doc/deploy/deploy-guide.md
Not sure ambari is supported for griffin or not
Created on 09-30-2020 08:41 AM - edited 09-30-2020 08:43 AM
@Elf IMO anything is possible with ambari. That said, out of the box, maybe it would not appear to be possible without some advanced ambari admin skills.
I took a look at the link you provided and that is an example of how to spin up a single machine with many of the services you may already have in your ambari cluster. To install griffin in an ambari cluster you would need to pick a node, install griffin, missing requirements (services/components not in your cluster), and thoughtfully modify the configuration to use the existing services from the ambari cluster. For example, feed griffin you configuration locations for hadoop, hdfs, hive, etc and NOT use the specific directions to install those parts based on sample documentation.
If you do decide to go down this path, please update here with your progress or create new Questions with specific errors you may have.