02-03-2016 12:06 AM
Based on your experience and knowledge, can you suggest which database (MySQL or Oracle) should be used for the databases of services such as Cloudera Manager, Hive, Sqoop, Sentry, and Oozie?
I have experience with MySQL, but we plan to use Oracle because of certain advantages in maintaining it within our organization.
It would be great if someone could highlight the advantages and disadvantages of Oracle.
07-17-2016 02:52 AM
Hello, I have this question too and if you don't mind, I'd like to add some other considerations.
I see that CDH services usually declare compatibility with Oracle, MySQL, and Postgres. However, not all of them support all three (Hue, for instance), and on closer inspection MySQL seems to be the only one that is compatible across nearly all services.
So I think that for now the best bet is on MySQL (I don't want Oracle, anyway).
I am doing some research on a DB that supports HA. So far I have found two solutions that provide HA for MySQL: Percona XtraDB Cluster and MariaDB Galera, where the first actually uses libraries from the latter and adds some other interesting features.
My question is: what is Cloudera's position on running the backend DB in HA?
Let me say that the documentation does not cover this well: there are guides for making HS2 and HMS read from an HA DB, but few considerations or best practices. My ultimate goal is to make HMS and HS2 truly HA by adding an HA backend DB with a load balancer on top of it, so I can:
I know that Cloudera would probably not endorse one of them over the other, but I'd like some recommendations (maybe one of them is already a Cloudera partner), or to know whether any tests have been done in the past.
I am interested in Percona: while Galera is in alpha state (though they say it is reliable), Percona offers support and reports that some companies are already using it in production environments.
I am also interested in paid support.
Looking forward to your reply, thanks.
04-05-2017 10:40 AM
I too am working on a similar no-single-point-of-failure solution for the back end.
So far, Galera Cluster with MySQL or MariaDB seems to be the one solution that will work, but it may not be easy.
Take a look at OOZIE-2854.
04-06-2017 12:26 AM