About Darren

Darren · ‎09-05-2014

Use the API command to create the HDFS temp dir. Were you not able to find it in your version of the API?

Darren · ‎09-05-2014

If you click on the Hive Metastore and look in the processes tab, you should be able to find the stderr. You do not need to manually modify hive-site to add the connection URL. You are probably looking at the wrong copy hive-site.xml. The one used by the metastore is also shown on the processes tab. The java heap size charts should be visible if you click on the Hive Metastore Server and then on Charts Library and look for the Resident Memory chart.

Darren · ‎09-05-2014

Did you do the step to create the HDFS /tmp directory? This is described via command line in the blog post you linked, but there is also an API command to do this. Those instructions are fairly dated. You should manually set up a cluster, look at all the steps performed by First Run when initially setting up the cluster, and make sure you do all of those steps.

Darren · ‎09-05-2014

You should not have needed to modify hive-site.xml. What instructions did you follow? Is there any information in the role's stderr log? Do the Java heap size charts look troubling around the time of the failure?

Darren · ‎09-05-2014

Glad you solved this! Keep in mind that when using the symlink, you may need to re-create it whenever you upgrade your cloudera-scm-server-db package in the future, since the symlink confuses the packaging code. Thanks, Darren

Darren · ‎08-27-2014

Hi, Sentry service stores policy information in a relational database, whereas the Policy File implementation uses a file. You should never use both at the same time as that would be redundant. When using Sentry service, you issue grants and revokes via the HiveServer2 client beeline. The descriptions for the Sentry configuration in the CM UI have links to documentation explaining the usage, which should answer all of your questions. Thanks, Darren

Darren · ‎08-26-2014

Hi, You need to be using CDH5.1 or higher, and make sure CM knows that it is CDH5.1 or higher. On the home page, CM will report what it believes the cluster's version is. If that doesn't say 5.1, then you need to fix it by either installing the correct CDH version or configuring the version in CM, as described here: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CM5/latest/Cloudera-Manager-Managing-Clusters/cm5mc_config_package_version.html

Darren · ‎08-25-2014

See example here: http://cloudera.github.io/cm_api/docs/python-client/#managing_parcels

Darren · ‎08-19-2014

Yes, that's basically the reason. You also wouldn't have to leave the data directory in the hardcoded location that CM uses, which doesn't always work well for folks as the database grows large.

Darren · ‎08-19-2014

Hi, In general, we suggest that you use an external database for production. The embedded database is just handy for getting started. The embedded database is just a regular postgresql that is started by custom init scripts on a custom port in a custom data directory. When creating the Hive service, the wizard will prompt you if you'd like to use the embedded database or an external one. If you use the embedded database, a user role and a database will be created with the correct permissions for you automatically. If you use an external one, you must do these steps yourself and provide the host / port / database name / username / password. The CM documentation does not say to do a yum install of postgres. You just use the Cloudera Manager UI and click the Add Service option in the dropdown menu by your cluster name, same as adding any other service. For smaller clusters, it's fine to consolidate onto a single database. As your load grows, you'll want to migrate some databases and possibly their roles to different hosts. I wouldn't run two PostgreSQL on the same host as that will just consume more RAM than is really needed. It would be better to consolidate onto the external PostgreSQL, as the embedded one is not intended for production. Thanks, Darren

Online	Offline
Last Visited	‎05-21-2019 01:27 PM

Member Since	‎07-30-2013 10:59 AM
Last Visited	‎05-21-2019 01:27 PM
Posts	509
Kudos received	112

Cloudera Community

Re: Install using CM of Datanodes with different n...

Re: Cloudera API doesn't return any roles for a ho...

Re: Stopping selected roles from Service action me...

Re: CSD Role with Multiple SSLServers

Re: CSD - SSLServer Paramaters

Re: MR's Jobtracker can't be started due to permis...

Re: CDH 5.1 Hive Metastore Server can't stay up

Re: MR's Jobtracker can't be started due to permis...

Re: CDH 5.1 Hive Metastore Server can't stay up

Re: Unable to delete hosts or assign roles

Re: Where can I configure sentry policies in cloud...

Re: Sentry Service not found in Cloudera Manager 5...

Re: How to install a parcel using CM API?

Re: Embedded vs External database?

Re: Embedded vs External database?