Member since: 01-18-2016
Posts: 169
Kudos Received: 32
Solutions: 21

My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 1627 | 06-27-2025 06:00 AM |
| | 1339 | 01-14-2025 06:30 PM |
| | 1860 | 04-06-2018 09:24 PM |
| | 2008 | 05-02-2017 10:43 PM |
| | 5200 | 01-24-2017 08:21 PM |
07-13-2016
12:11 PM
@Saurabh Kumar - Since this example uses the Sandbox ZooKeeper rather than the embedded ZK, try adding /solr to the end of the ZooKeeper entry in your create command, like this: sandbox.hortonworks.com:2181/solr. I'm not sure that will solve the issue, but when Solr runs in cloud mode with an external ZooKeeper it makes /solr the ZooKeeper root to keep its data separate from other services.
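Something along these lines, assuming the standard lucidworks-hdpsearch install paths (adjust to match the tutorial step you're on):

```bash
# Create the /solr chroot in the Sandbox ZooKeeper if it doesn't exist yet
/opt/lucidworks-hdpsearch/solr/server/scripts/cloud-scripts/zkcli.sh \
    -zkhost sandbox.hortonworks.com:2181 -cmd makepath /solr

# Start Solr in cloud mode, pointing at the external ZooKeeper with the chroot appended
/opt/lucidworks-hdpsearch/solr/bin/solr start -c \
    -z sandbox.hortonworks.com:2181/solr
```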
07-12-2016
06:39 PM
It's telling you that it cannot find the file in your confdir (the directory after -d in your command). Do an ls on that directory: either the directory itself is missing, or solrconfig.xml is missing from /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs_hdfs/conf. You may have overlooked a step in the setup (Step 2), which includes creating the conf directory and modifying the solrconfig.xml file: cp -R /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs_hdfs
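A quick way to check, assuming the paths from the tutorial:

```bash
# Does the config set the -d flag points at actually contain a solrconfig.xml?
ls /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs_hdfs/conf/solrconfig.xml

# If not, recreate it from the stock config set (Step 2), then edit
# data_driven_schema_configs_hdfs/conf/solrconfig.xml for HDFS as the setup describes
cp -R /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs \
      /opt/lucidworks-hdpsearch/solr/server/solr/configsets/data_driven_schema_configs_hdfs
```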
06-30-2016
04:14 PM
Thanks for the info @vpoornalingam. We resolved the issue. It turns out that when we changed the path, Ambari suggested making other changes, which we mindlessly accepted. Obviously we should have paid more attention. One of the changes dropped the embedded HBase master heap size to a much lower value. I realized this after looking at the hbase-ams-master-<FQDN>.out log rather than the ambari-metrics-collector.log I had been looking at.
06-29-2016
10:53 PM
Changing it broke the Metrics Collector... I need to move the AMS hbase.rootdir to another partition, so I created a directory, did a chown -R ams:hadoop MYDIR, changed the configuration value, and restarted AMS. Now the Metrics Collector will not start. It throws a connection-refused exception when trying to connect to ZooKeeper on localhost:61181 (unfortunately, I don't have access to the exact exception at the moment). This is on HDP 2.4. Nothing is listening on port 61181, which I believe should be the embedded ZooKeeper port: hbase.zookeeper.property.clientPort={{zookeeper_clientPort}}. I killed all AMS processes to be sure nothing was left in a bad state. I also tried copying the old hbase.rootdir contents to my new directory with the same permissions, but it still fails. When I switch back to the old location it works fine. This seems very similar to switching to distributed mode, so I don't understand what's going wrong.
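Roughly what I did to prepare the new location (MYDIR is a placeholder, and the old path below is just the HDP default; substitute whatever your cluster uses):

```bash
# Create the new rootdir on the other partition and hand it to the ams user
mkdir -p /MYDIR/ams/hbase
chown -R ams:hadoop /MYDIR/ams/hbase

# I also tried seeding it with the existing data, preserving ownership and permissions
cp -rp /var/lib/ambari-metrics-collector/hbase/. /MYDIR/ams/hbase/
```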
Labels:
- Apache Ambari
06-29-2016
05:12 PM
Awesome, thanks. I wasn't sure whether, under the covers, Ranger was just doing SQL grants.
06-29-2016
04:54 PM
1 Kudo
We are enabling Ranger authorization in Hive, but previously we created roles and grants in beeline. Should we remove the manually created Hive grants and roles in beeline before switching from SQL standard authorization (StdSqlAuth) to Ranger authorization, or will magic happen? The roles we created match the AD group names in our policies. (HDP 2.4)
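For context, these are the kinds of objects I mean; a rough way to inventory them before the switch (the JDBC URL and role name are placeholders):

```bash
# List the roles and grants created under SQL standard authorization via beeline
beeline -u "jdbc:hive2://<hiveserver2-host>:10000/default" -e "SHOW ROLES;"
beeline -u "jdbc:hive2://<hiveserver2-host>:10000/default" -e "SHOW GRANT ROLE my_ad_group_role;"
```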
Labels:
- Apache Hive
- Apache Ranger
06-27-2016
02:52 PM
Awesome. Thanks.
06-23-2016
03:20 AM
1 Kudo
@rbiswas, you may have read this already, but there's some good info here describing what they call a "real world" production configuration using the new cross-data-center replication: https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=62687462 Since this feature only arrived in 6.0, which was released less than two months ago, production use has probably been limited.

ALSO... not a best practice, but since long before SolrCloud existed, we used a brute-force method of cross-data-center replication for standby Solrs with the magic of rsync. You can reliably use rsync to copy indexes while they are being updated, but there's a bit of scripting required. I have only done this in non-cloud environments, but I'm pretty sure it can be done in cloud mode as well. It is crude, but it worked for years and uses some of the great features of Linux. Example script, run from cron on the DR-site nodes:

#step 1 - create a backup first, assuming your current copy is good
cp -rl ${data_dir} ${data_dir}.BAK
#step 2 - Now copy from the primary site
status=1
while [ "$status" -ne 0 ]; do
    # trailing slashes make rsync mirror the directory contents instead of nesting a copy
    rsync -a --delete ${primary_site_node}:${data_dir}/ ${data_dir}/
    status=$?
done
echo "COPY COMPLETE!"
That script creates a local backup (instantly, via hard links rather than soft links), then copies only the new files and deletes from DR any files that have been deleted from the primary/remote site. If files disappear during the copy, rsync exits with a non-zero status and the loop runs it again until a pass completes with nothing changing. It can be run from crontab, but it does need a bit of bullet-proofing. Simple. Crude. It works.
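For example, a crontab entry along these lines (the script path, log path, and schedule are just placeholders):

```bash
# Hypothetical crontab entry on each DR node: pull from the primary site nightly at 02:00
0 2 * * * /opt/scripts/solr_dr_sync.sh >> /var/log/solr_dr_sync.log 2>&1
```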
06-21-2016
05:25 PM
1 Kudo
Note that SolrCloud's replication is not intended to span data centers, due to the volume of traffic and the dependency on the ZooKeeper ensemble. However, the recently released 6.x line added a special replication mode for going across data centers: https://issues.apache.org/jira/browse/SOLR-6273, which is based on this design: http://yonik.com/solr-cross-data-center-replication/ Basically, it is cross-cluster replication, which is different from standard SolrCloud replication.
06-08-2016
12:41 PM
1 Kudo
@David Lam did this work for you?