Support Questions

Find answers, ask questions, and share your expertise
Celebrating as our community reaches 100,000 members! Thank you!

SolrCloud 7.3 Backup and Restore of Indexes and Collections


Solrcloud 7.3

I have been tasked with writing a backup and restore demo for indexes and collections. I have a question regarding the location when taking a backup. In my address box I have the following URL to get my backup started... http://IPADDRESS:8983/solr/#/collections?action=BACKUP&name=test_catalog&location= the documentation says I should back up to a shared drive, but shared drive where? Is the shared drive on the zookeeper, or on my local machine? If on the zookeeper, do I need to create a backup directory first? And if on my local machine, what do I need to type in as the name and IP Address would not work?

I am also interested to know if it is possible to backup to Azure storage, as that would be ideal?

What is the difference between the location and repository? I would really appreciate any guidance, as there appears to be no step by step on the internet (hard to believe I know), if there is I cannot find it. The official documentation is vague at best (at least for me), any help will be appreciated.

Thank you.




OK, so things have moved on, and now I am hoping someone can help now things have moved on. I have created a mount point on the head node of my HDInsight cluster, this mount point goes to Azure Blob storage (File Share) and I am achieving partial backup, or at least this is what the error is telling me. I think what it really means is that it is not able to do any backups at all as I have yet to see anything appear in the blob storage.

I created a file share using steps from these URLs, the first tells me how to create a file share, the second tells me how to create mount point on an Linux (ubuntu) node to that file share.

I have tested the link to the file share by uploading a txt file to it, which then appears on the Linux node hosting the mount point.

If I execute the following line in the Solr UI, I receive the subsequent error...


{ "responseHeader":{ "status":500, "QTime":216}, "success":{ "1STPartofIPaddress.202:8983_solr":{ "responseHeader":{ "status":0, "QTime":6}}}, "failure":{ "1STPartofIPaddress.207:8983_solr":"org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:Error from server at http://1STPartofIPaddress.207:8983/solr: Failed to backup core=products_yan_1_replica_n4 because org.apache.solr.common.SolrException: Directory to contain snapshots doesn't exist: file:///mnt/solrbkp/prod_cat5"}, "Operation backup caused exception:":"org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: Could not backup all replicas", "exception":{ "msg":"Could not backup all replicas", "rspCode":500}, "error":{ "metadata":[ "error-class","org.apache.solr.common.SolrException", "root-error-class","org.apache.solr.common.SolrException"], "msg":"Could not backup all replicas", "trace":"org.apache.solr.common.SolrException: Could not backup all replicas\n\tat org.apache.solr.client.solrj.SolrResponse.getException(\n\tat org.apache.solr.handler.admin.CollectionsHandler.invokeAction(\n\tat org.apache.solr.handler.admin.CollectionsHandler.handleRequestBody(\n\tat org.apache.solr.handler.RequestHandlerBase.handleRequest(\n\tat org.apache.solr.servlet.HttpSolrCall.handleAdmin(\n\tat org.apache.solr.servlet.HttpSolrCall.handleAdminRequest(\n\tat\n\tat org.apache.solr.servlet.SolrDispatchFilter.doFilter(\n\tat org.apache.solr.servlet.SolrDispatchFilter.doFilter(\n\tat org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(\n\tat org.eclipse.jetty.servlet.ServletHandler.doHandle(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(\n\tat\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(\n\tat org.eclipse.jetty.server.session.SessionHandler.doHandle(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(\n\tat org.eclipse.jetty.server.handler.ContextHandler.doHandle(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.nextScope(\n\tat org.eclipse.jetty.servlet.ServletHandler.doScope(\n\tat org.eclipse.jetty.server.session.SessionHandler.doScope(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.nextScope(\n\tat org.eclipse.jetty.server.handler.ContextHandler.doScope(\n\tat org.eclipse.jetty.server.handler.ScopedHandler.handle(\n\tat org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(\n\tat org.eclipse.jetty.server.handler.HandlerCollection.handle(\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(\n\tat org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(\n\tat org.eclipse.jetty.server.handler.HandlerWrapper.handle(\n\tat org.eclipse.jetty.server.Server.handle(\n\tat org.eclipse.jetty.server.HttpChannel.handle(\n\tat org.eclipse.jetty.server.HttpConnection.onFillable(\n\tat$ReadCallback.succeeded(\n\tat\n\tat$\n\tat org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(\n\tat org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.produce(\n\tat\n\tat org.eclipse.jetty.util.thread.ReservedThreadExecutor$\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(\n\tat org.eclipse.jetty.util.thread.QueuedThreadPool$\n\tat\n", "code":500}}

If I run the same backup command again, I receive...

  "Operation backup caused exception:":"org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: The backup directory already exists: file:///mnt/solrbkp/prod_cat5/",
    "msg":"The backup directory already exists: file:///mnt/solrbkp/prod_cat5/",
    "msg":"The backup directory already exists: file:///mnt/solrbkp/prod_cat5/",

So here it says the location already exists, but it does not exist in the blob store file share where I would expect to find it, and the previous error states it does not exist. So where is it? If I specify the backup name, is the last part of the desired backup location supposed to be created automatically or I'm I correct in specifying it?

Do I need to create the same mount point on multiple nodes? If so, just the nodes that are hosting collections/indexes/shards? Or also on the Head and Zookeeper nodes?

I have tested the link to the file share by uploading a txt file to it, which then appears on the Linux node hosting the mount point.

Thank you in advance for any help or advice.



New Contributor

So I'm guessing because this is old, you were able to resolve. I'm in the middle of doing this similar thing with Solr in AKS. It looks like maybe the location of the backup should have been /mnt and not ///mnt but I'm not sure about that.