<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24599#M19959</link>
    <description>&lt;P&gt;Stefano,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; Is it possible to force no HDFS replication when I create a collection (if the HDFS replication factor has already been set to &amp;gt;=2)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't think Solr can control the HDFS block replication factor. If you just want to reduce the number of block replicas, running "hdfs dfs -setrep" should work (please note that this reduces reliability).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; Are there best practices for adding replicas after I have created an n-sharded collection without replicas? Can you give me an official link?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;While we don't officially recommend this operation, I found the following answer on Stack Overflow.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="http://stackoverflow.com/questions/18441893/add-shard-replica-in-solrcloud" target="_blank"&gt;http://stackoverflow.com/questions/18441893/add-shard-replica-in-solrcloud&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Please note that this can be dangerous if done incorrectly.&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 12 Feb 2015 13:05:12 GMT</pubDate>
    <dc:creator>dice</dc:creator>
    <dc:date>2015-02-12T13:05:12Z</dc:date>
    <item>
      <title>Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R/W?</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24507#M19956</link>
      <description>&lt;P&gt;Hi all,&lt;BR /&gt;I would like some clarification about the Cloudera Search architecture, in particular the way it manages replicas.&lt;BR /&gt;I know that indexes are stored in HDFS and that HDFS has its own replication factor, so a directory is replicated N times in the cluster.&lt;BR /&gt;I know that with "sharding" you can split an index into multiple pieces stored in different directories on HDFS. A "replica", in turn, adds more directories containing copies of the index pieces.&lt;/P&gt;&lt;P&gt;I would like to know:&lt;BR /&gt;- Why do I need to add collection replica nodes if I already have HDFS replicas?&lt;BR /&gt;- In an N-node cluster without collection replicas, what happens if a node goes down? How can I read and write the piece of the index owned by the corresponding shard?&lt;BR /&gt;- What is the best way to save storage without relying on both HDFS replicas and shard replicas?&lt;BR /&gt;- How can I achieve high availability on Cloudera Search?&lt;/P&gt;&lt;P&gt;I have never found documentation or a clear explanation of this topic.&lt;BR /&gt;Thank you so much for your support,&lt;BR /&gt;Stefano&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 09:21:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24507#M19956</guid>
      <dc:creator>sgazzola</dc:creator>
      <dc:date>2022-09-16T09:21:14Z</dc:date>
    </item>
    <item>
      <title>Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24512#M19957</link>
      <description>&lt;P&gt;Stefano,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; Why do I need to add collection replica nodes if I already have HDFS replicas?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Queries are issued against indices (shards), not against HDFS block replicas. That's why you need shards in SolrCloud even though you have HDFS block replicas. HDFS block replication protects the index files themselves, but it does not give you another Solr server that can answer queries for that shard. Shard replicas provide that robustness at the SolrCloud level. See the following link as well.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/search_glossary.html"&gt;http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/search_glossary.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&amp;gt; In an N-node cluster without collection replicas, what happens if a node goes down? How can I read and write the piece of the index owned by the corresponding shard?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If you have only one replica for a given shard, that shard becomes unsearchable once the node hosting the replica goes down.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&amp;gt; What is the best way to save storage without relying on both HDFS replicas and shard replicas?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It depends on your requirements. It is a trade-off between saving storage and maintaining fault tolerance and reliability.&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&amp;gt; How can I achieve high availability on Cloudera Search?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This is covered in the following guide:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/search_ha_proxy.html"&gt;http://www.cloudera.com/content/cloudera/en/documentation/core/latest/topics/search_ha_proxy.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Feb 2015 11:23:17 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24512#M19957</guid>
      <dc:creator>dice</dc:creator>
      <dc:date>2015-02-10T11:23:17Z</dc:date>
    </item>
    <item>
      <title>Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24516#M19958</link>
      <description>&lt;P&gt;Thank you very much for your quick and precise answers.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I would also like to know:&lt;/P&gt;&lt;P&gt;- Is it possible to force no HDFS replication when I create a collection (if the HDFS replication factor has already been set to &amp;gt;=2)?&lt;/P&gt;&lt;P&gt;- Are there best practices for adding replicas after I have created an n-sharded collection without replicas? Can you give me an official link?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you,&lt;/P&gt;&lt;P&gt;Stefano&lt;/P&gt;</description>
      <pubDate>Tue, 10 Feb 2015 13:42:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24516#M19958</guid>
      <dc:creator>sgazzola</dc:creator>
      <dc:date>2015-02-10T13:42:35Z</dc:date>
    </item>
    <item>
      <title>Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24599#M19959</link>
      <description>&lt;P&gt;Stefano,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; Is it possible to force no HDFS replication when I create a collection (if the HDFS replication factor has already been set to &amp;gt;=2)?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't think Solr can control the HDFS block replication factor. If you just want to reduce the number of block replicas, running "hdfs dfs -setrep" should work (please note that this reduces reliability).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;gt; Are there best practices for adding replicas after I have created an n-sharded collection without replicas? Can you give me an official link?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;While we don't officially recommend this operation, I found the following answer on Stack Overflow.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="http://stackoverflow.com/questions/18441893/add-shard-replica-in-solrcloud" target="_blank"&gt;http://stackoverflow.com/questions/18441893/add-shard-replica-in-solrcloud&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Please note that this can be dangerous if done incorrectly.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2015 13:05:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24599#M19959</guid>
      <dc:creator>dice</dc:creator>
      <dc:date>2015-02-12T13:05:12Z</dc:date>
    </item>
    <item>
      <title>Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24626#M19960</link>
      <description>&lt;P&gt;I would suggest using a curl command to add the replica rather than the Solr UI. Here is the wiki reference on how to do that using the Collections API.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api_addreplica" target="_blank"&gt;https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api_addreplica&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You may want to do this during a quiet period, since it puts additional I/O load on the system; how much depends on your cluster environment and index size.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Nishan&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2015 21:06:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24626#M19960</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2015-02-12T21:06:43Z</dc:date>
    </item>
    <item>
      <title>Re: Cloudera Search: How HDFS replica and shards/replica collaborate to achieve Fault Tolerance in R</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24814#M19961</link>
      <description>&lt;P&gt;Thank you so much for your advice.&lt;/P&gt;&lt;P&gt;Stefano&lt;/P&gt;</description>
      <pubDate>Wed, 18 Feb 2015 13:55:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Cloudera-Search-How-HDFS-replica-and-shards-replica/m-p/24814#M19961</guid>
      <dc:creator>sgazzola</dc:creator>
      <dc:date>2015-02-18T13:55:13Z</dc:date>
    </item>
  </channel>
</rss>

