<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question S3Guard Suggested to help fix Consistency in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/S3Guard-Suggested-to-help-fix-Consistency/m-p/92370#M35321</link>
    <description>&lt;P&gt;Hi All&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is Cloudera suggesting to use S3Guard as a solution for the consistency problem in&amp;nbsp;&lt;SPAN&gt;multi-step ETL&lt;/SPAN&gt;? Cause in the reference architecture, the suggested option is to load data from S3 into HDFS and then write back to S3?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;CK&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 08 Jul 2019 15:19:21 GMT</pubDate>
    <dc:creator>CK71</dc:creator>
    <dc:date>2019-07-08T15:19:21Z</dc:date>
    <item>
      <title>S3Guard Suggested to help fix Consistency</title>
      <link>https://community.cloudera.com/t5/Support-Questions/S3Guard-Suggested-to-help-fix-Consistency/m-p/92370#M35321</link>
      <description>&lt;P&gt;Hi All&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is Cloudera suggesting to use S3Guard as a solution for the consistency problem in&amp;nbsp;&lt;SPAN&gt;multi-step ETL&lt;/SPAN&gt;? Cause in the reference architecture, the suggested option is to load data from S3 into HDFS and then write back to S3?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;CK&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Jul 2019 15:19:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/S3Guard-Suggested-to-help-fix-Consistency/m-p/92370#M35321</guid>
      <dc:creator>CK71</dc:creator>
      <dc:date>2019-07-08T15:19:21Z</dc:date>
    </item>
    <item>
      <title>Re: S3Guard Suggested to help fix Consistency</title>
      <link>https://community.cloudera.com/t5/Support-Questions/S3Guard-Suggested-to-help-fix-Consistency/m-p/92416#M35322</link>
      <description>&lt;P&gt;Yes that is correct, and the motivations/steps-to-use are reflected here too: &lt;A href="https://www.cloudera.com/documentation/enterprise/6/latest/topics/cm_s3guard.html" target="_blank" rel="noopener"&gt;https://www.cloudera.com/documentation/enterprise/6/latest/topics/cm_s3guard.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Note: On your point of 'load data from S3 into HDFS', it is better stated as simply 'read data from S3', where HDFS gets used as a transient storage (where/when required). There does not need to be a 'download X GiB data from S3 to HDFS first, only then begin jobs' step, as distributed jobs can read off of S3 via s3a:// URLs in the same way they do from HDFS hdfs://.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2019 07:55:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/S3Guard-Suggested-to-help-fix-Consistency/m-p/92416#M35322</guid>
      <dc:creator>Harsh J</dc:creator>
      <dc:date>2019-07-09T07:55:39Z</dc:date>
    </item>
  </channel>
</rss>

