<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Setting max S3 connections in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48125#M47442</link>
    <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-8-x/topics/impala_s3.html#s3_best_practices" target="_self"&gt;Best Practices for Using Impala with S3&lt;/A&gt; states "Set the safety valve fs.s3a.connection.maximum to 1500 for &lt;SPAN class="keyword cmdname"&gt;impalad&lt;/SPAN&gt;."&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone clarify which safety valve field should be used and with what syntax? I have read that this setting belongs in core-site.xml, but the Impala configuration in Cloudera Manager does not seem to have a safety valve for core-site.xml. The instructions mention a safety valve for impalad, but that safety valve seems to be for command-line arguments to impalad.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The problem we are trying to address is&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hdfsSeek(desiredPos=503890631): FSDataInputStream#seek error:&lt;BR /&gt;com.cloudera.com.amazonaws.AmazonClientException: Unable to execute HTTP request: Timeout waiting for connection from pool&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;that we keep getting when using Impala to query data stored in S3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are using CDH 5.8.3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Petter&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 10:50:05 GMT</pubDate>
    <dc:creator>Pettax</dc:creator>
    <dc:date>2022-09-16T10:50:05Z</dc:date>
    <item>
      <title>Setting max S3 connections</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48125#M47442</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.cloudera.com/documentation/enterprise/5-8-x/topics/impala_s3.html#s3_best_practices" target="_self"&gt;Best Practices for Using Impala with S3&lt;/A&gt; states "Set the safety valve fs.s3a.connection.maximum to 1500 for &lt;SPAN class="keyword cmdname"&gt;impalad&lt;/SPAN&gt;."&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can anyone clarify which safety valve field should be used and with what syntax? I have read that this setting belongs in core-site.xml, but the Impala configuration in Cloudera Manager does not seem to have a safety valve for core-site.xml. The instructions mention a safety valve for impalad, but that safety valve seems to be for command-line arguments to impalad.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The problem we are trying to address is&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;hdfsSeek(desiredPos=503890631): FSDataInputStream#seek error:&lt;BR /&gt;com.cloudera.com.amazonaws.AmazonClientException: Unable to execute HTTP request: Timeout waiting for connection from pool&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;that we keep getting when using Impala to query data stored in S3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are using CDH 5.8.3.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Petter&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 10:50:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48125#M47442</guid>
      <dc:creator>Pettax</dc:creator>
      <dc:date>2022-09-16T10:50:05Z</dc:date>
    </item>
    <item>
      <title>Re: Setting max S3 connections</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48200#M47443</link>
      <description>&lt;P&gt;Hi Pettax,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You should be able to find the safety valve in Cloudera Manager under the HDFS service, because the S3A connector used by Impala is managed by the HDFS service. The field is titled "&lt;SPAN&gt;Cluster-wide Advanced Configuration Snippet (Safety Valve) for core-site.xml".&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Let me know if you have any other issues.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;- Sailesh&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 30 Nov 2016 16:32:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48200#M47443</guid>
      <dc:creator>saileshmukil</dc:creator>
      <dc:date>2016-11-30T16:32:19Z</dc:date>
    </item>
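    <!--
    Editorial sketch, not part of the original thread: the question asks what
    syntax the safety valve expects, and the reply names the field but not the
    syntax. Assuming the "Cluster-wide Advanced Configuration Snippet (Safety
    Valve) for core-site.xml" field accepts standard Hadoop property XML (an
    assumption, not confirmed in this thread), the entry might look like the
    fragment below. The value 1500 is the one quoted from the Impala S3 best
    practices page in the question.

    ```xml
    <property>
      <name>fs.s3a.connection.maximum</name>
      <value>1500</value>
    </property>
    ```

    Whether a restart or a client configuration redeploy is needed for the
    change to reach impalad is not stated in the thread.
    -->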
    <item>
      <title>Re: Setting max S3 connections</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48339#M47444</link>
      <description>&lt;P&gt;Thank you &lt;SPAN&gt;Sailesh&lt;/SPAN&gt;!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This solved my problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Br,&lt;/P&gt;&lt;P&gt;Petter&lt;/P&gt;</description>
      <pubDate>Tue, 06 Dec 2016 16:12:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Setting-max-S3-connections/m-p/48339#M47444</guid>
      <dc:creator>Pettax</dc:creator>
      <dc:date>2016-12-06T16:12:08Z</dc:date>
    </item>
  </channel>
</rss>

