<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Hive aggregate query failing for External table in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145757#M19896</link>
    <description>&lt;P&gt;I'm using a custom S3 for Eucalyptus, not the AWS one. I have been trying to resolve this since past few weeks.&lt;/P&gt;</description>
    <pubDate>Wed, 17 Feb 2016 20:41:34 GMT</pubDate>
    <dc:creator>phoncy_joseph</dc:creator>
    <dc:date>2016-02-17T20:41:34Z</dc:date>
    <item>
      <title>Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145755#M19894</link>
      <description>&lt;P&gt;I have a Hadoop cluster(HDP 2.2) set-up in Eucalyptus environment. I have created an external table in Hive(0.14), using the below query:&lt;/P&gt;&lt;PRE&gt;CREATE EXTERNAL TABLE tempbatting (col_value STRING) LOCATION 's3n://hive-bucket/';
&lt;/PRE&gt;&lt;P&gt;I'm using a custom S3 location, so I have set jets3t property in Hive configuration directory as below:&lt;/P&gt;&lt;PRE&gt;set s3service.https-only = true;
set s3service.s3-endpoint = s3-customlocation.net;
set s3service.s3-endpoint-http-port   = 80;
set s3service.s3-endpoint-https-port = 443;
set s3service.disable-dns-buckets = true;
set s3service.enable-storage-classes = false;
&lt;/PRE&gt;&lt;P&gt;Though I'm able to execute simple select queries on the table successfully, the aggregate queries are failing. Below are the logs:&lt;/P&gt;&lt;PRE&gt;Error: java.io.IOException: java.lang.reflect.InvocationTargetException
        at org.apache.hadoop.hive.io.HiveIOExceptionHandlerChain.handleRecordReaderCreationException(HiveIOExceptionHandlerChain.java:97)
        at org.apache.hadoop.hive.io.HiveIOExceptionHandlerUtil.handleRecordReaderCreationException(HiveIOExceptionHandlerUtil.java:57)
 
Caused by: org.apache.http.conn.ConnectTimeoutException: Connect to hive-bucket.s3.amazonaws.com:443 timed out
        at org.apache.http.conn.ssl.SSLSocketFactory.connectSocket(SSLSocketFactory.java:416)
&lt;/PRE&gt;&lt;P&gt;From the logs, the map-reduce job seems to access Amazon S3.  I have tried using the  set command for Hive(set fs.s3n.endpoint=s3-customlocation.net), but it didn't seem to work. Is there a way to specify custom end-point?&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 20:35:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145755#M19894</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-17T20:35:22Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145756#M19895</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/2713/phoncyjoseph.html" nodeid="2713"&gt;@Phoncy Joseph&lt;/A&gt;&lt;P&gt; This is from AWS forums &lt;/P&gt;&lt;P&gt;"I tried to rerun my job again and this time it is finished successfully. So I guess it may be related with s3 service unstable in rent two days at least from error message.&lt;/P&gt;&lt;P&gt;I hope this issue not happen again."&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="https://forums.aws.amazon.com/message.jspa?messageID=478835"&gt;link&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 20:38:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145756#M19895</guid>
      <dc:creator>nsabharwal</dc:creator>
      <dc:date>2016-02-17T20:38:02Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145757#M19896</link>
      <description>&lt;P&gt;I'm using a custom S3 for Eucalyptus, not the AWS one. I have been trying to resolve this since past few weeks.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 20:41:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145757#M19896</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-17T20:41:34Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145758#M19897</link>
      <description>&lt;P&gt;OK just to repeat. You can access s3 through hive with simple queries? So it cannot be a connection problem right?&lt;/P&gt;&lt;P&gt;Perhaps too many parallel connections timing out when all the mappers spin up?&lt;/P&gt;&lt;P&gt;Do you see some tasks successfully completing and then some tasks failing after 3 retries? In this case it sounds like a timeout issue.&lt;/P&gt;&lt;P&gt;I have seen some issues in google like this that tried to fix it by increasing connection timeouts and retries. However mostly in presto forums.&lt;/P&gt;&lt;P&gt;However there are s3 parameters available in the hdfs-site configuration&lt;/P&gt;&lt;P&gt;&lt;A href="https://hadoop.apache.org/docs/r2.6.3/hadoop-project-dist/hadoop-common/core-default.xml" target="_blank"&gt;https://hadoop.apache.org/docs/r2.6.3/hadoop-project-dist/hadoop-common/core-default.xml&lt;/A&gt;&lt;/P&gt;&lt;P&gt;fs.s3a.connection.timeout&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 20:57:37 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145758#M19897</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-02-17T20:57:37Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145759#M19898</link>
      <description>&lt;P&gt;Thanks for the response. Yes, I'm able to access S3 through simple Hive queries.From the logs, I could see that the map-reduce job is trying to connect to "hive-bucket.s3.amazonaws.com:443", which doesn't exist. I need to connect to a custom S3 endpoint, which is "s3-customlocation.net". I have gone through the hdfs-site configuration,but I couldnt find any parameter to set custom endpoint.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 21:08:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145759#M19898</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-17T21:08:23Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145760#M19899</link>
      <description>&lt;P&gt;&lt;A href="https://issues.apache.org/jira/browse/HADOOP-11261"&gt;https://issues.apache.org/jira/browse/HADOOP-11261&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Which version of Hadoop are you using? &lt;/P&gt;&lt;P&gt;"It also enables using a custom url pointing to an S3-compatible object store."&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 21:33:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145760#M19899</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-02-17T21:33:43Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145761#M19900</link>
      <description>&lt;P&gt;I'm using Hadoop 2.6.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Feb 2016 00:34:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145761#M19900</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-18T00:34:02Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145762#M19901</link>
      <description>&lt;P&gt;I only understand half of the s3 problems but it might be that you need to upgrade if a custom url is what you want.&lt;/P&gt;&lt;P&gt;&lt;A href="https://issues.apache.org/jira/browse/HADOOP-11261"&gt;https://issues.apache.org/jira/browse/HADOOP-11261&lt;/A&gt;&lt;/P&gt;&lt;P&gt;"It also enables using a custom url pointing to an S3-compatible object store."&lt;/P&gt;&lt;UL&gt;
&lt;LI&gt;Fix Version/s:&lt;A href="https://issues.apache.org/jira/browse/HADOOP/fixforversion/12327583"&gt;2.7.0&lt;/A&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Thu, 18 Feb 2016 03:23:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145762#M19901</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-02-18T03:23:21Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145763#M19902</link>
      <description>&lt;P&gt;@phoncy Joseph any progress on this?&lt;/P&gt;</description>
      <pubDate>Sun, 21 Feb 2016 02:19:00 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145763#M19902</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-21T02:19:00Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145764#M19903</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; Copied jets3t.properties to all data nodes. Currently I'm getting below exception:&lt;/P&gt;&lt;PRE&gt;org.apache.hadoop.fs.s3.S3Exception: org.jets3t.service.ServiceException: S3 Error Message. -- ResponseCode: 403, ResponseStatus: Forbidden, XML Error Message: &amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;&amp;lt;Error&amp;gt;&amp;lt;Code&amp;gt;AccessDenied&amp;lt;/Code&amp;gt;&amp;lt;Message&amp;gt;Access Denied&amp;lt;/Message&amp;gt;&amp;lt;Resource&amp;gt;/hive-bucket&amp;lt;/Resource&amp;gt;&amp;lt;RequestId&amp;gt;&amp;lt;/RequestId&amp;gt;&amp;lt;/Error&amp;gt;
        at org.apache.hadoop.fs.s3native.Jets3tNativeFileSystemStore.processException(Jets3tNativeFileSystemStore.java:470)



&lt;/PRE&gt;</description>
      <pubDate>Mon, 22 Feb 2016 18:09:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145764#M19903</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-22T18:09:45Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145765#M19904</link>
      <description>&lt;P&gt;tey the s3a &lt;A href="https://hadoop.apache.org/docs/stable/hadoop-aws/tools/hadoop-aws/index.html" target="_blank"&gt;https://hadoop.apache.org/docs/stable/hadoop-aws/tools/hadoop-aws/index.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 22 Feb 2016 20:58:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145765#M19904</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-22T20:58:43Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145766#M19905</link>
      <description>&lt;P&gt;Though have not yet upgraded to Hadoop 2.7, I made the configuration changes for s3a as per the documentation. On executing Hive create query, I got the below exception:&lt;/P&gt;&lt;PRE&gt;FAILED: AmazonClientException Unable to execute HTTP request: Connect to hive-bucket.s3.amazonaws.com:443 timed out
&lt;/PRE&gt;</description>
      <pubDate>Fri, 26 Feb 2016 14:49:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145766#M19905</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-02-26T14:49:08Z</dc:date>
    </item>
    <item>
      <title>Re: Hive aggregate query failing for External table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145767#M19906</link>
      <description>&lt;P&gt;I have upgraded to Hadoop 2.7 now. I have done configurations changes for s3a and the queries are executing successfully. Thank you.&lt;/P&gt;</description>
      <pubDate>Tue, 22 Mar 2016 13:00:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Hive-aggregate-query-failing-for-External-table/m-p/145767#M19906</guid>
      <dc:creator>phoncy_joseph</dc:creator>
      <dc:date>2016-03-22T13:00:36Z</dc:date>
    </item>
  </channel>
</rss>

