<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Is CombineHiveInputFormat deprecated by OrcInputFormat? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159679#M21020</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; &lt;/P&gt;&lt;P&gt;I tried to reproduce it. I used a sample_07 database and CTASed it as TEXT and as ORC table. Get the same error message for both.&lt;/P&gt;&lt;P&gt;It sound weird but my guess would be that this syntax has not worked for a long time. He checks if the classname equals to CombinHiveInputFormat and since All classes extend CombineFileInputFormat I am not sure how that could be true anymore.&lt;/P&gt;&lt;P&gt;My guess would be that in the good old times CombineFileInputFormat was the actual class being used and now the classes just extend it so the check doesn't work anymore. But just a guess.&lt;/P&gt;</description>
    <pubDate>Fri, 26 Feb 2016 18:55:41 GMT</pubDate>
    <dc:creator>bleonhardi</dc:creator>
    <dc:date>2016-02-26T18:55:41Z</dc:date>
    <item>
      <title>Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159676#M21017</link>
      <description>&lt;P&gt;I'm trying to run a TABLESAMLE query with PERCENTAGE and I'm getting&lt;/P&gt;&lt;PRE&gt;Error: Error while compiling statement: FAILED: SemanticException 1:67 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '2' (state=42000,code=40000)

&lt;/PRE&gt;&lt;PRE&gt;
String inputFormat = HiveConf.getVar(conf, HiveConf.ConfVars.HIVEINPUTFORMAT);
-      if (!inputFormat.equals(
-        CombineHiveInputFormat.class.getName())) {
-        throw new SemanticException(generateErrorMessage((ASTNode) tabref.getChild(1),
-            "Percentage sampling is not supported in " + inputFormat));
-      }&lt;/PRE&gt;&lt;P&gt;the above is from test code I found referencing the error and it's from at most Hive 0.12. So I guess my real question is, is TABLESAMPLE with PERCENTAGE still supported and if yes, can it be used with ORC?&lt;/P&gt;</description>
      <pubDate>Fri, 26 Feb 2016 10:27:13 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159676#M21017</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-26T10:27:13Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159677#M21018</link>
      <description>&lt;P&gt;Very weird, someone from the dev team might have a better idea but ORCFileInputFormat actually is implementing CombineHiveInputFormat. CombineHiveInputFormat is the layer between much of Hive and the most commonly used underlying Inputformats. So ORCInputFormat should be a CombineHiveInputFormat and this should not be happening.&lt;/P&gt;&lt;P&gt;Update: I think TABLESAMPLE ( PERCENTAGE ) is broken in general I just tried it with two tables one text format one ORC and neither works. Both with the Error message you got. Now its weird because a CombineHiveInputFormat is also a HiveInputFormat ( which he says I have ). &lt;/P&gt;&lt;PRE&gt;public class OrcInputFormat  implements .... CombineHiveInputFormat&lt;/PRE&gt;</description>
      <pubDate>Fri, 26 Feb 2016 17:57:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159677#M21018</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-02-26T17:57:58Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159678#M21019</link>
      <description>&lt;P&gt;Can you try to reproduce? It is an CSV dataset loaded into an ORC using CTAS. Try to run TABLESAMPLE with percentage.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Feb 2016 18:08:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159678#M21019</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-26T18:08:12Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159679#M21020</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; &lt;/P&gt;&lt;P&gt;I tried to reproduce it. I used a sample_07 database and CTASed it as TEXT and as ORC table. Get the same error message for both.&lt;/P&gt;&lt;P&gt;It sound weird but my guess would be that this syntax has not worked for a long time. He checks if the classname equals to CombinHiveInputFormat and since All classes extend CombineFileInputFormat I am not sure how that could be true anymore.&lt;/P&gt;&lt;P&gt;My guess would be that in the good old times CombineFileInputFormat was the actual class being used and now the classes just extend it so the check doesn't work anymore. But just a guess.&lt;/P&gt;</description>
      <pubDate>Fri, 26 Feb 2016 18:55:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159679#M21020</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-02-26T18:55:41Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159680#M21021</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/168/bleonhardi.html" nodeid="168"&gt;@Benjamin Leonhardi&lt;/A&gt; it doesn't matter whether table is text or ORC, percentage for tablesample is not working. &lt;A rel="user" href="https://community.cloudera.com/users/301/gopal.html" nodeid="301"&gt;@gopal&lt;/A&gt; is this a bug?&lt;/P&gt;&lt;PRE&gt;hive&amp;gt; SELECT * FROM medicare_part_b.medicare_part_b_2013_orc TABLESAMPLE(20 percent);
FAILED: SemanticException 1:67 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '20'
hive&amp;gt; SELECT * FROM medicare_part_b.medicare_part_b_2013_text TABLESAMPLE(20 percent);
FAILED: SemanticException 1:68 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '20'
hive&amp;gt; SELECT * FROM medicare_part_b.medicare_part_b_2013_raw TABLESAMPLE(20 percent);
FAILED: SemanticException 1:67 Percentage sampling is not supported in org.apache.hadoop.hive.ql.io.HiveInputFormat. Error encountered near token '20'
&lt;/PRE&gt;</description>
      <pubDate>Sat, 19 Mar 2016 01:13:34 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159680#M21021</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-19T01:13:34Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159681#M21022</link>
      <description>&lt;P&gt;I'm going to close this as it's a confirmed bug in Hive 1.2.1. I opened a Jira &lt;A href="https://issues.apache.org/jira/browse/HIVE-13312" target="_blank"&gt;https://issues.apache.org/jira/browse/HIVE-13312&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 19 Mar 2016 03:06:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159681#M21022</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-19T03:06:06Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159682#M21023</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; &lt;A rel="user" href="https://community.cloudera.com/users/301/gopal.html" nodeid="301"&gt;@gopal&lt;/A&gt; as said from looking in the code I am pretty sure it is. They check for the hive input format class but sometimes they refactored it to become an interface so the check doesn't work anymore.  &lt;/P&gt;</description>
      <pubDate>Sun, 20 Mar 2016 05:32:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159682#M21023</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-03-20T05:32:48Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159683#M21024</link>
      <description>&lt;P&gt;I confirmed it and opened jira, see below&lt;/P&gt;</description>
      <pubDate>Sun, 20 Mar 2016 06:16:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159683#M21024</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-03-20T06:16:23Z</dc:date>
    </item>
    <item>
      <title>Re: Is CombineHiveInputFormat deprecated by OrcInputFormat?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159684#M21025</link>
      <description>&lt;P&gt;Ah cool didn't see that!&lt;/P&gt;</description>
      <pubDate>Sun, 20 Mar 2016 21:25:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Is-CombineHiveInputFormat-deprecated-by-OrcInputFormat/m-p/159684#M21025</guid>
      <dc:creator>bleonhardi</dc:creator>
      <dc:date>2016-03-20T21:25:57Z</dc:date>
    </item>
  </channel>
</rss>

