<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: issue with hive table in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83473#M85173</link>
    <description>If "set parquet.column.index.access=true", this means the names of the columns in Parquet data is different compared with the columns defined in Hive.&lt;BR /&gt;&lt;BR /&gt;The workaround is to set this at the table level:&lt;BR /&gt;&lt;BR /&gt;ALTER TABLE parquet_table SET TBLPROPERTIES ('parquet.column.index.access'='true');&lt;BR /&gt;&lt;BR /&gt;Hope above can help.</description>
    <pubDate>Mon, 10 Dec 2018 03:24:21 GMT</pubDate>
    <dc:creator>EricL</dc:creator>
    <dc:date>2018-12-10T03:24:21Z</dc:date>
    <item>
      <title>issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82823#M85169</link>
      <description>&lt;P&gt;I just created table store as parquet in hive&amp;amp;impala in the HUE, but when I exec query sql, I found the issue:&lt;/P&gt;&lt;P&gt;the same sql which could exec successfully in impala, but find no record in hive. Sometimes it gave results with many columns values NULL which have values in impala query. I'm not sure whether it is&amp;nbsp;a bug in hive; I tried the both "set parquet.column.index.access=true" and "ALTER TABLE&amp;nbsp;abc SET TBLPROPERTIES ("parquet.column.index.access"="true");"&amp;nbsp; the sql still doesn't work in hive. The sql with easy logic just like :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;select * from dbname.tablename ad where&lt;BR /&gt;ad.month = '2018_11'&lt;BR /&gt;and ad.day = '10'&lt;BR /&gt;and length(ad.x)&amp;gt;0&lt;BR /&gt;and (length(ad.y)=0 or ad.y is null)&lt;BR /&gt;and ad.z = 'z'&lt;BR /&gt;limit 100&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;And when I want to change the format to orc. I justed found the impala couldn't recognize this format. So how could I do to solve this?&lt;/P&gt;</description>
      <pubDate>Tue, 21 Apr 2026 13:43:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82823#M85169</guid>
      <dc:creator>Striver</dc:creator>
      <dc:date>2026-04-21T13:43:31Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82829#M85170</link>
      <description>Couple of questions to understand the issue a bit more:&lt;BR /&gt;&lt;BR /&gt;1. did you insert the data using Impala or hive?&lt;BR /&gt;2. did SELECT * from Hive works OK?&lt;BR /&gt;3. when query is running, do you see any warning or error messages from HS2 log?&lt;BR /&gt;4. Have you tried to remove some conditions to see if some of them might be the cause of the issue?&lt;BR /&gt;5. What version of CDH are you using?&lt;BR /&gt;&lt;BR /&gt;Thanks</description>
      <pubDate>Mon, 26 Nov 2018 11:18:26 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82829#M85170</guid>
      <dc:creator>EricL</dc:creator>
      <dc:date>2018-11-26T11:18:26Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82900#M85171</link>
      <description>&lt;P&gt;1.not insert, handle kafka data to hdfs,and use hive table&amp;amp;impala table to do the query&lt;/P&gt;&lt;P&gt;2.the select * from Hive works fine,but sometimes some columns of the table may be NULL in the results&lt;/P&gt;&lt;P&gt;3.the sql ran successfully with no errors&lt;/P&gt;&lt;P&gt;4.yes, I tried. I found the "set parquet.column.index.access=true" worked before I do the query, but it just took effect in the session lifetime(about 30mins); that could be a tmp solution. Is there any idea could solve this totally? Or I should set the parms every time before I want to do a query for this table.&lt;/P&gt;&lt;P&gt;5.the CDH version is 5.14&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any way, thanks for your kind reply.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Nov 2018 07:27:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/82900#M85171</guid>
      <dc:creator>Striver</dc:creator>
      <dc:date>2018-11-27T07:27:09Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83377#M85172</link>
      <description>&lt;P&gt;Is it a parquet format problem in hive table? Actually when I use scala spark to connect this table, still have no result, seems could not recognize the value of the columns.&lt;/P&gt;</description>
      <pubDate>Thu, 06 Dec 2018 09:51:27 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83377#M85172</guid>
      <dc:creator>Striver</dc:creator>
      <dc:date>2018-12-06T09:51:27Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83473#M85173</link>
      <description>If "set parquet.column.index.access=true", this means the names of the columns in Parquet data is different compared with the columns defined in Hive.&lt;BR /&gt;&lt;BR /&gt;The workaround is to set this at the table level:&lt;BR /&gt;&lt;BR /&gt;ALTER TABLE parquet_table SET TBLPROPERTIES ('parquet.column.index.access'='true');&lt;BR /&gt;&lt;BR /&gt;Hope above can help.</description>
      <pubDate>Mon, 10 Dec 2018 03:24:21 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83473#M85173</guid>
      <dc:creator>EricL</dc:creator>
      <dc:date>2018-12-10T03:24:21Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83475#M85174</link>
      <description>&lt;P&gt;Thanks EricL first for your kind reply.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yes, I know it's a tmp plan to set the parameter, I also use it at the table level, the whole create table sql as below:&lt;/P&gt;&lt;P&gt;CREATE TABLE `adjust_data_new`(&lt;BR /&gt;2 `app_version` string,&lt;BR /&gt;3 `tracker` string,&lt;BR /&gt;4 `network_name` string,&lt;BR /&gt;5 `campaign_name` string,&lt;BR /&gt;6 `is_organic` string,&lt;BR /&gt;7 `rejection_reason` string,&lt;BR /&gt;8 `click_time` string,&lt;BR /&gt;9 `impression_time` string,&lt;BR /&gt;10 `engagement_time` string,&lt;BR /&gt;11 `installed_at` string,&lt;BR /&gt;12 `install_begin_time` string,&lt;BR /&gt;13 `created_at` bigint,&lt;BR /&gt;14 `reattributed_at` string,&lt;BR /&gt;15 `time_to_uninstall` string,&lt;BR /&gt;16 `time_to_reinstall` string,&lt;BR /&gt;17 `uninstalled_at` string,&lt;BR /&gt;18 `reinstalled_at` string,&lt;BR /&gt;19 `adid` string,&lt;BR /&gt;20 `idfa` string,&lt;BR /&gt;21 `android_id` string,&lt;BR /&gt;22 `idfv` string,&lt;BR /&gt;23 `gps_adid` string,&lt;BR /&gt;24 `region` string,&lt;BR /&gt;25 `country` string,&lt;BR /&gt;26 `city` string,&lt;BR /&gt;27 `device_type` string,&lt;BR /&gt;28 `os_name` string,&lt;BR /&gt;29 `timezone` string,&lt;BR /&gt;30 `event` string,&lt;BR /&gt;31 `event_name` string,&lt;BR /&gt;32 `deeplink` string,&lt;BR /&gt;33 `fb_campaign_name` string,&lt;BR /&gt;34 `fb_campaign_id` string,&lt;BR /&gt;35 `fb_adgroup_id` string,&lt;BR /&gt;36 `fb_account_id` string,&lt;BR /&gt;37 `activity_kind` string,&lt;BR /&gt;38 `created_time` bigint)&lt;BR /&gt;39 PARTITIONED BY (&lt;BR /&gt;40 `month` string,&lt;BR /&gt;41 `day` string)&lt;BR /&gt;42 ROW FORMAT SERDE&lt;BR /&gt;43 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'&lt;BR /&gt;44 STORED AS INPUTFORMAT&lt;BR /&gt;45 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'&lt;BR /&gt;46 OUTPUTFORMAT&lt;BR /&gt;47 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'&lt;BR /&gt;48 LOCATION&lt;BR /&gt;49 'hdfs://nameservice1/user/hive/warehouse/kafka_table.db/adjust_data_new'&lt;BR /&gt;50 TBLPROPERTIES (&lt;BR /&gt;51 'DO_NOT_UPDATE_STATS'='true',&lt;BR /&gt;52 'STATS_GENERATED_VIA_STATS_TASK'='true',&lt;BR /&gt;53 'last_modified_by'='hive',&lt;BR /&gt;54 'last_modified_time'='1542947483',&lt;BR /&gt;55 'numRows'='570367760',&lt;BR /&gt;56 'parquet.column.index.access'='true',&lt;BR /&gt;57 'spark.sql.create.version'='2.2 or prior',&lt;BR /&gt;58 'spark.sql.sources.schema.numPartCols'='2',&lt;BR /&gt;59 'spark.sql.sources.schema.numParts'='1',&lt;BR /&gt;60 'spark.sql.sources.schema.part.0'='{\"type\":\"struct\",\"fields\":[{\"name\":\"app_version\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"tracker\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"network_name\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"campaign_name\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"is_organic\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"rejection_reason\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"click_time\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"impression_time\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"engagement_time\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"installed_at\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"install_begin_time\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"created_at\",\"type\":\"long\",\"nullable\":true,\"metadata\":{}},{\"name\":\"reattributed_at\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"time_to_uninstall\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"time_to_reinstall\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"uninstalled_at\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"reinstalled_at\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"adid\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"idfa\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"android_id\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"idfv\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"gps_adid\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"region\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"country\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"city\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"device_type\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"os_name\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"timezone\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"event\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"event_name\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"deeplink\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"fb_campaign_name\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"fb_campaign_id\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"fb_adgroup_id\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"fb_account_id\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"activity_kind\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"created_time\",\"type\":\"long\",\"nullable\":true,\"metadata\":{}},{\"name\":\"month\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}},{\"name\":\"day\",\"type\":\"string\",\"nullable\":true,\"metadata\":{}}]}',&lt;BR /&gt;61 'spark.sql.sources.schema.partCol.0'='month',&lt;BR /&gt;62 'spark.sql.sources.schema.partCol.1'='day',&lt;BR /&gt;63 'totalSize'='24309750927',&lt;BR /&gt;64 'transient_lastDdlTime'='1542947483')&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;but when I do the query:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;select count(*) from adjust_data_new &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;where month='2018_12' &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;and day='10' &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;and activity_kind='&lt;STRONG&gt;session&lt;/STRONG&gt;'&lt;/EM&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;it seems that the condition couldn't be recognized in hive table&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;in impala the result is:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;count(*)&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;290430&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;but in hive the result is &lt;/SPAN&gt;&lt;STRONG&gt;0&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;I confused why the query by&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;select activity_kind from adjust_data_new &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;where month='2018_12' &lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;and day='10' limit 100&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;could work fine in hive, and&amp;nbsp;their are many '&lt;STRONG&gt;session&lt;/STRONG&gt;' in the result list.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 10 Dec 2018 04:16:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83475#M85174</guid>
      <dc:creator>Striver</dc:creator>
      <dc:date>2018-12-10T04:16:15Z</dc:date>
    </item>
    <item>
      <title>Re: issue with hive table</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83589#M85175</link>
      <description>&lt;P&gt;Hi Ericl,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Finally we found the correct direction and got the solution as below:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;change the hive-site.xml in system&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;add&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&amp;lt;property&amp;gt;
    &amp;lt;name&amp;gt;parquet.column.index.access&amp;lt;/name&amp;gt;
    &amp;lt;value&amp;gt;true&amp;lt;/value&amp;gt;
    &amp;lt;description&amp;gt;set parquet index access&amp;lt;/description&amp;gt;
&amp;lt;/property&amp;gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;then restart hive&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Then everything is fine. This issue gave me deep impression,&amp;nbsp; hope this reply could help other guys who have the same issue like me or even guide a way to find the solution. Thanks in the end.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Dec 2018 09:45:58 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/issue-with-hive-table/m-p/83589#M85175</guid>
      <dc:creator>Striver</dc:creator>
      <dc:date>2018-12-11T09:45:58Z</dc:date>
    </item>
  </channel>
</rss>

