About vtpcnk

vtpcnk · ‎11-30-2018

I tried the below but it didn't work : hive> create view testview as select * from test1 where id = "{$hiveconf:id}"; OK Time taken: 0.13 seconds hive> set id=1; hive> select * from testview; Above query did not return any rows.

vtpcnk · ‎11-30-2018

I have two tables. test1 and test2 hive> select * from test1; OK id year 1 2017 2 2017 hive> select * from test2; OK no year 2 2017 3 2017 query : select id, year from test1 where id > 1 union all select no, year from test2 where no > 1 question 1: if i put the above query in a view and can I pass a parameter to it to use in the where clause (for id and no) ? question 2 : can i frame the above query without the union all appreciate the feedback.

vtpcnk · ‎11-16-2018

ok, this can be done simply as : partitioned by (yr string, mth string). tks.

vtpcnk · ‎11-16-2018

I tried this but wouldn't work : create table test_part_bkt_tbl (id string, cd string, dttm string) partitioned by (yr string) clustered by (month(dttm)) into 12 buckets;

vtpcnk · ‎11-16-2018

if i partition a table by year - can i further bucket it by month? so the idea is the year will be the top level and months will be at a level beneath it. so the directory structure would be : 2018 -> 1, 2, 3 ... 12 2019 -> 1, 2, 3 ... 12 Is this what bucketing is about? Or should i be doing this someway with partitions itself? Appreciate the insights.

vtpcnk · ‎11-15-2018

actually I am working with cloudera now and i dont see hive.exec.parallel as a configurable option in cloudera manager.

vtpcnk · ‎11-15-2018

The below hive performance parameter - is it usually set within a map reduce program to be set at the time of execution : SET hive.exec.parallel=true Or can it be set at the global level in Ambari? Appreciate the feedback.

vtpcnk · ‎11-14-2018

i have a table with two string columns and one datetime column (which is also defined as string datatype). I want to partition the table on monthly basis ie month(the datetime column). So I did the below : create table test_part_tbl (id string, cd string, dttm string) partitioned by (mth string); insert into test_part_tbl partition(mth) select id, cd, create_dt, month(create_dt) from real_table; hive> select * from test_part_tbl ; OK test_part_tbl.id test_part_tbl.cd test_part_tbl.dttm test_part_tbl.mth id1 cd1 2018-10-24 10 id2 cd1 2018-10-24 10 Time taken: 0.13 seconds, Fetched: 2 row(s) So the month ie "10" is actually appearing as part of the table data. Is that correct? Is it possible to partition the table as above and not have the partition column/value as part of the table data? ie when querying can't I use "month(dttm)" and search based on month? Appreciate the insights.

vtpcnk · ‎11-05-2018

I have below query : select c1 as "col1", c2 as "col2", c3 as "col3", c4 as "col4", "xxx" as "col5" from tbl1 "xxx" represents a static value to be shown in the query for all resultant rows with a column name of "col5". But hive is giving error. How do i correctly code this? BTW we need this to use in a "union" clause with tables with differing columns. Appreciate the feedback.

vtpcnk · ‎09-13-2018

@Mahesh Balakrishnan @Naresh P R @Shu can i have feedback on how the data should be formatted to be loaded (load inpath command) into a table created as : CREATE EXTERNAL TABLE staging3 ROW FORMAT SERDE 'org.apache.hadoop.hive.serd2.avro.AvroSerDe' STORED AS INPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerInputFormat' OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.avro.AvroContainerOutputFormat' LOCATION '/tmp/staging' TBLPROPERTIES ('avro.schema.url'='hdfs:///tmp/avroschemas/testtbl.json'); The schema is the same as described earlier. But there is no delimiter specified. Appreciate if you could provide a sample piece of data for this.

Online	Offline
Last Visited	‎05-05-2021 09:04 AM

Member Since	‎11-24-2015 05:10 AM
Last Visited	‎05-05-2021 09:04 AM
Posts	223
Kudos received	10

Cloudera Community

Re: hive view parameter and union all

hive view parameter and union all

Re: Hive partition - partition column as part of t...

Re: Hive partition - partition column as part of t...

Re: Hive partition - partition column as part of t...

Re: Hive partition - partition column as part of t...

Re: Hive partition - partition column as part of t...

Hive partition - partition column as part of the d...

Hive HQL - static columns and values

Re: How to insert data into this table