About nramanaiah

nramanaiah · ‎09-12-2018

I assume raw data is in text & u want to convert & load the data into avro tables. If so, u can create another identical text table & specifiy the delimiter in data.. i.e., create table staging(id struct<tid:string,action:string,createdts:timestamp>, cid string, anumber string) row format delimited fields terminated by ',' collection items terminated by '|' stored as textfile; sample text data can be as below 1|success|150987428888,3,12345 insert into testtbl select * from staging; If kafka or flume is generating avro files directly, then those files can be written into table path directly. Its better to create external table if source files are written directly on table path.

nramanaiah · ‎09-12-2018

Its not possible to use functions in insert into table values statement.

nramanaiah · ‎09-12-2018

Try below insert statement 0: jdbc:hive2://abcd:10000> with t as (select NAMED_STRUCT('tid','1','action','success', 'createdts',current_timestamp) as id ,'1' as cid,'12345' as anumber) 0: jdbc:hive2://abcd:10000> insert into testtbl select * from t; No rows affected (20.464 seconds) 0: jdbc:hive2://abcd:10000> select * from testtbl; +-----------------------------------------------------------------------+--------------+------------------+--+ | testtbl.id | testtbl.cid | testtbl.anumber | +-----------------------------------------------------------------------+--------------+------------------+--+ | {"tid":"1","action":"success","createdts":"2018-09-12 15:06:27.075"} | 1 | 12345 | +-----------------------------------------------------------------------+--------------+------------------+--+

nramanaiah · ‎09-11-2018

Can you check what is the value for tez.runtime.unordered.output.buffer.size-mb ? I think its configured to an higher value.

nramanaiah · ‎08-17-2018

I am thinking of solution for the jira.. This needs to be implemented in code. There is no config to do this for now.

nramanaiah · ‎08-17-2018

Concatenation depends on which files are chosen first. The ordering of the files not deterministic with CombineHiveInputFormat, since grouping happens at hadoop layer Concatenation will split or combine files based on orc file size > or < maxSplitSize. for eg., say if you have 5 files.. 64MB, 64MB, 64MB, 64MB, 512MB & mapreduce.input.fileinputformat.split.minsize=256mb this can result in 2 files 256MB, 512MB.. or it may result in 3 files 256MB, 256MB, 256MB. I raised a jira for the same Easy solution for this would be to add a path filter to skip files > maxSplitSize.

nramanaiah · ‎08-16-2018

In beeline or cli, after creating table, u can either do show create or describe to know the table path in hdfs. After exiting from beeline or cli, u can use below command to see the table folder & files inside it hadoop fs -ls -R <tablePath>

nramanaiah · ‎08-01-2018

This validation is intentionally added in spark with SPARK-15279. As it doesn't make sense to provide DELIMITERS for ORC | PARQUET files.

nramanaiah · ‎07-30-2018

After create external table with location, can you run "msck repair table data" ? It should automatically update partition information from folder path to hive metadata.

nramanaiah · ‎05-28-2018

@cskbhatt, i assume external table location is "hdfs://<emr node>:8020/poc/test_table/" This issue is happening because hdfs://<emr node>:8020/poc/test_table/.metadata/descriptor.properties is not a Parquet file, but exist inside table folder. When Hive ParquetRecordReader tries to read this file, its throwing above exception. Remove all non parquet files from table location & retry your query.

Online	Offline
Last Visited	‎11-22-2024 12:30 PM

Member Since	‎08-18-2017 06:55 AM
Last Visited	‎11-22-2024 12:30 PM
Posts	145
Kudos received	19

Cloudera Community

Re: Hive query to check mathematical values from m...

Re: Partition Retention Period not working on Hive...

Re: Parsing multiple pipe delimited columns into r...

Re: Hive CLI session warnings

Re: question about clustered only table

Re: How to insert data into this table

Re: How to insert data into this table

Re: How to insert data into this table

Re: hive insert query is failing and giving the be...

Re: Hive:Partitions:Small Files:Concatenate

Re: Hive:Partitions:Small Files:Concatenate

Re: I have created a table in hive. What is the co...

Re: Why Row Format Delimited does not work with Sp...

Re: Adding a partition to an external Hive table w...

Re: Reading data from Hive External Table on Parq...