Member since: 05-02-2017
Posts: 360
Kudos Received: 65
Solutions: 22
My Accepted Solutions
Title | Views | Posted
---|---|---
 | 13351 | 02-20-2018 12:33 PM
 | 1507 | 02-19-2018 05:12 AM
 | 1862 | 12-28-2017 06:13 AM
 | 7141 | 09-28-2017 09:25 AM
 | 12181 | 09-25-2017 11:19 AM
09-15-2017
04:53 PM
@Naveen Dabas It should work, actually. Try removing the '`' and executing it. Otherwise, the other option is to drop the table and re-create it; as it is an external table, this won't affect the data. Hope it helps!!
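A minimal sketch of the drop-and-recreate approach, assuming a hypothetical external table named my_ext_table with placeholder columns and a placeholder HDFS location (adjust these to match your own DDL):
DROP TABLE IF EXISTS my_ext_table;  -- only removes the metadata; the files stay because the table is EXTERNAL
CREATE EXTERNAL TABLE my_ext_table (
  id INT,
  name STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/data/my_ext_table';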
09-15-2017
08:21 AM
1 Kudo
@kenny creed You are using regexp_replace in Spark, which returns a string datatype. In Spark you have to use cast to convert it. Below is an example which might help solve your problem (it assumes the usual import org.apache.spark.sql.functions._, org.apache.spark.sql.types.TimestampType and spark.implicits._ imports). Hope it helps! val res = df.select($"id", $"date", unix_timestamp($"date", "yyyy/MM/dd HH:mm:ss").cast(TimestampType).as("timestamp"), current_timestamp(), current_date())
09-15-2017
06:51 AM
Hope it helps! If so, please accept it as the best answer!
09-15-2017
06:50 AM
1 Kudo
@I1095 Check this blog. It has a detailed comparison between ORC and Parquet. Other than that, there is very little difference in terms of use case, but I believe most future improvements are being developed around ORC:
1. Many of the performance improvements provided in the Stinger initiative depend on features of the ORC format, including a block-level index for each column. This leads to potentially more efficient I/O, allowing Hive to skip reading entire blocks of data if it determines that predicate values are not present there. The Cost Based Optimizer can also consider the column-level metadata present in ORC files in order to generate the most efficient query plan.
2. ACID transactions are only possible when using ORC as the file format.
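To illustrate point 2, a minimal sketch of an ACID-enabled table, which must be stored as ORC; the table name, columns and bucket count are hypothetical, and it assumes ACID support is enabled on the cluster:
CREATE TABLE orders_acid (
  order_id INT,
  amount DOUBLE
)
CLUSTERED BY (order_id) INTO 4 BUCKETS  -- ACID tables must be bucketed on these Hive versions
STORED AS ORC
TBLPROPERTIES ('transactional'='true');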
09-15-2017
03:59 AM
@n c No, we can't get something similar to the DDL of a table for a database. But we can use DESCRIBE DATABASE to see its other properties. Hope it helps!
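For example (the database name is a placeholder):
DESCRIBE DATABASE EXTENDED my_db;  -- shows location, owner and dbproperties, but not a re-runnable CREATE DATABASE statement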
09-14-2017
11:13 AM
Hi @n c You can use the INSERT OVERWRITE LOCAL DIRECTORY command in Hive to export the data in the desired format, and use distcp to copy the files, or even the complete database in Hive (which means all the files created under each table in the database), to the second cluster. Once the files are moved to the new cluster, take the DDL from the previous cluster and create the Hive tables. Once that is done, you can either insert or copy the files into the Hive tables in the new cluster. Hope it helps!!
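A minimal sketch of the export step, assuming a hypothetical table my_db.my_table, a placeholder export path, and placeholder NameNode addresses in the distcp alternative:
INSERT OVERWRITE LOCAL DIRECTORY '/tmp/export/my_table'
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
SELECT * FROM my_db.my_table;
-- alternatively, copy the warehouse files directly between clusters with distcp:
-- hadoop distcp hdfs://source-nn:8020/apps/hive/warehouse/my_db.db hdfs://target-nn:8020/apps/hive/warehouse/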
09-14-2017
11:06 AM
1 Kudo
Hi @Harjinder Brar concat('{', u.swid, '}') will concatenate braces with the value from u.swid. For example, if the value of u.swid is TEST, it will be converted to {TEST}, which is then used to join with the o.swid column. Hope it helps!!
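For context, a minimal sketch of how that expression is typically used in the join; apart from the swid columns and the o/u aliases, the table and column names are hypothetical:
SELECT o.url, u.user_name
FROM omniture o
JOIN users u
  ON o.swid = concat('{', u.swid, '}');  -- wraps u.swid in braces so it matches the format stored in o.swid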
09-13-2017
05:43 AM
Are there any certifications available from Hortonworks for a Big Data/Hadoop Architect? If so, could someone help me with links to more information about them?
Labels:
- Hortonworks Data Platform (HDP)
09-12-2017
09:47 AM
1 Kudo
Hi @Vijay Parmar Apart from the concatenate option in Hive mentioned by @Steven O'Neill, try the options below. Which merge property you set first depends on the execution engine, and you can also modify the target file size. With these options the small files are merged as new data is written: they will not rewrite the files already in the target table, but they will solve the problem going forward if small files keep being created.
set hive.merge.tezfiles=true; -- notify Hive that a merge step is required (Tez engine; use hive.merge.mapredfiles for MapReduce)
set hive.merge.smallfiles.avgsize=128000000; -- 128 MB
set hive.merge.size.per.task=128000000; -- 128 MB
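For reference, the concatenate option mentioned above is issued as an ALTER TABLE statement and merges the small files already present in an ORC table; the table name and partition below are hypothetical:
ALTER TABLE sales PARTITION (dt='2017-09-01') CONCATENATE;  -- merge small ORC files within this partition
-- for a non-partitioned table:
-- ALTER TABLE sales CONCATENATE;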