Member since: 05-16-2016
Posts: 785
Kudos Received: 114
Solutions: 39

My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 2328 | 06-12-2019 09:27 AM |
| | 3579 | 05-27-2019 08:29 AM |
| | 5728 | 05-27-2018 08:49 AM |
| | 5244 | 05-05-2018 10:47 PM |
| | 3113 | 05-05-2018 07:32 AM |
09-11-2017
04:49 AM
1 Kudo
@syamsri Could you please let me know the file format you are using for the Hive table (testTableNew)? Hive supports DELETE and UPDATE only on the ORC format, starting from Hive 0.14. Try creating the table as ORC; if you want more flexibility, try Apache Kudu, but it has its own merits and demerits. Hope this helps.

```sql
CREATE TABLE Sample (
  id int,
  name string
)
CLUSTERED BY (id) INTO 2 BUCKETS STORED AS ORC
TBLPROPERTIES ("transactional"="true",
  "compactor.mapreduce.map.memory.mb"="2048",
  "compactorthreshold.hive.compactor.delta.num.threshold"="4",
  "compactorthreshold.hive.compactor.delta.pct.threshold"="0.5"
);
```
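Once the table is ORC and transactional, row-level DML should work; a minimal sketch (the rows and values are illustrative, and it assumes the cluster-side transaction settings are already enabled):

```sql
-- Illustrative rows, not from the original thread
INSERT INTO Sample VALUES (1, 'alice'), (2, 'bob');

-- Row-level DML is allowed because the table is ORC and transactional
UPDATE Sample SET name = 'carol' WHERE id = 2;
DELETE FROM Sample WHERE id = 1;
```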
09-09-2017
09:06 PM
1 Kudo
@manuspark3 @SupriyaPS If scale is not specified, it defaults to 0. If no precision is supplied, it defaults to 10. So yes, DECIMAL works without a precision.
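A quick sketch of those defaults (the table and column names are illustrative):

```sql
-- DECIMAL with nothing specified behaves as DECIMAL(10,0)
CREATE TABLE decimal_default_demo (amount DECIMAL);

-- Equivalent explicit declaration
CREATE TABLE decimal_explicit_demo (amount DECIMAL(10,0));
```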
09-08-2017
09:24 AM
Whenever we use ORC we go with STRING, since vectorization gives it an extra kick. Try the query below; sometimes copy/paste causes issues because of extra whitespace that comes along, so type it out if you can.

Method 1:

```sql
set hive.enforce.bucketing = true;

CREATE TABLE TEST (id DECIMAL, name STRING)
CLUSTERED BY (id) INTO 2 BUCKETS
STORED AS ORC
TBLPROPERTIES ("transactional"="true");
```

Note - I did not test this query in my local VM; please let me know if this helps.
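If you want to double-check that vectorized execution is actually on, a quick sketch (these are standard Hive settings, not something from the original thread):

```sql
-- Vectorized execution works on ORC-backed tables
set hive.vectorized.execution.enabled = true;
set hive.vectorized.execution.reduce.enabled = true;
```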
09-06-2017
01:08 AM
1 Kudo
If you have the Hive shell or Beeline, you can execute the same code; nothing is different. Or you can try the Hue web UI to export the Hive results as .csv, although Hue is not that good at downloading big tables.
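A minimal sketch of a Beeline CSV export (the JDBC URL, query, and file name are placeholders):

```sh
# Run a query through Beeline and capture the output as CSV
beeline -u jdbc:hive2://localhost:10000 \
  --outputformat=csv2 --silent=true \
  -e "SELECT * FROM my_table" > results.csv
```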
09-04-2017
05:39 PM
I am glad. Did you fix your IP/hostname in the /etc/hosts file?
09-03-2017
02:36 AM
1 Kudo
Apache NiFi is more of a data routing and transformation tool that sits outside the Hadoop cluster; it has its own JVM but needs the Hadoop configuration if you are trying to push data to HDFS. It has many built-in processors that come in handy for pushing data quickly, and the NiFi community is fairly large.

StreamSets is similar; you can deploy it inside the Hadoop cluster and manage it using Cloudera Manager.

Apache Kafka is again a streaming platform, more for real-time data just like NiFi, and useful if you want to do any aggregation on the fly.

Apache Flume is also a distributed platform built around the source - channel - sink concept; we mostly use it to push logs to HDFS or HBase. It has many built-in sources and sinks, and you can create custom ones, just like NiFi processors.

If you are looking for an alternative to Apache NiFi, you can use Apache Flume, Apache Kafka, or StreamSets. Each one has its own use case, though.

Cloudera Navigator is used for data management and security under one hood for the Hadoop platform.
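To make the source - channel - sink idea concrete, a minimal sketch of a Flume agent config (the agent name, log path, and HDFS URL are illustrative):

```properties
# Agent "a1": tail an application log and push the events to HDFS
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# Source: read lines appended to a log file (path is illustrative)
a1.sources.r1.type = exec
a1.sources.r1.command = tail -F /var/log/app/app.log
a1.sources.r1.channels = c1

# Channel: buffer events in memory between source and sink
a1.channels.c1.type = memory
a1.channels.c1.capacity = 10000

# Sink: write the buffered events to HDFS (URL is illustrative)
a1.sinks.k1.type = hdfs
a1.sinks.k1.hdfs.path = hdfs://namenode:8020/flume/logs
a1.sinks.k1.channel = c1
```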
09-02-2017
06:47 PM
Change your hosts file / hostname:

```
192.168.200.21  server1

# The following lines are desirable for IPv6 capable hosts
::1     ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
```

Also change the hostname; I am not sure of the path in Ubuntu, but it should be under /etc/hostname -> server1. Restart the network, and do an echo $HOSTNAME to see if the change is reflected. Finally, restart all the daemons. Let me know if that helps.
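A minimal sketch of the Ubuntu side (assuming a systemd-based release; server1 is the target name from the post):

```sh
# Persist the new hostname (Ubuntu keeps it in /etc/hostname)
echo "server1" | sudo tee /etc/hostname

# Apply it to the running system without a reboot
sudo hostnamectl set-hostname server1

# Verify that the change is reflected
hostname
echo $HOSTNAME
```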
09-01-2017
03:45 AM
What's your CDH / Hive version? Could you let me know whether you have set the properties below? If not, please set them and let me know if that helps.

```sql
set hive.exec.reducers.bytes.per.reducer=<number>;
set hive.exec.reducers.max=<number>;
set mapreduce.job.reduces=<number>;
```
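For illustration only, a sketch with placeholder values (the numbers are not recommendations from the original post):

```sql
-- Aim for roughly 256 MB of input per reducer (illustrative value)
set hive.exec.reducers.bytes.per.reducer=256000000;
-- Cap how many reducers the planner may choose (illustrative value)
set hive.exec.reducers.max=100;
-- Or force an exact reducer count; -1 lets Hive decide
set mapreduce.job.reduces=-1;
```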
08-31-2017
05:54 AM
Could you please share the logs using the "Insert code" tool in the toolbar, beside the bold, italic, underline, and spoiler tags in the reply text area.
08-30-2017
01:54 AM
@josholsan Sure thing 🙂