About csguna

csguna · ‎12-19-2017

Hi Just fire the below command in the terminal if you are using QuickStartVm sudo su Quickstart Sudo

csguna · ‎12-14-2017

I dont kow if you have a custom trigger or a built in trigger for health test. Is the health test showing warning or critical or bad ? either way the test is to find the data locality in the host . " Make sure that Impala Daemon is co-located with a DataNode, and that the IP address of each Impala Daemon matches the IP address of its co-located DataNode" Please make sure if you have enabled the below properties in hdfs-site.xml <property> <name>dfs.client.read.shortcircuit</name> <value>true</value> </property> <property> <name>dfs.domain.socket.path</name> <value>/var/run/hdfs-sockets/dn</value> </property> <property> <name>dfs.client.file-block-storage-locations.timeout.millis</name> <value>10000</value> </property> Reference https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_config_performance.html#config_performance

csguna · ‎12-13-2017

@medloh Any time mate .

csguna · ‎12-13-2017

@NguyenBac I am glad that it worked

csguna · ‎12-07-2017

@AlexMulti Deletes can only be performed on tables that support ACID. More over your tables's file format should be ORC Below are the properties that needs to be set . hive.support.concurrency true (default is false) hive.enforce.bucketing true (default is false) (Not required as of Hive 2.0) hive.exec.dynamic.partition.mode nonstrict (default is strict) Also i would suggest you to take look into kudu. Reference https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions#HiveTransactions-NewConfigurationParametersforTransactions

csguna · ‎12-07-2017

check if there are multiple supervisor runining on that host using the below command . ps aux |grep supervisor if so you will need to kill it and then start the cloudera -scm agent and kick off the hive metastore .

csguna · ‎12-05-2017

Based on the response , I would go for Parquet with splittable Snappy . more over Hive - default Compression is DeflateCodec Impala - default Compression is Snappy i would put a link of my response to a similar post in the community that will give you little more info. http://community.cloudera.com/t5/Batch-SQL-Apache-Hive/Parquet-table-snappy-compressed-by-default/m-p/51914#M1822 please let me know if that helps

csguna · ‎12-05-2017

Can anyone please suggest me where should I start? Well I started it off with Apache Hadoop binary on Ubuntu Then moved on to manual installation of Cloudera hadoop finally landed in Cloudera manager . to start with I would suggest you from apache hadoop manual deployment and then side by side Cloudera Quickstart to explore other cdh eco systesm like hive , impala , many more . Do you recommend taking training courses with cloud era in order to eventually build an career in this area? That totally depends on to be frank to bring up a cluster i mean single node cluster :)) it took me a good20 days because i am from Java J2ee development guy had to roll up my sleeves to vmare , Linux Os then hang on to hadoop. Cloudera community is pretty active so we have your back on any troubleshooting :))) Welcome to the Cloudera Hadoop Community .

csguna · ‎12-05-2017

did you try runining the query in the hive shell or beeline ? was your hiveserver2 up and runining during the query execution time ? you may want to take peek in this current settings in your cluster hive.server2.session.check.interval hive.server2.idle.operation.timeout hive.server2.idle.session.timeout

csguna · ‎12-05-2017

are you looking for fast compression or decompression ? are you looking for disk space consumption ?

Online	Offline
Last Visited	‎10-28-2024 06:24 AM

Member Since	‎05-16-2016 09:33 PM
Last Visited	‎10-28-2024 06:24 AM
Posts	785
Kudos received	112

Cloudera Community

Re: Kerberos / Sentry Integration

Re: How to upgrade Hive from 2.1 to 3.0 via CDH 6....

Re: How does nameservice id works for HA, how does...

Re: What license does the express edition fall und...

Re: Sqoop2 over Sqoop1 in CDH6

Re: Unable to login as root - setuid not 0, permi...

Re: Impala Assignment locality concerning

Re: Recommended file size for Impala Parquet files...

Re: Status: Waiting for catalog update from the St...

Re: Update and Delete are not working in Hive ?

Re: Hive Metastore fails to start for a newly inst...

Re: Recommended file size for Impala Parquet files...

Re: Career Change! Advice please :-)

Re: Getting connection reset exception when the hi...

Re: Recommended file size for Impala Parquet files...