Member since: 05-16-2016
Posts: 785
Kudos Received: 114
Solutions: 39
My Accepted Solutions
| Views | Posted |
|---|---|
| 2328 | 06-12-2019 09:27 AM |
| 3579 | 05-27-2019 08:29 AM |
| 5725 | 05-27-2018 08:49 AM |
| 5243 | 05-05-2018 10:47 PM |
| 3113 | 05-05-2018 07:32 AM |
12-19-2017 07:29 PM
Hi, just run the command below in the terminal if you are using the QuickStart VM: sudo su Quickstart Sudo ...
12-14-2017 07:04 PM
1 Kudo
I don't know whether you have a custom trigger or a built-in trigger for the health test. Is the health test showing warning, critical, or bad? Either way, the test checks data locality on the host: "Make sure that Impala Daemon is co-located with a DataNode, and that the IP address of each Impala Daemon matches the IP address of its co-located DataNode."
Please make sure you have enabled the properties below in hdfs-site.xml:
<property>
  <name>dfs.client.read.shortcircuit</name>
  <value>true</value>
</property>
<property>
  <name>dfs.domain.socket.path</name>
  <value>/var/run/hdfs-sockets/dn</value>
</property>
<property>
  <name>dfs.client.file-block-storage-locations.timeout.millis</name>
  <value>10000</value>
</property>
Reference: https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_config_performance.html#config_performance
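If you want to check these settings on a host without opening the file by hand, something like the following could work. It is only a minimal sketch: the function names are made up for illustration, and the expected values are simply the ones quoted in the post above (your socket path may legitimately differ, so treat a mismatch as something to look at, not as an error).

```python
import xml.etree.ElementTree as ET

# Property names and values as quoted in the post above.
EXPECTED = {
    "dfs.client.read.shortcircuit": "true",
    "dfs.domain.socket.path": "/var/run/hdfs-sockets/dn",
    "dfs.client.file-block-storage-locations.timeout.millis": "10000",
}


def read_properties(root):
    """Return {name: value} for every <property> under a Hadoop-style config root."""
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def find_mismatches(root, expected=EXPECTED):
    """Return {name: (found_or_None, wanted)} for settings that are missing or different."""
    actual = read_properties(root)
    return {
        name: (actual.get(name), wanted)
        for name, wanted in expected.items()
        if actual.get(name) != wanted
    }


def check_text(xml_text):
    """Check a config passed in as a string."""
    return find_mismatches(ET.fromstring(xml_text))


def check_file(path):
    """Check a config file on disk, e.g. the hdfs-site.xml used by the Impala daemon."""
    return find_mismatches(ET.parse(path).getroot())
```

Calling `check_file` with the path to the hdfs-site.xml on the Impala host returns an empty dict when all three properties match, and otherwise lists what was found next to what was expected.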
12-13-2017 10:56 PM
@medloh Any time, mate.
12-13-2017 10:35 PM
@NguyenBac I am glad that it worked.
12-07-2017 09:33 PM
@AlexMulti Deletes can only be performed on tables that support ACID. Moreover, your table's file format should be ORC. Below are the properties that need to be set:
hive.support.concurrency true (default is false)
hive.enforce.bucketing true (default is false) (not required as of Hive 2.0)
hive.exec.dynamic.partition.mode nonstrict (default is strict)
Also, I would suggest you take a look at Kudu.
Reference: https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions#HiveTransactions-NewConfigurationParametersforTransactions
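To make this concrete, here is a rough sketch that prints the HiveQL for a bucketed, transactional ORC table followed by a delete. The helper name, table name, and columns are hypothetical and only there for illustration. It covers just the three settings listed above; the referenced wiki page lists further transaction settings (for example hive.txn.manager and the compactor options) that are not shown here.

```python
# Session settings listed in the post above.
ACID_SETTINGS = {
    "hive.support.concurrency": "true",
    "hive.enforce.bucketing": "true",  # not required as of Hive 2.0
    "hive.exec.dynamic.partition.mode": "nonstrict",
}


def acid_table_script(table, columns, bucket_column, buckets=4, delete_where=None):
    """Build HiveQL that creates a transactional ORC table and optionally deletes rows.

    `columns` is a list of (name, hive_type) pairs. All names are placeholders.
    """
    column_list = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    lines = [f"SET {key}={value};" for key, value in ACID_SETTINGS.items()]
    lines.append(
        f"CREATE TABLE {table} ({column_list}) "
        f"CLUSTERED BY ({bucket_column}) INTO {buckets} BUCKETS "
        "STORED AS ORC TBLPROPERTIES ('transactional'='true');"
    )
    if delete_where:
        lines.append(f"DELETE FROM {table} WHERE {delete_where};")
    return "\n".join(lines)


# Hypothetical table, purely for illustration.
print(acid_table_script("events", [("id", "INT"), ("name", "STRING")], "id",
                        delete_where="id = 1"))
```

The generated statements can be pasted into Beeline. The point is only to show the three pieces together: the session settings, the ORC plus 'transactional'='true' table definition, and the DELETE that depends on them.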
12-07-2017 07:32 AM
Check whether there are multiple supervisor processes running on that host using the command below:
ps aux | grep supervisor
If so, you will need to kill them, then start the cloudera-scm-agent and start the Hive Metastore.
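If you would rather script that check, a small sketch along these lines could do it. It assumes the usual Linux `ps aux` layout with eleven columns and the command in the last one; the function names are made up for illustration.

```python
import subprocess


def supervisor_processes(ps_output):
    """Return (pid, command) pairs for `ps aux` lines whose command mentions supervisor."""
    matches = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 10)        # the 11th field is the full command line
        if len(fields) < 11:
            continue
        command = fields[10]
        # Ignore the grep process itself, as you would when reading the output by eye.
        if "supervisor" in command and "grep" not in command:
            matches.append((int(fields[1]), command))
    return matches


def list_supervisors():
    """Run `ps aux` on this host and return the matching processes."""
    output = subprocess.run(
        ["ps", "aux"], capture_output=True, text=True, check=True
    ).stdout
    return supervisor_processes(output)
```

If `list_supervisors()` returns more than one entry, that is the situation described above: more than one supervisor process on the host.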
12-05-2017 08:30 PM
Based on the response, I would go for Parquet with Snappy compression, which remains splittable. Moreover:
Hive - default compression is DeflateCodec
Impala - default compression is Snappy
Here is a link to my response to a similar post in the community that will give you a little more info: http://community.cloudera.com/t5/Batch-SQL-Apache-Hive/Parquet-table-snappy-compressed-by-default/m-p/51914#M1822
Please let me know if that helps.
12-05-2017 09:05 AM
"Can anyone please suggest where I should start?"
Well, I started off with the Apache Hadoop binary on Ubuntu, then moved on to a manual installation of Cloudera Hadoop, and finally landed on Cloudera Manager. To start with, I would suggest a manual Apache Hadoop deployment and, side by side, the Cloudera QuickStart VM to explore other CDH ecosystem components such as Hive, Impala, and many more.
"Do you recommend taking training courses with Cloudera in order to eventually build a career in this area?"
That totally depends. To be frank, bringing up a cluster, I mean a single-node cluster :)), took me a good 20 days, because I come from Java/J2EE development and had to roll up my sleeves to learn VMware and Linux before getting to Hadoop. The Cloudera community is pretty active, so we have your back on any troubleshooting :))) Welcome to the Cloudera Hadoop Community.
12-05-2017 08:58 AM
Did you try running the query in the Hive shell or Beeline? Was your HiveServer2 up and running during the query execution? You may want to take a peek at the current values of these settings in your cluster:
hive.server2.session.check.interval
hive.server2.idle.operation.timeout
hive.server2.idle.session.timeout
12-05-2017 08:44 AM
Are you looking for fast compression or fast decompression? Or are you looking to minimize disk space consumption?