About ssubhas

ssubhas · ‎06-23-2016

@rahul jain You can use Hive view from Ambari and run queries on the Hive table. As first step, hive table needs to be created on top the HDFS file. Thanks and Regards, Sindhu

ssubhas · ‎06-22-2016

@alain TSAFACK You can load the data from csv file to a temp hive table with same structure as orc table, then insert the data into orc table as: insert into table table_orc as select * from table_textfile; Thanks and Regards, Sindhu

ssubhas · ‎06-21-2016

@Michel Sumbul CBO is mainly for optimization decisions which reduces the cost of query execution and is independent of storage formats like ORC. Below is some of decisions based on CBO: How to order Join What algorithm to use for a given Join Should the intermediate result be persisted or should it be recomputed on operator failure. The degree of parallelism at any operator (specifically number of reducers to use). Semi Join selection For details, please refer to below link: https://cwiki.apache.org/confluence/display/Hive/Cost-based+optimization+in+Hive Thanks and Regards, Sindhu

ssubhas · ‎06-21-2016

@ARUNKUMAR RAMASAMY MySQL might be rejecting connections to the extract the data from the tables from remote host. We need to grant privileges for the IP's of the data nodes at the database end as below: GRANTALL PRIVILEGES ON*.*TO'user'@'ipadress' Thanks and Regards, Sindhu

ssubhas · ‎06-21-2016

@ARUNKUMAR RAMASAMY The communication between the datanodes and mysql needs to be open. Make sure telnet <mysql_server> <port> works on all the nodes in the cluster. Also, need to verify the bind address at mysql end to verify the connectivity. You can refer to below link for more debugging at mysql end: http://stackoverflow.com/questions/2121829/com-mysql-jdbc-exceptions-jdbc4-communicationsexceptioncommunications-link-fail Hope this helps. Thanks and Regards, Sindhu

ssubhas · ‎06-20-2016

@Pradeep Bhadani Run mysql manual install on the machine itself as: rpm -e --nodeps mysql-libs Hope this helps. Thanks and Regards, Sindhu

ssubhas · ‎06-20-2016

Could you please share the steps that resolved the issue and mark as best answer? Thanks, Sindhu

ssubhas · ‎06-17-2016

@Simran Kaur Seems like TotalRecords is a keyword. Try using TotalRecords_1 and see if it helps. Thanks and Regards, Sindhu

ssubhas · ‎06-16-2016

@khushi kalra You can also use RJDBC as below to connect to Hive: library("DBI") library("rJava") library("RJDBC") hive.class.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T); hadoop.lib.path = list.files(path=c("/usr/hdp/current/hive-client/lib"), pattern="jar", full.names=T); hadoop.class.path = list.files(path=c("/usr/hdp/2.4.0.0-169/hadoop"), pattern="jar", full.names=T); cp = c(hive.class.path, hadoop.lib.path, hadoop.class.path, "/usr/hdp/2.4.0.0-169/hadoop-mapreduce/hadoop-mapreduce-client-core.jar") .jinit(classpath=cp) drv <- JDBC("org.apache.hive.jdbc.HiveDriver","hive-jdbc.jar",identifier.quote="`") url.dbc <- paste0("jdbc:hive2://ironhide.hdp.local:10000/default"); conn <- dbConnect(drv, url.dbc, "hive", “redhat"); log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.Shell). log4j:WARN Please initialize the log4j system properly. log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info. dbListTables(conn); Thanks and Regards, Sindhu

ssubhas · ‎06-16-2016

@Roberto Sancho Please refer to below Hortonworks blog with inputs to improve Hive query performance: http://hortonworks.com/blog/5-ways-make-hive-queries-run-faster/ Hope this helps. Thanks and Regards, Sindhu

Online	Offline
Last Visited	‎06-08-2020 12:38 PM

Member Since	‎04-11-2016 05:12 AM
Last Visited	‎06-08-2020 12:38 PM
Posts	535
Kudos received	147

Cloudera Community

Re: What does --m 1 represent in sqoop import sta...

Re: HDP-2.6.4.0 - Superset startup failes with err...

Re: Unable to import data from Informix non transa...

Re: We have an AWS cluster setup for HCP Metron a...

Re: HDP support for Sqoop 2.x ?

Re: Can we take user input in hive

Re: import csv data into hive table orc format

Re: CBO for Hive over hbase

Re: Sqoop import

Re: Sqoop import

Re: Mysql as Hive metastore installation failed on...

Re: Error while compiling statement: FAILED: Seman...

Re: Error while compiling statement: FAILED: Seman...

Re: Please if anyone can give me good examples of ...

Re: hive very big table